The supplemental materials for:

HapBlock - The dynamic programming algorithms for Haplotype Block Partitioning and Tag SNPs selection by Haplotype Data and Genotype Data


Programs

Our algorithms have been implemented in a C++ program. The executable files are listed here:

New update: the HapBlock program can now handle genotype data.

Attention: This program is free for academic use and may not be used for commercial purposes under any circumstances. Part of the program was produced in collaboration with Steve Qin and Jun Liu in the Department of Statistics at Harvard University. The source code is not provided here but may be available upon request.
Copyright © 2003 The University of Southern California. All Rights Reserved.

Test Data Sets

The haplotype data were simulated by a coalescent process with recombination, as implemented in the lab of Richard Hudson. The following data sets were used in our testing and can be used to explore our program:

Results: Blocks, Tag SNPs and Haplotype Patterns

We tested our program on the aforementioned data using a number of different parameter settings. Below, we list each parameter file, the corresponding output files, and short explanations of the parameters and results. For the contents and format of these files and the meaning of each parameter in the parameter file, please refer to our help file (PDF format).

Because the current version of the program implements only one block definition and the same data sets are used throughout, the parameter files share many common parameters. The following definition is used in all parameter files: a set of one or more consecutive SNPs is defined as a block only if the percentage of common haplotypes is more than 80%. We also set the maximum number of samples, the maximum number of SNPs, and the maximum length of a block to 100, 250, and 100, respectively.
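
The block definition above can be illustrated with a minimal Python sketch. This is not the HapBlock source code. The function names, the `min_count` parameter, and the toy data are hypothetical. In particular, the sketch assumes that a haplotype counts as "common" when it is observed at least `min_count` times in the sample (2 by default); the actual criterion is set in the parameter file and described in the help file.

```python
from collections import Counter

def common_haplotype_fraction(haplotypes, start, end, min_count=2):
    """Fraction of samples whose haplotype over SNPs [start, end) is common.

    Assumption: "common" means observed at least `min_count` times.
    """
    segments = [h[start:end] for h in haplotypes]
    if not segments:
        return 0.0
    counts = Counter(segments)
    # Count the samples (not the distinct patterns) that carry a common haplotype.
    common = sum(c for c in counts.values() if c >= min_count)
    return common / len(segments)

def is_block(haplotypes, start, end, threshold=0.80, min_count=2):
    """True if SNPs [start, end) satisfy the block definition:
    at least one SNP, and strictly more than `threshold` common haplotypes."""
    if end - start < 1:
        return False
    return common_haplotype_fraction(haplotypes, start, end, min_count) > threshold

# Toy data: 5 samples, 5 SNPs, alleles coded as '0' and '1' (illustrative only).
haplotypes = [
    "00110",
    "00110",
    "11001",
    "11001",
    "11011",
]

print(is_block(haplotypes, 0, 3))  # first three SNPs
print(is_block(haplotypes, 0, 5))  # all five SNPs
```

In this toy data, every sample carries a repeated pattern over the first three SNPs, so that interval qualifies as a block. Over all five SNPs, only four of the five samples carry a repeated pattern, which is exactly 80% and therefore fails the "more than 80%" condition.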

References


Created Date: March 20, 2003
Last Updated Date: March 25, 2003

Contact Us: