An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets.

Zeggini E.; Rayner W.; Morris AP.; Hattersley AT.; Walker M.; Hitman GA.; Deloukas P.; Cardon LR.; McCarthy MI.

A substantial investment has been made in the generation of large public resources designed to enable the identification of tag SNP sets, but data establishing the adequacy of the sample sizes used are limited. Using large-scale empirical and simulated data sets, we found that the sample sizes used in the HapMap project are sufficient to capture common variation, but that performance declines substantially for variants with minor allele frequencies of <5%.

Original publication

DOI

10.1038/ng1670

Type

Journal article

Journal

Nat Genet

Publication Date

12/2005

Volume

Pages

1320 - 1322

Keywords

Chromosome Mapping, Databases, Nucleic Acid, Diabetes Mellitus, Type 2, Gene Frequency, Genetic Predisposition to Disease, Genome, Human, Humans, Linkage Disequilibrium, Polymorphism, Single Nucleotide, Sample Size
