Data quality control in genetic case-control association studies.

Anderson CA.; Pettersson FH.; Clarke GM.; Cardon LR.; Morris AP.; Zondervan KT.

Data quality control in genetic case-control association studies.

Anderson CA., Pettersson FH., Clarke GM., Cardon LR., Morris AP., Zondervan KT.

This protocol details the steps for data quality assessment and control that are typically carried out during case-control association studies. The steps described involve the identification and removal of DNA samples and markers that introduce bias. These critical steps are paramount to the success of a case-control study and are necessary before statistically testing for association. We describe how to use PLINK, a tool for handling SNP data, to perform assessments of failure rate per individual and per SNP and to assess the degree of relatedness between individuals. We also detail other quality-control procedures, including the use of SMARTPCA software for the identification of ancestral outliers. These platforms were selected because they are user-friendly, widely used and computationally efficient. Steps needed to detect and establish a disease association using case-control data are not discussed here. Issues concerning study design and marker selection in case-control studies have been discussed in our earlier protocols. This protocol, which is routinely used in our labs, should take approximately 8 h to complete.

Original publication

DOI

10.1038/nprot.2010.116

Type

Journal article

Journal

Nat Protoc

Publication Date

09/2010

Volume

Pages

1564 - 1573

Keywords

Animals, Case-Control Studies, Computational Biology, Female, Genetic Techniques, Genome-Wide Association Study, Humans, Male, Quality Control

Cookies on this website

Data quality control in genetic case-control association studies.

Anderson CA., Pettersson FH., Clarke GM., Cardon LR., Morris AP., Zondervan KT.

DOI

Type

Journal

Publication Date

Volume

Pages

Keywords