Abstract
Case-control association studies are widely used in the search for genetic variants that contribute to human diseases. It has long been known that such studies may suffer from high rates of false positives if there is unrecognized population structure. It is perhaps less widely appreciated that so-called "cryptic relatedness" (i.e., kinship among the cases or controls that is not known to the investigator) might also potentially inflate the false positive rate. Until now there has been little work to assess how serious this problem is likely to be in practice. In this paper, we develop a formal model of cryptic relatedness, and study its impact on association studies. We provide simple expressions that predict the extent of confounding due to cryptic relatedness. Surprisingly, these expressions are functions of directly observable parameters. Our analytical results show that, for well-designed studies in outbred populations, the degree of confounding due to cryptic relatedness will usually be negligible. However, in contrast, studies where there is a sampling bias toward collecting relatives may indeed suffer from excessive rates of false positives. Furthermore, cryptic relatedness may be a serious concern in founder populations that have grown rapidly and recently from a small size. As an example, we analyze the impact of excess relatedness among cases for six phenotypes measured in the Hutterite population.
Keywords
Affiliated Institutions
Related Publications
The confounding effect of cryptic relatedness for environmental risks of systolic blood pressure on cohort studies
Abstract The impact of cryptic relatedness ( CR ) on genomic association studies is well studied and known to inflate false‐positive rates as reported by several groups. In cont...
An Arabidopsis Example of Association Mapping in Structured Samples
A potentially serious disadvantage of association mapping is the fact that marker-trait associations may arise from confounding population structure as well as from linkage to c...
Genomic inflation factors under polygenic inheritance
Population structure, including population stratification and cryptic relatedness, can cause spurious associations in genome-wide association studies (GWAS). Usually, the scaled...
LD Score Regression Distinguishes Confounding from Polygenicity in Genome-Wide Association Studies
Abstract Both polygenicity 1,2 ( i.e. many small genetic effects) and confounding biases, such as cryptic relatedness and population stratification 3 , can yield inflated distri...
Discerning the Ancestry of European Americans in Genetic Association Studies
European Americans are often treated as a homogeneous group, but in fact form a structured population due to historical immigration of diverse source populations. Discerning the...
Publication Info
- Year
- 2005
- Type
- article
- Volume
- 1
- Issue
- 3
- Pages
- e32-e32
- Citations
- 233
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1371/journal.pgen.0010032