Abstract

We have assessed the numbers of potentially deleterious variants in the genomes of apparently healthy humans by using (1) low-coverage whole-genome sequence data from 179 individuals in the 1000 Genomes Pilot Project and (2) current predictions and databases of deleterious variants. Each individual carried 281-515 missense substitutions predicted to be highly damaging, 40-85 of which were homozygous. They also carried 40-110 variants classified by the Human Gene Mutation Database (HGMD) as disease-causing mutations (DMs), 3-24 of them in the homozygous state, and many polymorphisms putatively associated with disease. Whereas many of these DMs are likely to represent disease-allele-annotation errors, between 0 and 8 DMs (0-1 homozygous) per individual are predicted to be highly damaging, and some of them provide information of medical relevance. These analyses emphasize the need for improved annotation of disease alleles both in mutation databases and in the primary literature; some HGMD mutation data have been recategorized on the basis of the present findings, an iterative process that is both necessary and ongoing. Our estimates of deleterious-allele numbers are likely to be subject to both overcounting and undercounting. However, our current best mean estimates of ~400 damaging variants and ~2 bona fide disease mutations per individual are likely to increase rather than decrease as sequencing studies ascertain rare variants more effectively and as additional disease alleles are discovered.

Keywords

Allele, Missense mutation, Genetics, Biology, Disease, 1000 Genomes Project, Genome, Mutation, Population, Database, Gene, Computational biology, Genotype, Medicine, Single-nucleotide polymorphism, Computer science

Publication Info

Year: 2012
Type: article
Volume: 91
Issue: 6
Pages: 1022-1032
Citations: 277
Access: Closed

Cite This

Yali Xue, Yuan Chen, Qasim Ayub et al. (2012). Deleterious- and Disease-Allele Prevalence in Healthy Individuals: Insights from Current Predictions, Mutation Databases, and Population-Scale Resequencing. The American Journal of Human Genetics, 91(6), 1022-1032. https://doi.org/10.1016/j.ajhg.2012.10.015

Identifiers

DOI: 10.1016/j.ajhg.2012.10.015