Sequence variations in the public human genome data reflect a bottlenecked population history

2002 Proceedings of the National Academy of Sciences 120 citations

Abstract

Single-nucleotide polymorphisms (SNPs) constitute the great majority of variations in the human genome, and as heritable variable landmarks they are useful markers for disease mapping and resolving population structure. Redundant coverage in overlaps of large-insert genomic clones, sequenced as part of the Human Genome Project, comprises a quarter of the genome, and it is representative in terms of base compositional and functional sequence features. We mined these regions to produce 500,000 high-confidence SNP candidates as a uniform resource for describing nucleotide diversity and its regional variation within the genome. Distributions of marker density observed at different overlap length scales under a model of recombination and population size change show that the history of the population represented by the public genome sequence is one of collapse followed by a recent phase of mild size recovery. The inferred times of collapse and recovery are Upper Paleolithic, in agreement with archaeological evidence of the initial modern human colonization of Europe.

Keywords

Human genomeGenomeBiologyPopulationGeneticsSingle-nucleotide polymorphismSequence (biology)Evolutionary biologyDemographic historyGene densityEffective population sizeGenetic variationGeneGenotypeDemography

Affiliated Institutions

Related Publications

Publication Info

Year
2002
Type
article
Volume
100
Issue
1
Pages
376-381
Citations
120
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

120
OpenAlex

Cite This

Gábor Marth, Greg Schuler, Raymond T. Yeh et al. (2002). Sequence variations in the public human genome data reflect a bottlenecked population history. Proceedings of the National Academy of Sciences , 100 (1) , 376-381. https://doi.org/10.1073/pnas.222673099

Identifiers

DOI
10.1073/pnas.222673099