Abstract

The classical (two-round) hypothesis of vertebrate genome duplication proposes two successive whole-genome duplication(s) (polyploidizations) predating the origin of fishes, a view now being seriously challenged. As the debate largely concerns the relative merits of the 'big-bang mode' theory (large-scale duplication) and the 'continuous mode' theory (constant creation by small-scale duplications), we tested whether a significant proportion of paralogous genes in the contemporary human genome was indeed generated in the early stage of vertebrate evolution. After an extensive search of major databases, we dated 1,739 gene duplication events from the phylogenetic analysis of 749 vertebrate gene families. We found a pattern characterized by two waves (I, II) and an ancient component. Wave I represents a recent gene family expansion by tandem or segmental duplications, whereas wave II, a rapid paralogous gene increase in the early stage of vertebrate evolution, supports the idea of genome duplication(s) (the big-bang mode). Further analysis indicated that large- and small-scale gene duplications both make a significant contribution during the early stage of vertebrate evolution to build the current hierarchy of the human proteome.

Keywords

Gene duplicationBiologyVertebrateGenomeGeneGene familyTandem exon duplicationEvolutionary biologyGeneticsTandem repeatGenome evolutionPhylogenetic treeSegmental duplication

MeSH Terms

AnimalsEvolutionMolecularGene DuplicationHumansMolecular Sequence DataMultigene FamilyVertebrates

Affiliated Institutions

Related Publications

Publication Info

Year
2002
Type
article
Volume
31
Issue
2
Pages
205-209
Citations
252
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

252
OpenAlex
11
Influential
197
CrossRef

Cite This

Xun Gu, Yufeng Wang, Jianying Gu (2002). Age distribution of human gene families shows significant roles of both large- and small-scale duplications in vertebrate evolution. Nature Genetics , 31 (2) , 205-209. https://doi.org/10.1038/ng902

Identifiers

DOI
10.1038/ng902
PMID
12032571

Data Quality

Data completeness: 81%