Abstract
When many protein sequences are available for estimating the time of divergence between two species, it is customary to estimate the time for each protein separately and then use the average for all proteins as the final estimate. However, it can be shown that this estimate generally has an upward bias, and that an unbiased estimate is obtained by using distances based on concatenated sequences. We have shown that two concatenation-based distances, i.e., average gamma distance weighted with sequence length ( d 2 ) and multiprotein gamma distance ( d 3 ), generally give more satisfactory results than other concatenation-based distances. Using these two distance measures for 104 protein sequences, we estimated the time of divergence between mice and rats to be approximately 33 million years ago. Similarly, the time of divergence between humans and rodents was estimated to be approximately 96 million years ago. We also investigated the dependency of time estimates on statistical methods and various assumptions made by using sequence data from eubacteria, protists, plants, fungi, and animals. Our best estimates of the times of divergence between eubacteria and eukaryotes, between protists and other eukaryotes, and between plants, fungi, and animals were 3, 1.7, and 1.3 billion years ago, respectively. However, estimates of ancient divergence times are subject to a substantial amount of error caused by uncertainty of the molecular clock, horizontal gene transfer, errors in sequence alignments, etc.
Keywords
Affiliated Institutions
Related Publications
Effects of Models of Rate Evolution on Estimation of Divergence Dates with Special Reference to the Metazoan 18S Ribosomal RNA Phylogeny
The molecular clock, i.e., constancy of the rate of evolution over time, is commonly assumed in estimating divergence dates. However, this assumption is often violated and has d...
A Nonparametric Approach to Estimating Divergence Times in the Absence of Rate Constancy
A new method for estimating divergence times when evolutionary rates are variable across lineages is proposed. The method, called nonparametric rate smoothing (NPRS), relies on ...
A simple method of removing the effect of a bottleneck and unequal population sizes on pairwise genetic distances
In this paper, we derive the expectation of two popular genetic distances under a model of pure population fission allowing for unequal population sizes. Under the model, we sho...
Estimation of Primate Speciation Dates Using Local Molecular Clocks
Protein-coding genes of the mitochondrial genomes from 31 mammalian species were analyzed to estimate the speciation dates within primates and also between rats and mice. Three ...
Vagaries of the molecular clock
The hypothesis of the molecular evolutionary clock asserts that informational macromolecules (i.e., proteins and nucleic acids) evolve at rates that are constant through time an...
Publication Info
- Year
- 2001
- Type
- article
- Volume
- 98
- Issue
- 5
- Pages
- 2497-2502
- Citations
- 330
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.1073/pnas.051611498