Abstract

ABSTRACT The recent advent of genome sequences as the only source available to classify many newly discovered viruses challenges the development of virus taxonomy by expert virologists who traditionally rely on extensive virus characterization. In this proof-of-principle study, we address this issue by presenting a computational approach (DEmARC) to classify viruses of a family into groups at hierarchical levels using a sole criterion—intervirus genetic divergence. To quantify genetic divergence, we used pairwise evolutionary distances (PEDs) estimated by maximum likelihood inference on a multiple alignment of family-wide conserved proteins. PEDs were calculated for all virus pairs, and the resulting distribution was modeled via a mixture of probability density functions. The model enables the quantitative inference of regions of distance discontinuity in the family-wide PED distribution, which define the levels of hierarchy. For each level, a limit on genetic divergence, below which two viruses join the same group, was objectively selected among a set of candidates by minimizing violations of intragroup PEDs to the limit. In a case study, we applied the procedure to hundreds of genome sequences of picornaviruses and extensively evaluated it by modulating four key parameters. It was found that the genetics-based classification largely tolerates variations in virus sampling and multiple alignment construction but is affected by the choice of protein and the measure of genetic divergence. In an accompanying paper (C. Lauber and A. E. Gorbalenya, J. Virol. 86:3905–3915, 2012), we analyze the substantial insight gained with the genetics-based classification approach by comparing it with the expert-based picornavirus taxonomy.

Keywords

BiologyVirus classificationPairwise comparisonInferenceDivergence (linguistics)GenomeGenetic diversityGenetic divergenceComputational biologyGeneticsEvolutionary biologyComputer scienceArtificial intelligencePopulationGene

MeSH Terms

Computational BiologyEvolutionMolecularGenetic VariationMolecular Sequence DataPhylogenyPicornaviridaeSequence Alignment

Affiliated Institutions

Related Publications

Publication Info

Year
2012
Type
article
Volume
86
Issue
7
Pages
3890-3904
Citations
99
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

99
OpenAlex
5
Influential

Cite This

Chris Lauber, Alexander E. Gorbalenya (2012). Partitioning the Genetic Diversity of a Virus Family: Approach and Evaluation through a Case Study of Picornaviruses. Journal of Virology , 86 (7) , 3890-3904. https://doi.org/10.1128/jvi.07173-11

Identifiers

DOI
10.1128/jvi.07173-11
PMID
22278230
PMCID
PMC3302503

Data Quality

Data completeness: 90%