Abstract

Model selection is a topic of special relevance in molecular phylogenetics that affects many, if not all, stages of phylogenetic inference. Here we discuss some fundamental concepts and techniques of model selection in the context of phylogenetics. We start by reviewing different aspects of the selection of substitution models in phylogenetics from a theoretical, philosophical and practical point of view, and summarize this comparison in a table. We argue that the most commonly implemented model selection approach, the hierarchical likelihood ratio test, is not the optimal strategy for model selection in phylogenetics, and that approaches like the Akaike Information Criterion (AIC) and Bayesian methods offer important advantages. In particular, the latter two methods are able to simultaneously compare multiple nested or nonnested models, assess model selection uncertainty, and allow for the estimation of phylogenies and model parameters using all available models (model-averaged inference or multimodel inference). We also describe how the relative importance of the different parameters included in substitution models can be depicted. To illustrate some of these points, we have applied AIC-based model averaging to 37 mitochondrial DNA sequences from the subgenus Ohomopterus (genus Carabus) ground beetles described by Sota and Vogler (2001).
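The AIC-based ideas named in the abstract (comparing several models at once, quantifying selection uncertainty, model averaging, and relative parameter importance) can be sketched in a few lines of Python. This is a minimal illustration and not the authors' implementation: the log-likelihoods, parameter counts, and alpha estimates below are invented for the example, the function names are hypothetical, and the parameter count k is assumed to cover only substitution-model parameters (adding the same number of branch lengths to every model would not change the AIC differences). The averaged estimate uses one common convention, renormalizing the weights over the models that contain the parameter.

```python
import math

def aic(log_likelihood, k):
    """Akaike Information Criterion: AIC = -2 lnL + 2k."""
    return -2.0 * log_likelihood + 2.0 * k

def akaike_weights(aic_scores):
    """Turn AIC scores into weights that sum to 1 across the candidate models."""
    best = min(aic_scores.values())
    rel = {m: math.exp(-0.5 * (s - best)) for m, s in aic_scores.items()}
    total = sum(rel.values())
    return {m: r / total for m, r in rel.items()}

def parameter_importance(weights, model_params, parameter):
    """Sum the weights of every model that includes the given parameter."""
    return sum(w for m, w in weights.items() if parameter in model_params[m])

def model_averaged_estimate(weights, estimates):
    """Weighted mean of a parameter over the models that contain it."""
    numerator = sum(weights[m] * est for m, est in estimates.items())
    denominator = sum(weights[m] for m in estimates)
    return numerator / denominator

# Hypothetical candidate set: (log-likelihood, number of free parameters).
# These numbers are made up for illustration only.
candidates = {
    "JC":    (-3500.0, 0),
    "HKY":   (-3400.0, 4),
    "HKY+G": (-3350.0, 5),
    "GTR+G": (-3348.0, 9),
}

# Which parameter classes each model includes (assumed labels).
model_params = {
    "JC":    set(),
    "HKY":   {"freqs", "kappa"},
    "HKY+G": {"freqs", "kappa", "gamma"},
    "GTR+G": {"freqs", "rates", "gamma"},
}

scores = {m: aic(lnl, k) for m, (lnl, k) in candidates.items()}
weights = akaike_weights(scores)
best_model = min(scores, key=scores.get)

gamma_importance = parameter_importance(weights, model_params, "gamma")

# Hypothetical gamma shape (alpha) estimates from the two +G models.
alpha_estimates = {"HKY+G": 0.30, "GTR+G": 0.34}
alpha_averaged = model_averaged_estimate(weights, alpha_estimates)

for m in sorted(scores, key=scores.get):
    print(f"{m:6s} AIC={scores[m]:9.2f} weight={weights[m]:.4f}")
print("best model:", best_model)
print("importance of gamma:", round(gamma_importance, 4))
print("model-averaged alpha:", round(alpha_averaged, 4))
```

Unlike a sequence of pairwise likelihood ratio tests, this ranking does not require the models to be nested, and the weights give a direct measure of how much support the best model has relative to the alternatives.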

Keywords

Akaike information criterion; Bayesian information criterion; Model selection; Bayesian inference; Selection (genetic algorithm); Inference; Likelihood-ratio test; Bayesian probability; Phylogenetics; Biology; Information Criteria; Statistics; Machine learning; Computer science; Evolutionary biology; Artificial intelligence; Mathematics; Genetics

Related Publications

Model Selection in Phylogenetics

▪ Abstract Investigation into model selection has a long history in the statistical literature. As model-based approaches begin dominating systematic biology, increased attentio...

2005 Annual Review of Ecology Evolution an... 417 citations

jModelTest: Phylogenetic Model Averaging

jModelTest is a new program for the statistical selection of models of nucleotide substitution based on "Phyml" (Guindon and Gascuel 2003. A simple, fast, and accurate algorithm...

2008 Molecular Biology and Evolution 10411 citations

Publication Info

Year: 2004
Type: article
Volume: 53
Issue: 5
Pages: 793-808
Citations: 3936
Access: Closed

Citation Metrics

3936 (OpenAlex)

Cite This

David Posada, Thomas R. Buckley (2004). Model Selection and Model Averaging in Phylogenetics: Advantages of Akaike Information Criterion and Bayesian Approaches Over Likelihood Ratio Tests. Systematic Biology, 53(5), 793-808. https://doi.org/10.1080/10635150490522304

Identifiers

DOI: 10.1080/10635150490522304