Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam's Window

David Madigan; Adrian E. Raftery

doi:10.1080/01621459.1994.10476894

Abstract

We consider the problem of model selection and accounting for model uncertainty in high-dimensional contingency tables, motivated by expert system applications. The approach most used currently is a stepwise strategy guided by tests based on approximate asymptotic P values leading to the selection of a single model; inference is then conditional on the selected model. The sampling properties of such a strategy are complex, and the failure to take account of model uncertainty leads to underestimation of uncertainty about quantities of interest. In principle, a panacea is provided by the standard Bayesian formalism that averages the posterior distributions of the quantity of interest under each of the models, weighted by their posterior model probabilities. Furthermore, this approach is optimal in the sense of maximizing predictive ability. But this has not been used in practice, because computing the posterior model probabilities is hard and the number of models is very large (often greater than 10^11). We argue that the standard Bayesian formalism is unsatisfactory and propose an alternative Bayesian approach that, we contend, takes full account of the true model uncertainty by averaging over a much smaller set of models. An efficient search algorithm is developed for finding these models. We consider two classes of graphical models that arise in expert systems: the recursive causal models and the decomposable log-linear models. For each of these, we develop efficient ways of computing exact Bayes factors and hence posterior model probabilities. For the decomposable log-linear models, this is based on properties of chordal graphs and hyper-Markov prior distributions and the resultant calculations can be carried out locally. The end product is an overall strategy for model selection and accounting for model uncertainty that searches efficiently through the very large classes of models involved. Three examples are given. The first two concern data sets that have been analyzed by several authors in the context of model selection. The third addresses a urological diagnostic problem. In each example, our model averaging approach provides better out-of-sample predictive performance than any single model that might reasonably have been selected.
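
The idea of averaging over a much smaller set of models can be illustrated with a minimal sketch. It is not the authors' search algorithm and does not compute Bayes factors for graphical models; it assumes the log marginal likelihoods of a handful of candidate models are already available, keeps only the models whose posterior probability is within a factor `cutoff` of the best model, and averages a quantity of interest over the survivors. The function names, the value of `cutoff`, and all numbers are hypothetical and chosen for illustration only.

```python
import math


def occams_window(log_marginal_liks, log_priors=None, cutoff=20.0):
    """Keep models whose posterior probability is within a factor
    `cutoff` of the best model, and renormalize over the kept set.

    log_marginal_liks: dict mapping model name -> log p(data | model)
    log_priors: dict mapping model name -> log prior probability
                (assumed uniform if omitted)
    """
    if log_priors is None:
        log_priors = {m: 0.0 for m in log_marginal_liks}

    # Unnormalized log posterior model probabilities.
    log_post = {m: log_marginal_liks[m] + log_priors[m]
                for m in log_marginal_liks}
    best = max(log_post.values())

    # Discard models that are more than `cutoff` times less probable
    # than the best model.
    kept = {m: lp for m, lp in log_post.items()
            if best - lp <= math.log(cutoff)}

    # Renormalize; subtracting `best` avoids numerical underflow.
    weights = {m: math.exp(lp - best) for m, lp in kept.items()}
    total = sum(weights.values())
    return {m: w / total for m, w in weights.items()}


def model_average(weights, predictions):
    """Average per-model predictions, weighted by posterior probability."""
    return sum(weights[m] * predictions[m] for m in weights)


# Illustrative inputs only (not taken from the paper).
log_ml = {"M1": -100.0, "M2": -101.0, "M3": -110.0}
predictions = {"M1": 0.30, "M2": 0.50, "M3": 0.90}

weights = occams_window(log_ml, cutoff=20.0)
print(sorted(weights))                      # M3 falls outside the window
print(model_average(weights, predictions))  # weighted mean over M1 and M2
```

With these inputs, M3 is dropped because it is far less probable than M1, and the average is taken over M1 and M2 only. Working on the log scale keeps the computation stable when marginal likelihoods are very small.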

Keywords

occam; Window (computing); Occam's razor; Computer science; Selection (genetic algorithm); Graphical model; Model selection; Artificial intelligence; Statistics; Mathematics; Programming language

Affiliated Institutions

University of Washington US

Related Publications

Comparative Performance of Bayesian and AIC-Based Measures of Phylogenetic Model Uncertainty

Michael E. Alfaro, John P. Huelsenbeck

Reversible-jump Markov chain Monte Carlo (RJ-MCMC) is a technique for simultaneously evaluating multiple related (but not necessarily nested) statistical models that has recentl...

2006 Systematic Biology 90 citations

Probabilistic graphical models : principles and techniques

Daniel L. Koller, Nir Friedman

Most tasks require a person or an automated system to reason -- to reach conclusions based on available information. The framework of probabilistic graphical models, presented i...

2009 6434 citations

Model Selection and Model Averaging in Phylogenetics: Advantages of Akaike Information Criterion and Bayesian Approaches Over Likelihood Ratio Tests

David Posada, Thomas R. Buckley

Model selection is a topic of special relevance in molecular phylogenetics that affects many, if not all, stages of phylogenetic inference. Here we discuss some fundamental conc...

2004 Systematic Biology 3936 citations

Causal Inference, Path Analysis, and Recursive Structural Equations Models

Paul W. Holland

Rubin's model for causal inference in experiments and observational studies is enlarged to analyze the problem of “causes causing causes” and is compared to path analysis and re...

1988 Sociological Methodology 305 citations

Bayesian Variable Selection in Linear Regression

Toby J. Mitchell, John J. Beauchamp

Abstract This article is concerned with the selection of subsets of predictor variables in a linear regression model for the prediction of a dependent variable. It is based on a...

1988 Journal of the American Statistical A... 1367 citations

Publication Info

Year: 1994
Type: article
Volume: 89
Issue: 428
Pages: 1535-1546
Citations: 1207
Access: Closed

Cite This

APA Style

Madigan, D., & Raftery, A. E. (1994). Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam's Window. Journal of the American Statistical Association, 89(428), 1535-1546. https://doi.org/10.1080/01621459.1994.10476894

Identifiers

DOI: 10.1080/01621459.1994.10476894