Abstract
We use simulation studies, whose design is realistic for educational and medical research (as well as other fields of inquiry), to compare Bayesian and likelihood-based methods for fitting variance-components (VC) and random-effects logistic regression (RELR) models. The likelihood (and approximate likelihood) approaches we examine are based on the methods most widely used in current applied multilevel (hierarchical) analyses: maximum likelihood (ML) and restricted ML (REML) for Gaussian outcomes, and marginal and penalized quasi-likelihood (MQL and PQL) for Bernoulli outcomes. Our Bayesian methods use Markov chain Monte Carlo (MCMC) estimation, with adaptive hybrid Metropolis-Gibbs sampling for RELR models, and several diffuse prior distributions ($\Gamma^{-1}(\epsilon, \epsilon)$ and $U(0, \frac{1}{\epsilon})$ priors for variance components). For evaluation criteria we consider bias of point estimates and nominal versus actual coverage of interval estimates in repeated sampling. In two-level VC models we find that (a) both likelihood-based and Bayesian approaches can be made to produce approximately unbiased estimates, although the automatic manner in which REML accomplishes this is an advantage, but (b) both approaches had difficulty achieving nominal coverage in small samples and with small values of the intraclass correlation. With the three-level RELR models we examine we find that (c) quasi-likelihood methods for estimating random-effects variances perform badly with respect to bias and coverage in the example we simulated, and (d) Bayesian diffuse-prior methods lead to well-calibrated point and interval RELR estimates.
While it is true that the likelihood-based methods we study are considerably faster computationally than MCMC, (i) steady improvements in recent years in both hardware speed and efficiency of Monte Carlo algorithms and (ii) the lack of calibration of likelihood-based methods in some common hierarchical settings combine to make MCMC-based Bayesian fitting of multilevel models an attractive approach, even with rather large data sets. Other analytic strategies based on less approximate likelihood methods are also possible but would benefit from further study of the type summarized here.
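To make the abstract's setting concrete, the following is a minimal sketch (not the authors' implementation) of Gibbs sampling for a two-level VC model, $y_{ij} = \mu + u_j + e_{ij}$ with $u_j \sim N(0, \sigma_u^2)$ and $e_{ij} \sim N(0, \sigma_e^2)$, using the diffuse $\Gamma^{-1}(\epsilon, \epsilon)$ priors on both variances that the paper evaluates. All function and variable names here are illustrative assumptions.

```python
# Sketch: Gibbs sampler for a two-level variance-components model with a flat
# prior on mu and Inverse-Gamma(eps, eps) priors on sigma_u^2 and sigma_e^2.
import numpy as np

rng = np.random.default_rng(42)

def gibbs_vc(y, groups, n_iter=2000, burn=500, eps=1e-3):
    """y: flat response array; groups: integer group index per observation."""
    J = groups.max() + 1
    n_j = np.bincount(groups, minlength=J)   # observations per group
    N = len(y)
    mu, u = y.mean(), np.zeros(J)
    sig_u2, sig_e2 = 1.0, 1.0
    draws = []
    for t in range(n_iter):
        # u_j | rest: Normal, posterior precision n_j/sig_e2 + 1/sig_u2
        resid_sum = np.bincount(groups, weights=y - mu, minlength=J)
        prec = n_j / sig_e2 + 1.0 / sig_u2
        u = rng.normal((resid_sum / sig_e2) / prec, np.sqrt(1.0 / prec))
        # mu | rest: flat prior -> Normal(mean of (y - u_j), sig_e2 / N)
        mu = rng.normal((y - u[groups]).mean(), np.sqrt(sig_e2 / N))
        # Variances | rest: Inverse-Gamma draws, sampled as 1/Gamma(shape, scale)
        sig_u2 = 1.0 / rng.gamma(eps + J / 2.0,
                                 1.0 / (eps + 0.5 * (u ** 2).sum()))
        e = y - mu - u[groups]
        sig_e2 = 1.0 / rng.gamma(eps + N / 2.0,
                                 1.0 / (eps + 0.5 * (e ** 2).sum()))
        if t >= burn:
            draws.append((mu, sig_u2, sig_e2))
    return np.array(draws)

# Synthetic data: 30 groups of 10, with mu = 2, sigma_u^2 = 1, sigma_e^2 = 1.
J, n = 30, 10
groups = np.repeat(np.arange(J), n)
y = 2.0 + rng.normal(0, 1, J)[groups] + rng.normal(0, 1, J * n)
draws = gibbs_vc(y, groups)
mu_hat, su2_hat, se2_hat = draws.mean(axis=0)
```

Repeating a simulation like this over many generated data sets, and recording the bias of the posterior means and the empirical coverage of posterior intervals, is the kind of repeated-sampling evaluation the abstract describes.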
Related Publications
Improved Estimation Procedures for Multilevel Models with Binary Response: A Case-Study
Summary During recent years, analysts have been relying on approximate methods of inference to estimate multilevel models for binary or count data. In an earlier study of random...
The No-U-Turn Sampler: Adaptively Setting Path Lengths in Hamiltonian Monte Carlo
Hamiltonian Monte Carlo (HMC) is a Markov chain Monte Carlo (MCMC) algorithm that avoids the random walk behavior and sensitivity to correlated parameters that plague many MCMC ...
Bayesian Statistics: An Introduction
Bayesian Statistics is the school of thought that combines prior beliefs with the likelihood of a hypothesis to arrive at posterior beliefs. The first edition of Peter Lee's book...
Markov Chain Monte Carlo Methods and the Label Switching Problem in Bayesian Mixture Modeling
In the past ten years there has been a dramatic increase of interest in the Bayesian analysis of finite mixture models. This is primarily because of the emergence of Markov chai...
Inference in Molecular Population Genetics
Summary Full likelihood-based inference for modern population genetics data presents methodological and computational challenges. The problem is of considerable practical import...
Publication Info
- Year: 2006
- Type: article
- Volume: 1
- Issue: 3
- Citations: 607
- Access: Closed
Identifiers
- DOI: 10.1214/06-ba117