Abstract

[1st paragraph] At first sight, Bayesian inference with Markov Chain Monte Carlo (MCMC) appears to be straightforward. The user defines a full probability model, perhaps using one of the programs discussed in this issue; an underlying sampling engine takes the model definition and returns a sequence of dependent samples from the posterior distribution of the model parameters, given the supplied data. The user can derive any summary of the posterior distribution from this sample. For example, to calculate a 95% credible interval for a parameter α, it suffices to take 1000 MCMC iterations of α and sort them so that α<sub>1</sub><α<sub>2</sub><...<α<sub>1000</sub>. The credible interval estimate is then (α<sub>25</sub>, α<sub>975</sub>). However, there is a price to be paid for this simplicity. Unlike most numerical methods used in statistical inference, MCMC does not give a clear indication of whether it has converged. The underlying Markov chain theory only guarantees that the distribution of the output will converge to the posterior in the limit as the number of iterations increases to infinity. The user is generally ignorant about how quickly convergence occurs, and therefore has to fall back on post hoc testing of the sampled output. By convention, the sample is divided into two parts: a “burn in” period during which all samples are discarded, and the remainder of the run in which the chain is considered to have converged sufficiently close to the limiting distribution to be used. Two questions then arise: 1. How long should the burn in period be? 2. How many samples are required to accurately estimate posterior quantities of interest? The <b>coda</b> package for R contains a set of functions designed to help the user answer these questions. Some of these convergence diagnostics are simple graphical ways of summarizing the data. Others are formal statistical tests.

Keywords

Markov chain Monte CarloPosterior probabilityBayesian probabilityMarkov chainBayesian inferenceComputer scienceEconometricsMathematicsStatisticsAlgorithm

Related Publications

Publication Info

Year
2006
Type
article
Citations
3474
Access
Closed

External Links

Citation Metrics

3474
OpenAlex

Cite This

Martyn Plummer, Nicky Best, Kate Cowles et al. (2006). CODA: convergence diagnosis and output analysis for MCMC. Open Research Online (The Open University) .