Abstract
Missing data is a common complication in data analysis. In many medical settings missing data can cause difficulties in estimation, precision and inference. Multiple imputation (MI) (Multiple Imputation for Nonresponse in Surveys. Wiley: New York, 1987) is a simulation‐based approach to deal with incomplete data. Although there are many different methods to deal with incomplete data, MI has become one of the leading methods. Since the late 1980s we have observed a constant increase in the use and publication of MI‐related research. This tutorial does not attempt to cover all the material concerning MI; rather, it provides an overview that brings together the theory behind MI and its implementation, and discusses the growing possibilities for using MI with commercial and free software. We illustrate some of the major points using an example from an Alzheimer disease (AD) study. In this AD study, while clinical data are available for all subjects, postmortem data are only available for the subset of those who died and underwent an autopsy. Analysis of incomplete data requires making unverifiable assumptions. These assumptions are discussed in detail in the text. Relevant S‐Plus code is provided. Copyright © 2007 John Wiley & Sons, Ltd.
Related Publications
Multiple Imputation and its Application
A practical guide to analysing partially observed data. Collecting, analysing and drawing inferences from data is central to research in the medical and social sciences. Unfortu...
Multiple Imputation in Practice
Missing data frequently complicates data analysis for scientific investigations. The development of statistical methods to address missing data has been an active area of resear...
Population‐calibrated multiple imputation for a binary/categorical covariate in categorical regression models
Multiple imputation (MI) has become popular for analyses with missing data in medical research. The standard implementation of MI is based on the assumption of data being missin...
A comparison of inclusive and restrictive strategies in modern missing data procedures.
Two classes of modern missing data procedures, maximum likelihood (ML) and multiple imputation (MI), tend to yield similar results when implemented in comparable ways. In either...
Multiple Imputation of Missing Values: Update of Ice
Royston (2004) introduced mvis, an implementation for Stata of MICE, a method of multiple multivariate imputation of missing values under missing-at-random (MAR) assumptions. In...
Publication Info
- Year: 2007
- Type: article
- Volume: 26
- Issue: 16
- Pages: 3057-3077
- Citations: 418
- Access: Closed
Identifiers
- DOI: 10.1002/sim.2787