Abstract
The R package mice imputes incomplete multivariate data by chained equations. The software mice 1.0 appeared in the year 2000 as an S-PLUS library, and in 2001 as an R package. mice 1.0 introduced predictor selection, passive imputation and automatic pooling. This article documents mice, which extends the functionality of mice 1.0 in several ways. In mice, the analysis of imputed data is made completely general, whereas the range of models under which pooling works is substantially extended. mice adds new functionality for imputing multilevel data, automatic predictor selection, data handling, post-processing imputed values, specialized pooling routines, model selection tools, and diagnostic graphs. Imputation of categorical data is improved in order to bypass problems caused by perfect prediction. Special attention is paid to transformations, sum scores, indices and interactions using passive imputation, and to the proper setup of the predictor matrix. mice can be downloaded from the Comprehensive R Archive Network. This article provides a hands-on, stepwise approach to solve applied incomplete data problems.
Keywords
Related Publications
Multiple Imputation of Missing Values
Following the seminal publications of Rubin about thirty years ago, statisticians have become increasingly aware of the inadequacy of “complete-case” analysis of datasets with m...
Multiple imputation using chained equations: Issues and guidance for practice
Abstract Multiple imputation by chained equations is a flexible and practical approach to handling missing data. We describe the principles of the method and show how to impute ...
A multivariate technique for multiply imputing missing values using a sequence of regression models
This article describes and evaluates a procedure for imputing missing values for a relatively complex data structure when the data are missing at random. The imputations are obt...
Multiple imputation of discrete and continuous data by fully conditional specification
The goal of multiple imputation is to provide valid inferences for statistical estimates from incomplete data. To achieve that goal, imputed values should preserve the structure...
Multiple Imputation of Missing Values: Further Update of Ice, with an Emphasis on Categorical Variables
Multiple imputation of missing data continues to be a topic of considerable interest and importance to applied researchers. In this article, the ice package for multiple imputat...
Publication Info
- Year
- 2014
- Type
- article
- Citations
- 6237
- Access
- Closed