Abstract

Researchers often face a dilemma: Should they collect little data and emphasize quality, or much data at the expense of quality? The utility of the 3-form design coupled with maximum likelihood methods for estimation of missing values was evaluated. In 3-form design surveys, four sets of items. X, A, B, and C are administered: Each third of the subjects receives X and one combination of two other item sets - AB, BC, or AC. Variances and covariances were estimated with pairwise deletion, mean replacement, single imputation, multiple imputation, raw data maximum likelihood, multiple-group covariance structure modeling, and Expectation-Maximization (EM) algorithm estimation. The simulation demonstrated that maximum likelihood estimation and multiple imputation methods produce the most efficient and least biased estimates of variances and covariances for normally distributed and slightly skewed data when data are missing completely at random (MCAR). Pairwise deletion provided equally unbiased estimates but was less efficient than ML procedures. Further simulation results demonstrated that nun-maximum likelihood methods break down when data are not missing completely at random. Application of these methods with empirical drug use data resulted in similar covariance matrices for pairwise and EM estimation, however, ML estimation produced better and more efficient regression estimates. Maximum likelihood estimation or multiple imputation procedures. which are now becoming more readily available, are always recommended. In order to maximize the efficiency of the ML parameter estimates, it is recommended that scale items be split across forms rather than being left intact within forms.

Keywords

Missing dataMaximum likelihoodStatisticsComputer scienceValue (mathematics)Data miningEconometricsMathematics

Affiliated Institutions

Related Publications

Applied Missing Data Analysis

Part 1. An Introduction to Missing Data. 1.1 Introduction. 1.2 Chapter Overview. 1.3 Missing Data Patterns. 1.4 A Conceptual Overview of Missing Data heory. 1.5 A More Formal De...

2010 6888 citations

Multiple Imputation for Nonresponse in Surveys

Tables and Figures. Glossary. 1. Introduction. 1.1 Overview. 1.2 Examples of Surveys with Nonresponse. 1.3 Properly Handling Nonresponse. 1.4 Single Imputation. 1.5 Multiple Imp...

1987 Wiley series in probability and stati... 19880 citations

Multiple Imputation for Missing Data

Two algorithms for producing multiple imputations for missing data are evaluated with simulated data. Software using a propensity score classifier with the approximate Bayesian ...

2000 Sociological Methods & Research 786 citations

Publication Info

Year
1996
Type
article
Volume
31
Issue
2
Pages
197-218
Citations
368
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

368
OpenAlex

Cite This

John W. Graham, Scott M. Hofer, David P. MacKinnon (1996). Maximizing the Usefulness of Data Obtained with Planned Missing Value Patterns: An Application of Maximum Likelihood Procedures. Multivariate Behavioral Research , 31 (2) , 197-218. https://doi.org/10.1207/s15327906mbr3102_3

Identifiers

DOI
10.1207/s15327906mbr3102_3