Abstract
Real-world learning tasks often involve high-dimensional data sets with complex patterns of missing features.In this paper we review the problem of learning from incomplete data from two statistical perspectives|the likelihood-based and the Bayesian.The goal is two-fold: to place current neural network approaches to missing data within a statistical framework, and to describe a set of algorithms, derived from the likelihood-based framework, that handle clustering, classi cation, and function approximation from incomplete data in a principled and e cient manner.These algorithms are based on mixture modeling and make t wo distinct appeals to the Expectation-Maximization (EM) principle (Dempster et al., 1977)|both for the estimation of mixture components and for coping with the missing data.
Keywords
Affiliated Institutions
Related Publications
Maximum Likelihood Estimation and Model Selection in Contingency Tables with Missing Data
Abstract In many studies the values of one or more variables are missing for subsets of the original sample. This article focuses on the problem of obtaining maximum likelihood ...
Simple and Globally Convergent Methods for Accelerating the Convergence of Any EM Algorithm
Abstract. The expectationâmaximization (EM) algorithm is a popular approach for obtaining maximum likelihood estimates in incomplete data problems because of its simplicity and ...
The EM Algorithm and Extensions
The first unified account of the theory, methodology, and applications of the EM algorithm and its extensionsSince its inception in 1977, the Expectation-Maximization (EM) algor...
Hierarchical Mixtures of Experts and the EM Algorithm
We present a tree-structured architecture for supervised learning. The statistical model underlying the architecture is a hierarchical mixture model in which both the mixture co...
Statistical approach to X-ray CT imaging and its applications in image analysis. II. A new stochastic model-based image segmentation technique for X-ray CT image
For pt.I, see ibid., vol.11, no.1, p.53.61 (1992). Based on the statistical properties of X-ray CT imaging given in pt.I, an unsupervised stochastic model-based image segmentati...
Publication Info
- Year
- 1994
- Type
- report
- Citations
- 233
- Access
- Closed
External Links
Social Impact
Social media, news, blog, policy document mentions
Citation Metrics
Cite This
Identifiers
- DOI
- 10.21236/ada295618