Learning from Incomplete Data.

Zoubin Ghahramani; Michael I. Jordan

doi:10.21236/ada295618

Abstract

Real-world learning tasks often involve high-dimensional data sets with complex patterns of missing features. In this paper we review the problem of learning from incomplete data from two statistical perspectives: the likelihood-based and the Bayesian. The goal is two-fold: to place current neural network approaches to missing data within a statistical framework, and to describe a set of algorithms, derived from the likelihood-based framework, that handle clustering, classification, and function approximation from incomplete data in a principled and efficient manner. These algorithms are based on mixture modeling and make two distinct appeals to the Expectation-Maximization (EM) principle (Dempster et al., 1977), both for the estimation of mixture components and for coping with the missing data.
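
As a rough illustration of the two uses of EM that the abstract mentions, the following Python sketch fits a Gaussian mixture to data with missing entries. It is not taken from the report. The function name `em_gmm_missing`, the NaN encoding of missing values, the initialisation, and the `reg` ridge term are all assumptions made for this example, and every row is assumed to have at least one observed feature.

```python
import numpy as np

def em_gmm_missing(X, k, n_iter=50, seed=0, reg=1e-6):
    """Hypothetical sketch: EM for a k-component Gaussian mixture.

    Missing entries of X are encoded as NaN (an assumption of this example).
    Returns mixing weights pi (k,), means mu (k, d), covariances sigma (k, d, d).
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    obs = ~np.isnan(X)
    # Initialise from column-wise statistics of the observed values.
    col_mean = np.nanmean(X, axis=0)
    col_var = np.nanvar(X, axis=0) + reg
    mu = col_mean + rng.normal(size=(k, d)) * np.sqrt(col_var)
    sigma = np.stack([np.diag(col_var) for _ in range(k)])
    pi = np.full(k, 1.0 / k)

    for _ in range(n_iter):
        log_r = np.zeros((n, k))
        x_hat = np.zeros((n, k, d))   # E[x | observed part, component j]
        c = np.zeros((n, k, d, d))    # conditional covariance of the missing part
        # E-step: responsibilities use only the observed coordinates, and
        # missing coordinates are replaced by their conditional moments.
        for i in range(n):
            o, m = obs[i], ~obs[i]
            for j in range(k):
                s_oo = sigma[j][np.ix_(o, o)]
                diff = X[i, o] - mu[j, o]
                sol = np.linalg.solve(s_oo, diff)
                _, logdet = np.linalg.slogdet(s_oo)
                log_r[i, j] = np.log(pi[j]) - 0.5 * (
                    diff @ sol + logdet + o.sum() * np.log(2 * np.pi))
                x_hat[i, j, o] = X[i, o]
                if m.any():
                    s_mo = sigma[j][np.ix_(m, o)]
                    x_hat[i, j, m] = mu[j, m] + s_mo @ sol
                    c[i, j][np.ix_(m, m)] = (
                        sigma[j][np.ix_(m, m)]
                        - s_mo @ np.linalg.solve(s_oo, s_mo.T))
        # Normalise responsibilities in a numerically stable way.
        log_r -= log_r.max(axis=1, keepdims=True)
        r = np.exp(log_r)
        r /= r.sum(axis=1, keepdims=True)

        # M-step: weighted updates using the expected sufficient statistics.
        nk = r.sum(axis=0)
        pi = nk / n
        mu = np.einsum('ij,ijd->jd', r, x_hat) / nk[:, None]
        dev = x_hat - mu[None, :, :]
        sigma = (np.einsum('ij,ijd,ije->jde', r, dev, dev)
                 + np.einsum('ij,ijde->jde', r, c)) / nk[:, None, None]
        sigma += reg * np.eye(d)      # small ridge to keep covariances invertible
    return pi, mu, sigma
```

Calling `em_gmm_missing(X, k=2)` on an array whose unknown entries are `np.nan` returns the estimated weights, means, and covariances. The E-step handles both kinds of hidden quantity at once: the unknown component label through the responsibilities, and the unobserved features through their conditional mean and covariance given the observed ones.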

Keywords

Computer science

Affiliated Institutions

Massachusetts Institute of Technology, US

Related Publications

Maximum Likelihood Estimation and Model Selection in Contingency Tables with Missing Data

Camil Fuchs

Abstract In many studies the values of one or more variables are missing for subsets of the original sample. This article focuses on the problem of obtaining maximum likelihood ...

1982 Journal of the American Statistical A... 146 citations

Simple and Globally Convergent Methods for Accelerating the Convergence of Any EM Algorithm

Ravi Varadhan, C. P. A. Roland

Abstract. The expectation‐maximization (EM) algorithm is a popular approach for obtaining maximum likelihood estimates in incomplete data problems because of its simplicity and ...

2008 Scandinavian Journal of Statistics 338 citations

The EM Algorithm and Extensions

Debashis Kushary, Geoffrey J. McLachlan, Thriyambakam Krishnan

The first unified account of the theory, methodology, and applications of the EM algorithm and its extensions. Since its inception in 1977, the Expectation-Maximization (EM) algor...

1998 Technometrics 5108 citations

Hierarchical Mixtures of Experts and the EM Algorithm

Michael I. Jordan, Robert A. Jacobs

We present a tree-structured architecture for supervised learning. The statistical model underlying the architecture is a hierarchical mixture model in which both the mixture co...

1994 Neural Computation 2555 citations

Statistical approach to X-ray CT imaging and its applications in image analysis. II. A new stochastic model-based image segmentation technique for X-ray CT image

T. Lei, Wilfred Sewchand

For pt. I, see ibid., vol. 11, no. 1, p. 53-61 (1992). Based on the statistical properties of X-ray CT imaging given in pt. I, an unsupervised stochastic model-based image segmentati...

1992 IEEE Transactions on Medical Imaging 91 citations

Publication Info

Year: 1994
Type: report
Citations: 233
Access: Closed

Cite This

APA Style

Zoubin Ghahramani, Michael I. Jordan (1994). Learning from Incomplete Data. https://doi.org/10.21236/ada295618

Identifiers

DOI: 10.21236/ada295618