A Fast Learning Algorithm for Deep Belief Nets

Abstract

We show how to use “complementary priors” to eliminate the explaining-away effects that make inference difficult in densely connected belief nets that have many hidden layers. Using complementary priors, we derive a fast, greedy algorithm that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory. The fast, greedy algorithm is used to initialize a slower learning procedure that fine-tunes the weights using a contrastive version of the wake-sleep algorithm. After fine-tuning, a network with three hidden layers forms a very good generative model of the joint distribution of handwritten digit images and their labels. This generative model gives better digit classification than the best discriminative learning algorithms. The low-dimensional manifolds on which the digits lie are modeled by long ravines in the free-energy landscape of the top-level associative memory, and it is easy to explore these ravines by using the directed connections to display what the associative memory has in mind.

Keywords

Computer scienceAssociative propertyPrior probabilityDiscriminative modelGenerative modelContent-addressable memoryArtificial intelligenceInferenceAlgorithmGenerative grammarPattern recognition (psychology)Deep belief networkArtificial neural networkMathematicsBayesian probability

Affiliated Institutions

Related Publications

Deep Belief Networks using discriminative features for phone recognition

Abdelrahman Mohamed , Tara N. Sainath , George E. Dahl +3 more

Deep Belief Networks (DBNs) are multi-layer generative models. They can be trained to model windows of coefficients extracted from speech and they discover multiple layers of fe...

2011 289 citations

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe , Christian Szegedy

Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. T...

2024 arXiv (Cornell University) 15635 citations

Unsupervised Feature Learning via Non-parametric Instance Discrimination

Zhirong Wu , Yuanjun Xiong , Stella X. Yu +1 more

Neural net classifiers trained on data with annotated class labels can also capture apparent visual similarity among categories without being directed to do so. We study whether...

2018 3435 citations

Learning Deconvolution Network for Semantic Segmentation

Hyeonwoo Noh , Seunghoon Hong , Bohyung Han

We propose a novel semantic segmentation algorithm by learning a deep deconvolution network. We learn the network on top of the convolutional layers adopted from VGG 16-layer ne...

2015 3978 citations

Directed diffusion

Chalermek Intanagonwiwat , Ramesh Govindan , Deborah Estrin

Advances in processor, memory and radio technology will enable small and cheap nodes capable of sensing, communication and computation. Networks of such nodes can coordinate to ...

2000 5385 citations

Publication Info

Year: 2006
Type: article
Volume: 18
Issue: 7
Pages: 1527-1554
Citations: 16027
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

A Fast Learning Algorithm for Deep Belief Nets

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

16027

OpenAlex

Cite This

APA Style

                            
                                    Geoffrey E. Hinton, 
                                
                                    Simon Osindero, 
                                
                                    Yee‐Whye Teh
                                
                            (2006). 
                            A Fast Learning Algorithm for Deep Belief Nets. 
                            Neural Computation
                            , 18
                            (7)
                            , 1527-1554.
                            https://doi.org/10.1162/neco.2006.18.7.1527

Identifiers

DOI: 10.1162/neco.2006.18.7.1527