Efficient Learning of Sparse Representations with an Energy-Based Model

Abstract

We describe a novel unsupervised method for learning sparse, overcomplete features. The model uses a linear encoder, and a linear decoder preceded by a sparsifying non-linearity that turns a code vector into a quasi-binary sparse code vector. Given an input, the optimal code minimizes the distance between the output of the decoder and the input patch while being as similar as possible to the encoder output. Learning proceeds in a two-phase EM-like fashion: (1) compute the minimum-energy code vector, (2) adjust the parameters of the encoder and decoder so as to decrease the energy. The model produces stroke detectors when trained on handwritten numerals, and Gabor-like filters when trained on natural image patches. Inference and learning are very fast, requiring no preprocessing, and no expensive sampling. Using the proposed unsupervised method to initialize the first layer of a convolutional network, we achieved an error rate slightly lower than the best reported result on the MNIST dataset. Finally, an extension of the method is described to learn topographical filter maps.
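The two-phase procedure in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. Several choices are assumptions made only for illustration: a plain logistic function stands in for the sparsifying non-linearity, squared Euclidean distance is used for both energy terms, the weight `alpha`, the step sizes, and gradient descent with step halving for inference are arbitrary, and all function names are hypothetical.

```python
import numpy as np

def squash(z):
    # Assumption: a plain logistic as a stand-in for the sparsifying non-linearity.
    return 1.0 / (1.0 + np.exp(-z))

def energy(x, z, W_enc, W_dec, alpha=1.0):
    # Code term: distance between the code and the linear encoder output.
    code_err = z - W_enc @ x
    # Reconstruction term: distance between the input and the decoder output.
    recon_err = x - W_dec @ squash(z)
    return alpha * code_err @ code_err + recon_err @ recon_err

def energy_grad_z(x, z, W_enc, W_dec, alpha=1.0):
    # Gradient of the energy with respect to the code vector z.
    s = squash(z)
    recon_err = x - W_dec @ s
    return 2.0 * alpha * (z - W_enc @ x) - 2.0 * (W_dec.T @ recon_err) * s * (1.0 - s)

def infer_code(x, W_enc, W_dec, alpha=1.0, steps=50, lr=0.1):
    # Phase 1: start from the encoder output and descend the energy in z.
    # A step is kept only if it lowers the energy; otherwise the step size is halved.
    z = W_enc @ x
    e = energy(x, z, W_enc, W_dec, alpha)
    for _ in range(steps):
        z_new = z - lr * energy_grad_z(x, z, W_enc, W_dec, alpha)
        e_new = energy(x, z_new, W_enc, W_dec, alpha)
        if e_new < e:
            z, e = z_new, e_new
        else:
            lr *= 0.5
    return z

def update_parameters(x, z, W_enc, W_dec, alpha=1.0, lr=0.01):
    # Phase 2: one gradient step on encoder and decoder with the code held fixed.
    s = squash(z)
    code_err = z - W_enc @ x
    recon_err = x - W_dec @ s
    W_enc_new = W_enc + lr * 2.0 * alpha * np.outer(code_err, x)
    W_dec_new = W_dec + lr * 2.0 * np.outer(recon_err, s)
    return W_enc_new, W_dec_new

# Toy usage: an 8-dimensional input and a 16-dimensional (overcomplete) code.
rng = np.random.default_rng(0)
x = rng.normal(size=8)
W_enc = 0.1 * rng.normal(size=(16, 8))
W_dec = 0.1 * rng.normal(size=(8, 16))
z_star = infer_code(x, W_enc, W_dec)
W_enc, W_dec = update_parameters(x, z_star, W_enc, W_dec)
```

In this sketch the encoder output serves as the starting point for inference, so a well-trained encoder would leave little work for the descent loop.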

Keywords

Computer science, Artificial intelligence

Publication Info

Year: 2007
Type: book-chapter
Pages: 1137-1144
Citations: 1077
Access: Closed

Cite This

APA Style

Christopher S. Poultney, Sumit Chopra, Yann LeCun, et al. (2007). Efficient Learning of Sparse Representations with an Energy-Based Model. The MIT Press eBooks, 1137-1144. https://doi.org/10.7551/mitpress/7503.003.0147

Identifiers

DOI: 10.7551/mitpress/7503.003.0147