Abstract

Unsupervised learning algorithms aim to discover the structure hidden in the data, and to learn representations that are more suitable as input to a supervised machine than the raw input. Many unsupervised methods are based on reconstructing the input from the representation, while constraining the representation to have certain desirable properties (e.g., low dimension, sparsity). Others are based on approximating the data density by stochastically reconstructing the input from the representation. We describe a novel and efficient algorithm for learning sparse representations, and compare it theoretically and experimentally with a similar machine trained probabilistically, namely a Restricted Boltzmann Machine. We propose a simple criterion for comparing and selecting unsupervised machines based on the trade-off between the reconstruction error and the information content of the representation. We demonstrate this method by extracting features from a dataset of handwritten numerals and from a dataset of natural image patches. We show that by stacking multiple levels of such machines and training them sequentially, high-order dependencies among the observed input variables can be captured.
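The learning scheme the abstract describes, reconstructing the input from a learned code while pushing the code toward sparsity, can be illustrated with a short program. The snippet below is a minimal, generic sparse autoencoder with an L1 penalty on the code; it is an illustrative sketch, not the exact machine the paper proposes, and the layer sizes, learning rate, and penalty weight `lam` are arbitrary assumptions.

```python
# Minimal sketch of sparse feature learning: reconstruct the input from a
# code z while an L1 penalty keeps z sparse. Generic illustration only; the
# architecture and hyperparameters here are assumptions, not the paper's.
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    def __init__(self, n_input=784, n_code=200):
        super().__init__()
        self.encoder = nn.Linear(n_input, n_code)
        self.decoder = nn.Linear(n_code, n_input)

    def forward(self, x):
        z = torch.sigmoid(self.encoder(x))  # the representation (code)
        x_hat = self.decoder(z)             # reconstruction of the input
        return x_hat, z

model = SparseAutoencoder()
opt = torch.optim.SGD(model.parameters(), lr=0.01)
lam = 1e-3  # trades reconstruction error against sparsity of the code

x = torch.rand(64, 784)  # stand-in batch, e.g. flattened 28x28 digit images
for step in range(100):
    x_hat, z = model(x)
    loss = ((x_hat - x) ** 2).mean() + lam * z.abs().mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
```

A deeper model of the kind mentioned in the abstract's last sentence would be built greedily: once this machine is trained, the codes z it produces on the training set become the input for a second machine of the same type, trained the same way.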

Keywords

Computer science, Artificial intelligence, Boltzmann machine, Restricted Boltzmann machine, Representation learning, Pattern recognition, Feature learning, Unsupervised learning, Feature (machine learning), Sparse approximation, Machine learning, Dimensionality reduction, Deep learning, Mathematics

Publication Info

Year: 2007
Type: article
Volume: 20
Pages: 1185–1192
Citations: 713
Access: Closed

Citation Metrics

713 citations (OpenAlex)

Cite This

Marc’Aurelio Ranzato, Y-Lan Boureau, Y. LeCun (2007). Sparse Feature Learning for Deep Belief Networks. Neural Information Processing Systems, 20, 1185–1192.