Learning hierarchical representations for face verification with convolutional deep belief networks

Abstract

Most modern face recognition systems rely on a feature representation given by a hand-crafted image descriptor, such as Local Binary Patterns (LBP), and achieve improved performance by combining several such representations. In this paper, we propose deep learning as a natural source for obtaining additional, complementary representations. To learn features in high-resolution images, we make use of convolutional deep belief networks. Moreover, to take advantage of global structure in an object class, we develop local convolutional restricted Boltzmann machines, a novel convolutional learning model that exploits the global structure by not assuming stationarity of features across the image, while maintaining scalability and robustness to small misalignments. We also present a novel application of deep learning to descriptors other than pixel intensity values, such as LBP. In addition, we compare performance of networks trained using unsupervised learning against networks with random filters, and empirically show that learning weights not only is necessary for obtaining good multilayer representations, but also provides robustness to the choice of the network architecture parameters. Finally, we show that a recognition system using only representations obtained from deep learning can achieve comparable accuracy with a system using a combination of hand-crafted image descriptors. Moreover, by combining these representations, we achieve state-of-the-art results on a real-world face verification database.

Keywords

Artificial intelligenceComputer scienceRobustness (evolution)Convolutional neural networkPattern recognition (psychology)Feature learningDeep learningScalabilityFeature extractionMachine learning

Affiliated Institutions

Related Publications

Learning to Align from Scratch

Gary B. Huang , Marwan Mattar , Honglak Lee +1 more

Unsupervised joint alignment of images has been demonstrated to improve performance on recognition tasks such as face verification. Such alignment reduces undesired variability ...

2012 CORE Scholar (Wright State University) 248 citations

Receptive Field Block Net for Accurate and Fast Object Detection

Songtao Liu , Di Huang , Yunhong Wang

Current top-performing object detectors depend on deep CNN backbones, such as ResNet-101 and Inception, benefiting from their powerful feature representations but suffering from...

2018 Lecture notes in computer science 1687 citations

CosFace: Large Margin Cosine Loss for Deep Face Recognition

Hao Wang , Yitong Wang , Zheng Zhou +5 more

Face recognition has made extraordinary progress owing to the advancement of deep convolutional neural networks (CNNs). The central task of face recognition, including face veri...

2018 2018 IEEE/CVF Conference on Computer ... 2715 citations

Fusing Robust Face Region Descriptors via Multiple Metric Learning for Face Recognition in the Wild

Zhen Cui , Li Wen , Dong Xu +2 more

In many real-world face recognition scenarios, face images can hardly be aligned accurately due to complex appearance variations or low-quality images. To address this issue, we...

2013 194 citations

Deep convolutional neural fields for depth estimation from a single image

Fayao Liu , Chunhua Shen , Guosheng Lin

We consider the problem of depth estimation from a sin- gle monocular image in this work. It is a challenging task as no reliable depth cues are available, e.g., stereo corre- s...

2015 863 citations

Publication Info

Year: 2012
Type: article
Pages: 2518-2525
Citations: 412
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Learning hierarchical representations for face verification with convolutional deep belief networks

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

412

OpenAlex

Cite This

APA Style

                            
                                    Guoyang Huang, 
                                
                                    Honglak Lee, 
                                
                                    Erik Learned-Miller
                                
                            (2012). 
                            Learning hierarchical representations for face verification with convolutional deep belief networks. 
                            
                            , 2518-2525.
                            https://doi.org/10.1109/cvpr.2012.6247968

Identifiers

DOI: 10.1109/cvpr.2012.6247968