Deformable Part Descriptors for Fine-Grained Recognition and Attribute Prediction

Abstract

Recognizing objects in fine-grained domains can be extremely challenging due to the subtle differences between subcategories. Discriminative markings are often highly localized, leading traditional object recognition approaches to struggle with the large pose variation often present in these domains. Pose-normalization seeks to align training exemplars, either piecewise by part or globally for the whole object, effectively factoring out differences in pose and in viewing angle. Prior approaches relied on computationally-expensive filter ensembles for part localization and required extensive supervision. This paper proposes two pose-normalized descriptors based on computationally-efficient deformable part models. The first leverages the semantics inherent in strongly-supervised DPM parts. The second exploits weak semantic annotations to learn cross-component correspondences, computing pose-normalized descriptors from the latent parts of a weakly-supervised DPM. These representations enable pooling across pose and viewpoint, in turn facilitating tasks such as fine-grained recognition and attribute prediction. Experiments conducted on the Caltech-UCSD Birds 200 dataset and Berkeley Human Attribute dataset demonstrate significant improvements of our approach over state-of-art algorithms.

Keywords

Computer scienceArtificial intelligenceDiscriminative modelNormalization (sociology)PoolingPattern recognition (psychology)Semantics (computer science)Machine learning

Affiliated Institutions

Related Publications

A discriminatively trained, multiscale, deformable part model

Pedro F. Felzenszwalb , David McAllester , Deva Ramanan

This paper describes a discriminatively trained, multiscale, deformable part model for object detection. Our system achieves a two-fold improvement in average precision over the...

2008 2857 citations

SUN attribute database: Discovering, annotating, and recognizing scene attributes

Geneviève Patterson , James Hays

In this paper we present the first large-scale scene attribute database. First, we perform crowd-sourced human studies to find a taxonomy of 102 discriminative attributes. Next,...

2012 857 citations

A Sparse Object Category Model for Efficient Learning and Exhaustive Recognition

Rob Fergus , Pietro Perona , Andrew Zisserman

We present a "parts and structure" model for object category recognition that can be learnt efficiently and in a semi-supervised manner: the model is learnt from example images ...

2005 266 citations

CNN Features Off-the-Shelf: An Astounding Baseline for Recognition

Ali Sharif Razavian , Hossein Azizpour , Josephine Sullivan +1 more

Recent results indicate that the generic descriptors ex-tracted from the convolutional neural networks are very powerful. This paper adds to the mounting evidence that this is i...

2014 4279 citations

CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features

Sangdoo Yun , Dongyoon Han , Sanghyuk Chun +3 more

Regional dropout strategies have been proposed to enhance performance of convolutional neural network classifiers. They have proved to be effective for guiding the model to atte...

2019 4293 citations

Publication Info

Year: 2013
Type: article
Pages: 729-736
Citations: 201
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Deformable Part Descriptors for Fine-Grained Recognition and Attribute Prediction

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

201

OpenAlex

Cite This

APA Style

                            
                                    Ning Zhang, 
                                
                                    Ryan Farrell, 
                                
                                    Forrest Iandola
                                
                                et al.
                            
                            (2013). 
                            Deformable Part Descriptors for Fine-Grained Recognition and Attribute Prediction. 
                            
                            , 729-736.
                            https://doi.org/10.1109/iccv.2013.96

Identifiers

DOI: 10.1109/iccv.2013.96