A discriminatively trained, multiscale, deformable part model

Abstract

This paper describes a discriminatively trained, multiscale, deformable part model for object detection. Our system achieves a two-fold improvement in average precision over the best performance in the 2006 PASCAL person detection challenge. It also outperforms the best results in the 2007 challenge in ten out of twenty categories. The system relies heavily on deformable parts. While deformable part models have become quite popular, their value had not been demonstrated on difficult benchmarks such as the PASCAL challenge. Our system also relies heavily on new methods for discriminative training. We combine a margin-sensitive approach for data mining hard negative examples with a formalism we call latent SVM. A latent SVM, like a hidden CRF, leads to a non-convex training problem. However, a latent SVM is semi-convex and the training problem becomes convex once latent information is specified for the positive examples. We believe that our training methods will eventually make possible the effective use of more latent information such as hierarchical (grammar) models and models involving latent three-dimensional pose.
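The semi-convexity claim can be illustrated with a small sketch. The abstract does not give the formulation, so the following assumes the common form in which an example is scored by the maximum over latent choices z of a linear function w · phi(x, z), trained with an L2-regularized hinge loss. The function names, the `fixed_z` argument, and the toy feature layout are hypothetical and chosen only for illustration; this is not the authors' implementation.

```python
def dot(u, v):
    """Inner product of two equal-length sequences of numbers."""
    return sum(a * b for a, b in zip(u, v))


def latent_score(w, feats):
    """Return (score, z) where score = max over z of w . phi(x, z).

    feats is a list of feature vectors, one per candidate latent value z
    (an assumed layout for this sketch).
    """
    scores = [dot(w, phi) for phi in feats]
    best = max(range(len(scores)), key=scores.__getitem__)
    return scores[best], best


def latent_hinge_objective(w, examples, labels, C=1.0, fixed_z=None):
    """0.5 * ||w||^2 + C * sum of hinge losses over all examples.

    fixed_z optionally maps the index of a positive example to a chosen
    latent value. Such an example is scored with that single feature
    vector, so its score is linear in w and its hinge loss is convex.
    """
    total = 0.5 * dot(w, w)
    for i, (feats, y) in enumerate(zip(examples, labels)):
        if fixed_z is not None and y > 0 and i in fixed_z:
            score = dot(w, feats[fixed_z[i]])
        else:
            score, _ = latent_score(w, feats)
        total += C * max(0.0, 1.0 - y * score)
    return total
```

Under these assumptions, the loss on a negative example is a maximum of linear functions of w and therefore convex, while the loss on a positive example involves the negated maximum and is not. Passing `fixed_z` for every positive example removes the maximum from the positive terms, which is the sense in which the problem becomes convex once latent information is specified for the positives.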

Keywords

Computer science, Pascal (unit), Artificial intelligence, Discriminative model, Support vector machine, Machine learning, Regular polygon, Pattern recognition (psychology), Training set, Margin (machine learning), Mathematics

Related Publications

Object Detection with Discriminatively Trained Part-Based Models

Pedro F. Felzenszwalb, Ross Girshick, David McAllester +1 more

We describe an object detection system based on mixtures of multiscale deformable part models. Our system is able to represent highly variable object classes and achieves state-...

2009 IEEE Transactions on Pattern Analysis... 9911 citations

Deformable Part Descriptors for Fine-Grained Recognition and Attribute Prediction

Ning Zhang, Ryan Farrell, Forrest Iandola +1 more

Recognizing objects in fine-grained domains can be extremely challenging due to the subtle differences between subcategories. Discriminative markings are often highly localized,...

2013 201 citations

CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features

Sangdoo Yun, Dongyoon Han, Sanghyuk Chun +3 more

Regional dropout strategies have been proposed to enhance performance of convolutional neural network classifiers. They have proved to be effective for guiding the model to atte...

2019 4293 citations

Unsupervised Feature Learning via Non-parametric Instance Discrimination

Zhirong Wu, Yuanjun Xiong, Stella X. Yu +1 more

Neural net classifiers trained on data with annotated class labels can also capture apparent visual similarity among categories without being directed to do so. We study whether...

2018 3435 citations

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. T...

2024 arXiv (Cornell University) 15635 citations

Publication Info

Year: 2008
Type: article
Pages: 1-8
Citations: 2857
Access: Closed

Citation Metrics

2857 (OpenAlex)

Cite This

APA Style

Pedro F. Felzenszwalb, David McAllester, Deva Ramanan (2008). A discriminatively trained, multiscale, deformable part model. 1-8. https://doi.org/10.1109/cvpr.2008.4587597

Identifiers

DOI: 10.1109/cvpr.2008.4587597