Abstract

We present a fast and accurate algorithm for computing the 2D pose of objects in images called cascaded pose regression (CPR). CPR progressively refines a loosely specified initial guess, where each refinement is carried out by a different regressor. Each regressor performs simple image measurements that are dependent on the output of the previous regressors; the entire system is automatically learned from human annotated training examples. CPR is not restricted to rigid transformations: `pose' is any parameterized variation of the object's appearance such as the degrees of freedom of deformable and articulated objects. We compare CPR against both standard regression techniques and human performance (computed from redundant human annotations). Experiments on three diverse datasets (mice, faces, fish) suggest CPR is fast (2-3ms per pose estimate), accurate (approaching human performance), and easy to train from small amounts of labeled data.

Keywords

Computer scienceParameterized complexityArtificial intelligencePoseRegressionObject (grammar)Degrees of freedom (physics and chemistry)Computer visionPattern recognition (psychology)Variation (astronomy)3D pose estimationMachine learningAlgorithmMathematicsStatistics

Affiliated Institutions

Related Publications

Publication Info

Year
2010
Type
article
Pages
1078-1085
Citations
548
Access
Closed

External Links

Social Impact

Altmetric
PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

548
OpenAlex

Cite This

Piotr Dollár, Peter Welinder, Pietro Perona (2010). Cascaded pose regression. , 1078-1085. https://doi.org/10.1109/cvpr.2010.5540094

Identifiers

DOI
10.1109/cvpr.2010.5540094