Abstract

In this paper, we compare the performance of descriptors computed for local interest regions, as, for example, extracted by the Harris-Affine detector. Many different descriptors have been proposed in the literature. It is unclear which descriptors are more appropriate and how their performance depends on the interest region detector. The descriptors should be distinctive and at the same time robust to changes in viewing conditions as well as to errors of the detector. Our evaluation uses as criterion recall with respect to precision and is carried out for different image transformations. We compare shape context, steerable filters, PCA-SIFT, differential invariants, spin images, SIFT, complex filters, moment invariants, and cross-correlation for different types of interest regions. We also propose an extension of the SIFT descriptor and show that it outperforms the original method. Furthermore, we observe that the ranking of the descriptors is mostly independent of the interest region detector and that the SIFT-based descriptors perform best. Moments and steerable filters show the best performance among the low dimensional descriptors.

Keywords

Scale-invariant feature transformArtificial intelligencePattern recognition (psychology)Computer scienceAffine transformationDetectorContext (archaeology)MathematicsComputer visionFeature extraction

Affiliated Institutions

Related Publications

Publication Info

Year
2005
Type
article
Volume
27
Issue
10
Pages
1615-1630
Citations
6674
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

6674
OpenAlex

Cite This

Krystian Mikolajczyk, C. Schmid (2005). A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence , 27 (10) , 1615-1630. https://doi.org/10.1109/tpami.2005.188

Identifiers

DOI
10.1109/tpami.2005.188