Abstract

Adaptive tracking-by-detection methods are widely used in computer vision for tracking arbitrary objects. Current approaches treat the tracking problem as a classification task and use online learning techniques to update the object model. However, for these updates to happen one needs to convert the estimated object position into a set of labelled training examples, and it is not clear how best to perform this intermediate step. Furthermore, the objective for the classifier (label prediction) is not explicitly coupled to the objective for the tracker (accurate estimation of object position). In this paper, we present a framework for adaptive visual object tracking based on structured output prediction. By explicitly allowing the output space to express the needs of the tracker, we are able to avoid the need for an intermediate classification step. Our method uses a kernelized structured output support vector machine (SVM), which is learned online to provide adaptive tracking. To allow for real-time application, we introduce a budgeting mechanism which prevents the unbounded growth in the number of support vectors which would otherwise occur during tracking. Experimentally, we show that our algorithm is able to outperform state-of-the-art trackers on various benchmark videos. Additionally, we show that we can easily incorporate additional features and kernels into our framework, which results in increased performance.

Keywords

BitTorrent trackerComputer scienceArtificial intelligenceClassifier (UML)Benchmark (surveying)Support vector machineVideo trackingEye trackingTracking (education)Computer visionObject detectionStructured support vector machineMachine learningTracking systemObject (grammar)Pattern recognition (psychology)Kalman filter

Affiliated Institutions

Related Publications

A survey of kernels for structured data

Kernel methods in general and support vector machines in particular have been successful in various learning tasks on data represented in a single table. Much 'real-world' data,...

2003 ACM SIGKDD Explorations Newsletter 445 citations

Publication Info

Year
2011
Type
article
Pages
263-270
Citations
1832
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1832
OpenAlex

Cite This

Sam Hare, Amir Saffari, Philip H. S. Torr (2011). Struck: Structured output tracking with kernels. , 263-270. https://doi.org/10.1109/iccv.2011.6126251

Identifiers

DOI
10.1109/iccv.2011.6126251