Abstract

Recently, Siamese networks have drawn great attention in visual tracking community because of their balanced accuracy and speed. However, features used in most Siamese tracking approaches can only discriminate foreground from the non-semantic backgrounds. The semantic backgrounds are always considered as distractors, which hinders the robustness of Siamese trackers. In this paper, we focus on learning distractor-aware Siamese networks for accurate and long-term tracking. To this end, features used in traditional Siamese trackers are analyzed at first. We observe that the imbalanced distribution of training data makes the learned features less discriminative. During the off-line training phase, an effective sampling strategy is introduced to control this distribution and make the model focus on the semantic distractors. During inference, a novel distractor-aware module is designed to perform incremental learning, which can effectively transfer the general embedding to the current video domain. In addition, we extend the proposed approach for long-term tracking by introducing a simple yet effective local-to-global search region strategy. Extensive experiments on benchmarks show that our approach significantly outperforms the state-of-the-arts, yielding 9.6% relative gain in VOT2016 dataset and 35.9% relative gain in UAV20L dataset. The proposed tracker can perform at 160 FPS on short-term benchmarks and 110 FPS on long-term benchmarks.

Keywords

Computer scienceBitTorrent trackerArtificial intelligenceDiscriminative modelRobustness (evolution)Focus (optics)InferenceEmbeddingComputer visionTerm (time)Eye trackingMachine learning

Affiliated Institutions

Related Publications

Publication Info

Year
2018
Type
book-chapter
Pages
103-119
Citations
1479
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1479
OpenAlex
188
Influential

Cite This

Zheng Zhu, Qiang Wang, Bo Li et al. (2018). Distractor-Aware Siamese Networks for Visual Object Tracking. Lecture notes in computer science , 103-119. https://doi.org/10.1007/978-3-030-01240-3_7

Identifiers

DOI
10.1007/978-3-030-01240-3_7
PMID
41321684
PMCID
PMC12657676
arXiv
1808.06048

Data Quality

Data completeness: 79%