From Event: SPIE Optical Engineering + Applications, 2018
Visual object tracking plays a significant role in our daily life such as intelligent transportation and surveillance. However, an accurate and robust object tracker is hard to be obtained as target objects often go through huge appearance changes caused by deformation, abrupt motion, background clutter and occlusion. In this paper, we combine features extracted from deep convolutional neural networks pretrained on object recognition datasets with color name features and histogram of oriented gradient features skillfully to improve tracking accuracy and robustness. The outputs of the convolutional layers encode the senior semantic information of targets and such representations are robust to great appearance variations while their spatial resolution is too coarse to precisely locate targets. In contrast, color name features connected at the back of HOG features could provide more precise localization but are less invariant to appearance changes. We first infer the response of the convolutional features and HOG-CN features respectively, then make a linear combination of them. The maximum value of the result could represent the accurate localization of the target. We not only compare the tracking results of adopting a single feature alone, showing that the performance of them is inferior to ours, but also analyze the effect of exploiting features extracted from different convolutional layers on the tracking performance. What’s more, we introduce the adaptive target response map in our tracking algorithm to keep the target from drifting as much as possible. Extensive experimental results on a large scale benchmark dataset illustrates outstanding performance of the proposed algorithm.
Zhi Cai, Liquan Dong, Ming Liu, Yuejin Zhao, Haoyuan Du, Ruifeng Yuan, and Feilong Ma, "Visual tracking with kernelized correlation filters based on multiple features," Proc. SPIE 10751, Optics and Photonics for Information Processing XII, 107510O (Presented at SPIE Optical Engineering + Applications: August 20, 2018; Published: 7 September 2018); https://doi.org/10.1117/12.2319715.
Conference Presentations are recordings of oral presentations given at SPIE conferences and published as part of the proceedings. They include the speaker's narration with video of the slides and animations. Most include full-text papers. Interactive, searchable transcripts and closed captioning are now available for most presentations.
Search our growing collection of more than 18,000 conference presentations, including many plenaries and keynotes.