A Discriminative Single-Shot Segmentation Network for Visual Object Tracking

Lukežič, Alan; Matas, Jiří; Kristan, Matej

Computer Science > Computer Vision and Pattern Recognition

arXiv:2112.11846 (cs)

[Submitted on 22 Dec 2021 (v1), last revised 27 Dec 2021 (this version, v2)]

Title:A Discriminative Single-Shot Segmentation Network for Visual Object Tracking

Authors:Alan Lukežič, Jiří Matas, Matej Kristan

View PDF

Abstract:Template-based discriminative trackers are currently the dominant tracking paradigm due to their robustness, but are restricted to bounding box tracking and a limited range of transformation models, which reduces their localization accuracy. We propose a discriminative single-shot segmentation tracker -- D3S2, which narrows the gap between visual object tracking and video object segmentation. A single-shot network applies two target models with complementary geometric properties, one invariant to a broad range of transformations, including non-rigid deformations, the other assuming a rigid object to simultaneously achieve robust online target segmentation. The overall tracking reliability is further increased by decoupling the object and feature scale estimation. Without per-dataset finetuning, and trained only for segmentation as the primary output, D3S2 outperforms all published trackers on the recent short-term tracking benchmark VOT2020 and performs very close to the state-of-the-art trackers on the GOT-10k, TrackingNet, OTB100 and LaSoT. D3S2 outperforms the leading segmentation tracker SiamMask on video object segmentation benchmarks and performs on par with top video object segmentation algorithms.

Comments:	Extended version of the D3S tracker (CVPR2020). Accepted to IEEE TPAMI. arXiv admin note: substantial text overlap with arXiv:1911.08862
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2112.11846 [cs.CV]
	(or arXiv:2112.11846v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2112.11846

Submission history

From: Alan Lukezic [view email]
[v1] Wed, 22 Dec 2021 12:48:51 UTC (6,417 KB)
[v2] Mon, 27 Dec 2021 08:08:02 UTC (6,417 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:A Discriminative Single-Shot Segmentation Network for Visual Object Tracking

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:A Discriminative Single-Shot Segmentation Network for Visual Object Tracking

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators