TensorMask: A Foundation for Dense Object Segmentation

Chen, Xinlei; Girshick, Ross; He, Kaiming; Dollár, Piotr

arXiv:1903.12174 (cs)

[Submitted on 28 Mar 2019 (v1), last revised 27 Aug 2019 (this version, v2)]

Abstract: Sliding-window object detectors that generate bounding-box object predictions over a dense, regular grid have advanced rapidly and proven popular. In contrast, modern instance segmentation approaches are dominated by methods that first detect object bounding boxes, and then crop and segment these regions, as popularized by Mask R-CNN. In this work, we investigate the paradigm of dense sliding-window instance segmentation, which is surprisingly under-explored. Our core observation is that this task is fundamentally different than other dense prediction tasks such as semantic segmentation or bounding-box object detection, as the output at every spatial location is itself a geometric structure with its own spatial dimensions. To formalize this, we treat dense instance segmentation as a prediction task over 4D tensors and present a general framework called TensorMask that explicitly captures this geometry and enables novel operators on 4D tensors. We demonstrate that the tensor view leads to large gains over baselines that ignore this structure, and leads to results comparable to Mask R-CNN. These promising results suggest that TensorMask can serve as a foundation for novel advances in dense mask prediction and a more complete understanding of the task. Code will be made available.
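
The abstract's central point, that the output at every spatial location is itself a small 2D structure, can be illustrated with a toy example. The following is a minimal sketch, not the authors' implementation: the function names `dense_window_masks` and `window_at`, the (V, U, H, W) index order, the unit stride, and the zero padding outside the image are all assumptions made for illustration. It builds a 4D nested list `T` in which `T[v][u][y][x]` holds the mask value at offset `(v - V//2, u - U//2)` from location `(y, x)`, so fixing `(y, x)` yields a V x U window mask.

```python
def dense_window_masks(instance_map, V, U):
    """Build a 4D nested list T indexed as T[v][u][y][x] (hypothetical layout).

    T[v][u][y][x] is the value of instance_map at
    (y + v - V//2, x + u - U//2), or 0 if that position is outside the image.
    """
    H = len(instance_map)
    W = len(instance_map[0])
    T = [[[[0] * W for _ in range(H)] for _ in range(U)] for _ in range(V)]
    for v in range(V):
        for u in range(U):
            dy, dx = v - V // 2, u - U // 2  # offset of this window cell
            for y in range(H):
                for x in range(W):
                    yy, xx = y + dy, x + dx
                    if 0 <= yy < H and 0 <= xx < W:
                        T[v][u][y][x] = instance_map[yy][xx]
    return T


def window_at(T, y, x):
    """Slice out the V x U window mask predicted at location (y, x)."""
    V, U = len(T), len(T[0])
    return [[T[v][u][y][x] for u in range(U)] for v in range(V)]


# Toy 4x4 image containing one 2x2 object.
image = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
T = dense_window_masks(image, V=3, U=3)
print(window_at(T, 1, 1))  # 3x3 window centred on the object's top-left pixel
```

A real dense instance segmenter would predict one instance's mask per window rather than copy a shared binary map; the single-object image here only serves to show how the two window axes (V, U) sit alongside the two image axes (H, W).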

Comments:	accepted to ICCV
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1903.12174 [cs.CV]
	(or arXiv:1903.12174v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1903.12174

Submission history

From: Xinlei Chen
[v1] Thu, 28 Mar 2019 17:59:33 UTC (9,328 KB)
[v2] Tue, 27 Aug 2019 22:59:25 UTC (9,243 KB)
