PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding

Xie, Saining; Gu, Jiatao; Guo, Demi; Qi, Charles R.; Guibas, Leonidas J.; Litany, Or

Computer Science > Computer Vision and Pattern Recognition

arXiv:2007.10985 (cs)

[Submitted on 21 Jul 2020 (v1), last revised 21 Nov 2020 (this version, v3)]

Title:PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding

Authors:Saining Xie, Jiatao Gu, Demi Guo, Charles R. Qi, Leonidas J. Guibas, Or Litany

View PDF

Abstract:Arguably one of the top success stories of deep learning is transfer learning. The finding that pre-training a network on a rich source set (eg., ImageNet) can help boost performance once fine-tuned on a usually much smaller target set, has been instrumental to many applications in language and vision. Yet, very little is known about its usefulness in 3D point cloud understanding. We see this as an opportunity considering the effort required for annotating data in 3D. In this work, we aim at facilitating research on 3D representation learning. Different from previous works, we focus on high-level scene understanding tasks. To this end, we select a suite of diverse datasets and tasks to measure the effect of unsupervised pre-training on a large source set of 3D scenes. Our findings are extremely encouraging: using a unified triplet of architecture, source dataset, and contrastive loss for pre-training, we achieve improvement over recent best results in segmentation and detection across 6 different benchmarks for indoor and outdoor, real and synthetic datasets -- demonstrating that the learned representation can generalize across domains. Furthermore, the improvement was similar to supervised pre-training, suggesting that future efforts should favor scaling data collection over more detailed annotation. We hope these findings will encourage more research on unsupervised pretext task design for 3D deep learning.

Comments:	ECCV 2020 (Spotlight); code available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2007.10985 [cs.CV]
	(or arXiv:2007.10985v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2007.10985

Submission history

From: Saining Xie [view email]
[v1] Tue, 21 Jul 2020 17:59:22 UTC (4,879 KB)
[v2] Wed, 22 Jul 2020 03:56:46 UTC (4,879 KB)
[v3] Sat, 21 Nov 2020 00:42:46 UTC (4,879 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators