NeRF-Supervision: Learning Dense Object Descriptors from Neural Radiance Fields

Yen-Chen, Lin; Florence, Pete; Barron, Jonathan T.; Lin, Tsung-Yi; Rodriguez, Alberto; Isola, Phillip

Computer Science > Robotics

arXiv:2203.01913 (cs)

[Submitted on 3 Mar 2022 (v1), last revised 27 Apr 2022 (this version, v2)]

Title:NeRF-Supervision: Learning Dense Object Descriptors from Neural Radiance Fields

Authors:Lin Yen-Chen, Pete Florence, Jonathan T. Barron, Tsung-Yi Lin, Alberto Rodriguez, Phillip Isola

View PDF

Abstract:Thin, reflective objects such as forks and whisks are common in our daily lives, but they are particularly challenging for robot perception because it is hard to reconstruct them using commodity RGB-D cameras or multi-view stereo techniques. While traditional pipelines struggle with objects like these, Neural Radiance Fields (NeRFs) have recently been shown to be remarkably effective for performing view synthesis on objects with thin structures or reflective materials. In this paper we explore the use of NeRF as a new source of supervision for robust robot vision systems. In particular, we demonstrate that a NeRF representation of a scene can be used to train dense object descriptors. We use an optimized NeRF to extract dense correspondences between multiple views of an object, and then use these correspondences as training data for learning a view-invariant representation of the object. NeRF's usage of a density field allows us to reformulate the correspondence problem with a novel distribution-of-depths formulation, as opposed to the conventional approach of using a depth map. Dense correspondence models supervised with our method significantly outperform off-the-shelf learned descriptors by 106% (PCK@3px metric, more than doubling performance) and outperform our baseline supervised with multi-view stereo by 29%. Furthermore, we demonstrate the learned dense descriptors enable robots to perform accurate 6-degree of freedom (6-DoF) pick and place of thin and reflective objects.

Comments:	ICRA 2022, Website: this https URL
Subjects:	Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2203.01913 [cs.RO]
	(or arXiv:2203.01913v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2203.01913

Submission history

From: Yen-Chen Lin [view email]
[v1] Thu, 3 Mar 2022 18:49:57 UTC (38,945 KB)
[v2] Wed, 27 Apr 2022 16:55:51 UTC (38,945 KB)

Computer Science > Robotics

Title:NeRF-Supervision: Learning Dense Object Descriptors from Neural Radiance Fields

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:NeRF-Supervision: Learning Dense Object Descriptors from Neural Radiance Fields

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators