Is Pseudo-Lidar needed for Monocular 3D Object detection?

Park, Dennis; Ambrus, Rares; Guizilini, Vitor; Li, Jie; Gaidon, Adrien

Computer Science > Computer Vision and Pattern Recognition

arXiv:2108.06417 (cs)

[Submitted on 13 Aug 2021]

Title:Is Pseudo-Lidar needed for Monocular 3D Object detection?

Authors:Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, Adrien Gaidon

View PDF

Abstract:Recent progress in 3D object detection from single images leverages monocular depth estimation as a way to produce 3D pointclouds, turning cameras into pseudo-lidar sensors. These two-stage detectors improve with the accuracy of the intermediate depth estimation network, which can itself be improved without manual labels via large-scale self-supervised learning. However, they tend to suffer from overfitting more than end-to-end methods, are more complex, and the gap with similar lidar-based detectors remains significant. In this work, we propose an end-to-end, single stage, monocular 3D object detector, DD3D, that can benefit from depth pre-training like pseudo-lidar methods, but without their limitations. Our architecture is designed for effective information transfer between depth estimation and 3D detection, allowing us to scale with the amount of unlabeled pre-training data. Our method achieves state-of-the-art results on two challenging benchmarks, with 16.34% and 9.28% AP for Cars and Pedestrians (respectively) on the KITTI-3D benchmark, and 41.5% mAP on NuScenes.

Comments:	In Proceedings of the ICCV 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2108.06417 [cs.CV]
	(or arXiv:2108.06417v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2108.06417

Submission history

From: Dennis Park [view email]
[v1] Fri, 13 Aug 2021 22:22:51 UTC (16,464 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2021-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Rares Ambrus
Vitor Guizilini
Jie Li
Adrien Gaidon

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Is Pseudo-Lidar needed for Monocular 3D Object detection?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Is Pseudo-Lidar needed for Monocular 3D Object detection?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators