Goal-oriented Autonomous Driving

Hu, Yihan; Yang, Jiazhi; Chen, Li; Li, Keyu; Sima, Chonghao; Zhu, Xizhou; Chai, Siqi; Du, Senyao; Lin, Tianwei; Wang, Wenhai; Lu, Lewei; Jia, Xiaosong; Liu, Qiang; Dai, Jifeng; Qiao, Yu; Li, Hongyang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2212.10156v1 (cs)

[Submitted on 20 Dec 2022 (this version), latest version 23 Mar 2023 (v2)]

Title:Goal-oriented Autonomous Driving

Authors:Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima, Xizhou Zhu, Siqi Chai, Senyao Du, Tianwei Lin, Wenhai Wang, Lewei Lu, Xiaosong Jia, Qiang Liu, Jifeng Dai, Yu Qiao, Hongyang Li

View PDF

Abstract:Modern autonomous driving system is characterized as modular tasks in sequential order, i.e., perception, prediction and planning. As sensors and hardware get improved, there is trending popularity to devise a system that can perform a wide diversity of tasks to fulfill higher-level intelligence. Contemporary approaches resort to either deploying standalone models for individual tasks, or designing a multi-task paradigm with separate heads. These might suffer from accumulative error or negative transfer effect. Instead, we argue that a favorable algorithm framework should be devised and optimized in pursuit of the ultimate goal, i.e. planning of the self-driving-car. Oriented at this goal, we revisit the key components within perception and prediction. We analyze each module and prioritize the tasks hierarchically, such that all these tasks contribute to planning (the goal). To this end, we introduce Unified Autonomous Driving (UniAD), the first comprehensive framework up-to-date that incorporates full-stack driving tasks in one network. It is exquisitely devised to leverage advantages of each module, and provide complementary feature abstractions for agent interaction from a global perspective. Tasks are communicated with unified query design to facilitate each other toward planning. We instantiate UniAD on the challenging nuScenes benchmark. With extensive ablations, the effectiveness of using such a philosophy is proven to surpass previous state-of-the-arts by a large margin in all aspects. The full suite of codebase and models would be available to facilitate future research in the community.

Comments:	Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2212.10156 [cs.CV]
	(or arXiv:2212.10156v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2212.10156

Submission history

From: Li Chen [view email]
[v1] Tue, 20 Dec 2022 10:47:53 UTC (6,568 KB)
[v2] Thu, 23 Mar 2023 16:26:08 UTC (6,362 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Goal-oriented Autonomous Driving

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Goal-oriented Autonomous Driving

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators