ReasonNet: End-to-End Driving with Temporal and Global Reasoning

Shao, Hao; Wang, Letian; Chen, Ruobing; Waslander, Steven L.; Li, Hongsheng; Liu, Yu

Computer Science > Computer Vision and Pattern Recognition

arXiv:2305.10507 (cs)

[Submitted on 17 May 2023]

Title:ReasonNet: End-to-End Driving with Temporal and Global Reasoning

Authors:Hao Shao, Letian Wang, Ruobing Chen, Steven L. Waslander, Hongsheng Li, Yu Liu

View PDF

Abstract:The large-scale deployment of autonomous vehicles is yet to come, and one of the major remaining challenges lies in urban dense traffic scenarios. In such cases, it remains challenging to predict the future evolution of the scene and future behaviors of objects, and to deal with rare adverse events such as the sudden appearance of occluded objects. In this paper, we present ReasonNet, a novel end-to-end driving framework that extensively exploits both temporal and global information of the driving scene. By reasoning on the temporal behavior of objects, our method can effectively process the interactions and relationships among features in different frames. Reasoning about the global information of the scene can also improve overall perception performance and benefit the detection of adverse events, especially the anticipation of potential danger from occluded objects. For comprehensive evaluation on occlusion events, we also release publicly a driving simulation benchmark DriveOcclusionSim consisting of diverse occlusion events. We conduct extensive experiments on multiple CARLA benchmarks, where our model outperforms all prior methods, ranking first on the sensor track of the public CARLA Leaderboard.

Comments:	CVPR 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2305.10507 [cs.CV]
	(or arXiv:2305.10507v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.10507

Submission history

From: Hao Shao [view email]
[v1] Wed, 17 May 2023 18:24:43 UTC (7,216 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:ReasonNet: End-to-End Driving with Temporal and Global Reasoning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:ReasonNet: End-to-End Driving with Temporal and Global Reasoning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators