View TonyXuQAQ's full-sized avatar

Python 2,525 187 Updated Sep 26, 2024

[PAMI'23] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving; [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving

Python 1,118 186 Updated Jun 28, 2024

[ECCV 2022] Map-free Visual Relocalization: Metric Pose Relative to a Single Image

Python 245 18 Updated Aug 2, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 755 51 Updated Sep 13, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 4,055 373 Updated Oct 1, 2024

Talk2BEV: Language-Enhanced Bird's Eye View Maps (Accepted to ICRA'24)

Python 93 9 Updated Jan 29, 2024

FFmpeg Builds for yt-dlp

Shell 657 54 Updated Sep 30, 2024
Python 56 6 Updated Jun 28, 2024

LimSim & LimSim++: Integrated traffic and autonomous driving simulators with (M)LLM support

Python 389 31 Updated Sep 29, 2024

Open weights LLM from Google DeepMind.

Python 2,416 306 Updated Sep 20, 2024

[ECCV'24] Online Vectorized HD Map Construction using Geometry

Python 195 16 Updated Aug 28, 2024

Official implementations for paper: Anydoor: zero-shot object-level image customization

Python 3,946 359 Updated Apr 8, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 8,890 783 Updated Aug 7, 2024

Explorations of Using Python to play Grand Theft Auto 5.

Python 3,909 823 Updated Mar 8, 2023
C++ 123 23 Updated Dec 10, 2018

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Python 2,250 124 Updated Sep 17, 2024

[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding

Python 88 2 Updated Nov 20, 2023

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

6,333 383 Updated Jul 28, 2024

[ICLR 2024] Map Learning with Lane Segment for Autonomous Driving

Python 255 27 Updated Jul 19, 2024

[CoRL 2022] InterFuser: Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer

Python 529 46 Updated Jan 20, 2024

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,881 207 Updated Sep 25, 2024

A generative and self-guided robotic agent that endlessly proposes and masters new skills.

Python 561 49 Updated May 31, 2024

[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Jupyter Notebook 630 51 Updated Jul 7, 2024

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

896 48 Updated Sep 23, 2024

A curated list of awesome LLM for Autonomous Driving resources (continually updated)

919 47 Updated Sep 25, 2024

A publicly available dataset for road boundary detection in aerial images

Python 108 20 Updated Apr 12, 2024
Python 188 20 Updated Sep 4, 2023

Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection

Python 236 13 Updated Mar 15, 2023