Tai-Wang
💭
I may be slow to respond.

Highlights

  • Pro

Organizations

@open-mmlab

GRUtopia: Dream General Robots in a City at Scale

Python 253 7 Updated Jul 18, 2024

Code & Data for Grounded 3D-LLM with Referent Tokens

Python 60 Updated Jul 1, 2024

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 30+ benchmarks

Python 721 85 Updated Jul 21, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal chat model with performance approaching GPT-4o

Python 4,276 325 Updated Jul 21, 2024

Official implementation of the paper "PACER+: On-Demand Pedestrian Animation Controller in Driving Scenarios" (CVPR 2024).

Python 46 Updated Jun 25, 2024

Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"

Python 927 63 Updated Jun 6, 2024

[CVPR 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

Python 389 25 Updated Jun 14, 2024

Learning-based locomotion control from OpenRobotLab, including Hybrid Internal Model & H-Infinity Locomotion Control

Python 218 23 Updated May 14, 2024

[NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation

Python 65 6 Updated Jun 24, 2024

An Open-source Framework for Autonomous Language Agents

Python 4,923 382 Updated Jul 18, 2024

[ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts

Python 142 5 Updated May 31, 2024

[ECCV 2024] PointLLM: Empowering Large Language Models to Understand Point Clouds

Python 443 22 Updated Jul 8, 2024

[ICCV 2023] GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-View 3D Understanding

Python 42 Updated Aug 28, 2023

[ECCV 2024] DriveLM: Driving with Graph Visual Question Answering

HTML 726 45 Updated Jul 19, 2024

A lightweight framework for building LLM-based agents

Python 1,068 107 Updated Jul 19, 2024

3D Occupancy Prediction Benchmark in Autonomous Driving

Python 277 20 Updated May 27, 2024
Python 413 37 Updated Jan 31, 2024

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Python 480 25 Updated Jun 11, 2024

Official release of InternLM2.5 7B base and chat models. 1M context support

Python 5,858 418 Updated Jul 19, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,604 3,432 Updated May 18, 2024

Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities

C++ 9,214 4,691 Updated May 15, 2024

[ICCV 2023] OccNet: Scene as Occupancy

Python 526 47 Updated Jul 19, 2024

Multimodal-GPT

Python 1,444 119 Updated Jun 4, 2023

An open-source tool-augmented conversational language model from Fudan University

Python 11,886 1,146 Updated Jul 13, 2024

Topology Reasoning for Scene Perception in Autonomous Driving

Python 259 9 Updated Apr 30, 2024

[CoRL 2023] DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and Tracking

Python 63 1 Updated Jan 21, 2024

[IJCV 2024] P3Former: Position-Guided Point Cloud Panoptic Segmentation Transformer

Python 73 9 Updated Apr 2, 2024

[CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training

47 2 Updated Jun 4, 2023

A collaboration friendly studio for NeRFs

Python 8,968 1,201 Updated Jul 19, 2024