Skip to content
View zhanghm1995's full-sized avatar
🎯
I maybe slow to respond :)
🎯
I maybe slow to respond :)
Block or Report

Block or report zhanghm1995

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

VGGSfM: Visual Geometry Grounded Deep Structure From Motion

Python 707 38 Updated Jul 29, 2024

Official PyTorch implementation of D^2-World as the second place of CVPR 2024 Predictive World Model Challenge.

3 Updated Jun 7, 2024
Python 17 4 Updated Jul 1, 2024

Reasoning 3D Segmentation - "segment anything"/grounding/part seperation in 3D with natural conversations.

Python 73 10 Updated May 30, 2024

DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus

89 1 Updated May 25, 2024

[CVPR 2024 Oral, Best Paper Award Candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness"

Python 128 10 Updated Jul 29, 2024
Jupyter Notebook 249 22 Updated Jun 25, 2024

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python 2,189 230 Updated Jun 28, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 20,963 1,988 Updated Jul 25, 2024

[CVPR'24] DNGaussian: Optimizing Sparse-View 3D Gaussian Radiance Fields with Global-Local Depth Normalization

Python 214 14 Updated Jun 27, 2024

[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration

Python 284 15 Updated Jul 27, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,190 493 Updated Jul 11, 2024

(AAAI2024) Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models

Python 41 2 Updated May 19, 2024

A curated list of awesome world models for autonomous driving (continually updated)

9 1 Updated Dec 28, 2023
Shell 1 Updated Apr 6, 2024

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

209 8 Updated Jul 1, 2024

[ECCV 2024] 3D World Model for Autonomous Driving

Python 305 17 Updated Apr 12, 2024

[ICRA 2024] RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision. (Former version: UniOcc)

Python 411 23 Updated Jan 17, 2024

Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)

Python 1,387 152 Updated Jul 6, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,520 459 Updated Jul 29, 2024

A JAX-based simulator for autonomous driving research.

Python 806 85 Updated Mar 22, 2024

[ICLR 2024] Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting

Python 530 36 Updated Apr 6, 2024

Talk2BEV: Language-Enhanced Bird's Eye View Maps (Accepted to ICRA'24)

Python 85 8 Updated Jan 29, 2024

An Invitation to 3D Vision: A Tutorial for Everyone

CMake 1,433 280 Updated May 6, 2024

Web-based 3D visualization + Python

Python 619 34 Updated Jul 29, 2024
Python 141 12 Updated Nov 3, 2023

Meta-Transformer for Unified Multimodal Learning

Python 1,475 113 Updated Dec 5, 2023

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Jupyter Notebook 2,915 198 Updated Mar 9, 2024
Next