Skip to content
View ifzhang's full-sized avatar
🐶
Focusing
🐶
Focusing

Organizations

@hustvl
Block or Report

Block or report ifzhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repo for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"

82 3 Updated Aug 2, 2024

Multimodal Models in Real World

Jupyter Notebook 347 17 Updated Jul 12, 2024

OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 201 4 Updated Jul 9, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,102 39 Updated Jul 14, 2024

A Generalizable World Model for Autonomous Driving

Python 424 22 Updated Jun 17, 2024
Python 78 1 Updated Jun 17, 2024

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

Python 102 3 Updated May 29, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,898 295 Updated Jul 16, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,680 171 Updated Aug 2, 2024

[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Jupyter Notebook 574 52 Updated Jul 7, 2024

A method that can match the 3D point cloud sub-map generated by the robot during the SLAM process with the 2D map.

Python 14 3 Updated Oct 4, 2022
HTML 7 1 Updated Aug 26, 2023

[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"

Python 138 4 Updated Apr 18, 2024

[ICRA'2024] Rethinking Imitation-based Planner for Autonomous Driving

Python 167 11 Updated Jul 11, 2024

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Jupyter Notebook 1,978 155 Updated Jul 29, 2024

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)

Python 604 32 Updated Jul 12, 2024

Layout-Guided multi-view driving scene video generation with latent diffusion model

Python 522 11 Updated Dec 15, 2023

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,567 263 Updated Jun 2, 2024

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 2,763 341 Updated May 8, 2024

[IEEE T-PAMI] All you need for End-to-end Autonomous Driving

1,839 181 Updated Jul 29, 2024

[CoRL'23] Parting with Misconceptions about Learning-based Vehicle Motion Planning

Python 467 51 Updated Mar 27, 2024

PyTorch code for the paper "Model-Based Imitation Learning for Urban Driving".

Python 340 32 Updated Apr 21, 2023

A curated list of awesome End-to-End Autonomous Driving resources (continually updated)

362 18 Updated Aug 13, 2023

[Information Fusion (Vol.103, Mar. '24)] Boosting Image Matting with Pretrained Plain Vision Transformers

Python 300 32 Updated May 24, 2024

[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation

Python 342 16 Updated Sep 19, 2023
Python 163 11 Updated Dec 20, 2023

Multi-Modal 3D Object Detection by Box Matching

51 1 Updated May 16, 2023

[CVPR 2023] Pytorch implementation of ThinkTwice, a SOTA Decoder for End-to-end Autonomous Driving under BEV.

Python 192 19 Updated Mar 4, 2024

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,339 470 Updated May 31, 2024
Next