Stars
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
A research paper on generative cartoon interpolation
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
A curated list of world models for autonomous driving. Keep updated.
A new one shot face swap approach for image and video domains
A Generalizable World Model for Autonomous Driving
[ICCV'23] Hidden Biases of End-to-End Driving Models
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
The code and dataset for the vicrop paper
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
MoVA: Adapting Mixture of Vision Experts to Multimodal Context
👾 Fast and simple video download library and CLI tool written in Go
Open-Sora: Democratizing Efficient Video Production for All
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Unofficial Implementation of Animate Anyone
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Generative Models by Stability AI
Emote Portrait Alive - using AI to reverse engineer code from the white paper. (abandoned)
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
[CVPR 2024] SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
A curated list of awesome LLM for Autonomous Driving resources (continually updated)
[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
An open source, lightweight note-taking service. Easily capture and share your great thoughts.
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)