- Nanyang Technological University
- Singapore
- @zeqi_xiao
Stars
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
COLMAP - Structure-from-Motion and Multi-View Stereo
ControlNet++: All-in-one ControlNet for image generation and editing!
[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
SEED-Story: Multimodal Long Story Generation with Large Language Model
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
[SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling
Understand Human Behavior to Align True Needs
CoTracker is a model for tracking any point (pixel) on a video.
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
Official PyTorch implementation of “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model"
Video-Infinity generates long videos quickly using multiple GPUs without extra training.
Enjoy the magic of Diffusion models!
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Latte: Latent Diffusion Transformer for Video Generation.
A research paper for generative cartoon interpolation