I may be slow to respond.
  • Nanyang Technological University
  • Singapore
  • X @zeqi_xiao

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 5,290 479 Updated Aug 9, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 9,088 573 Updated Aug 10, 2024

COLMAP - Structure-from-Motion and Multi-View Stereo

C++ 7,277 1,478 Updated Aug 10, 2024

ControlNet++: All-in-one ControlNet for image generations and editing!

Python 1,510 30 Updated Aug 6, 2024

[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution

Python 2,038 126 Updated Jul 12, 2024

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Python 4,080 358 Updated Jul 30, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 640 48 Updated Jul 29, 2024

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 416 13 Updated Aug 9, 2024
Python 664 40 Updated Jul 29, 2024

Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

71 1 Updated Jul 16, 2024

[SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

Python 67 2 Updated Jul 13, 2024

Understand Human Behavior to Align True Needs

Python 3,140 275 Updated Jul 20, 2024

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 2,609 182 Updated Jul 19, 2024

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Python 3,254 186 Updated Feb 29, 2024

Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”

Python 336 36 Updated Jul 5, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,122 278 Updated May 4, 2024

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 2,901 214 Updated Jul 29, 2024

Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model"

153 13 Updated Aug 9, 2024

Video-Infinity generates long videos quickly using multiple GPUs without extra training.

Python 150 13 Updated Aug 4, 2024
Jupyter Notebook 63 5 Updated Jul 12, 2024

Enjoy the magic of Diffusion models!

Python 6,107 544 Updated Aug 2, 2024

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python 366 10 Updated Aug 7, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,123 40 Updated Jul 14, 2024

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,568 163 Updated Jul 26, 2024
Python 381 16 Updated May 24, 2024

a research paper for generative cartoon interpolation

Python 4,981 408 Updated Jun 1, 2024

Your image is almost there!

Python 7,063 411 Updated Jul 26, 2024