Stars
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
[NeurIPS 24] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
ViViD: Video Virtual Try-on using Diffusion Models
[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
Official implementation of "DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents"
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
Official implementations for paper: InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
Implementation of MagViT2 Tokenizer in PyTorch
Open-Sora: Democratizing Efficient Video Production for All
An Open-source Toolkit for LLM Development
Official Repository of the paper "Trajectory Consistency Distillation"
Concept Sliders for Precise Control of Diffusion Models
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
PyTorch implementation of NMS and Soft-NMS; can be used directly in the official open-source YOLOv5 code
PyTorch code for some vision transformer models
The PyTorch implementation of Grounding 3D Object Affordance from 2D Interactions in Images.
Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (CVPR 2024)
Official code for "Style Aligned Image Generation via Shared Attention"
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)