Stars
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
VideoSys: An easy and efficient system for video generation
A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC
LAVIS - A One-stop Library for Language-Vision Intelligence
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalizations
Text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Official inference repo for FLUX.1 models
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Automatic detection, masking, and inpainting with a detection model.
SEED-Story: Multimodal Long Story Generation with Large Language Model
📷 EasyPhoto | Your Smart AI Photo Generator.
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
One-click Face Swapper and Restoration powered by insightface 🔥
Streamlit — A faster way to build and share data apps.
[ECCV2024] IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild
[ECCV2024] Official inference code for the papers "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM
A collection of diffusion model papers categorized by subarea
A collection of resources on controllable generation with text-to-image diffusion models.
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
A PyTorch Implementation of Finite Scalar Quantization