Skip to content
View Qiulin-W's full-sized avatar
Block or Report

Block or report Qiulin-W

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation

Python 143 5 Updated Jul 28, 2024
Python 67 3 Updated Jul 12, 2024

Codes for ID-Specific Video Customized Diffusion

Python 445 35 Updated Feb 22, 2024

🔥 StableIdentity: Inserting Anybody into Anywhere at First Sight

Python 246 7 Updated Mar 22, 2024

Bring portraits to life!

Python 10,101 980 Updated Aug 14, 2024

RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with the file name of the associated labeled images (no urls or im…

87 2 Updated Jun 25, 2024

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator

Jupyter Notebook 383 25 Updated May 29, 2024

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 1,774 155 Updated Jul 17, 2024

SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training

Python 135 3 Updated Jul 5, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 602 34 Updated Aug 6, 2024
Python 1,893 116 Updated Aug 15, 2024

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 27,383 3,437 Updated Aug 6, 2024

Repository for Detail-revealing Deep Video Super-resolution https://arxiv.org/abs/1704.02738

Python 261 59 Updated Dec 31, 2019

Kolmogorov Arnold Networks

Jupyter Notebook 14,097 1,278 Updated Aug 11, 2024

Minimal implementation of scalable rectified flow transformers, based on SD3's approach

Jupyter Notebook 345 23 Updated Jul 1, 2024

Official PyTorch implementation of the paper: Flow Matching in Latent Space

Python 170 5 Updated Jul 23, 2024

[MM 2024 Oral] Refiner for AIGC

Jupyter Notebook 23 1 Updated Jul 29, 2024

Fitting 3DMM models to multiview (monocular) video data.

Python 65 11 Updated Apr 2, 2024

[CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap".

10 Updated Jun 14, 2024

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 1,733 159 Updated Aug 14, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 1,977 81 Updated Aug 6, 2024

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

Python 161 5 Updated Jun 9, 2024

[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"

Python 125 2 Updated Aug 14, 2024

[Arxiv 2024] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models"

Python 135 4 Updated Apr 7, 2024

[CVPR 2024] On the Content Bias in Fréchet Video Distance

Python 64 1 Updated Aug 12, 2024

Evaluating text-to-image/video/3D models with VQAScore

Python 152 15 Updated Aug 11, 2024

Unified Multi-modal IAA Baseline and Benchmark

69 5 Updated Apr 16, 2024

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

139 10 Updated Aug 5, 2024
Python 7,044 545 Updated Aug 12, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,929 297 Updated Jul 16, 2024
Next