Skip to content
View sshan-zhao's full-sized avatar

Block or report sshan-zhao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]

Python 970 73 Updated Jul 23, 2024

Dual-Branch Network for Portrait Image Quality Assessment

Python 13 1 Updated May 22, 2024

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)

Jupyter Notebook 443 26 Updated May 29, 2024

A collection of various image grids created with Flux. Things like hair styles, clothing, nationalities, ages, etc.

JavaScript 157 10 Updated Aug 16, 2024
Python 1,098 72 Updated Oct 30, 2024

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 512 31 Updated Nov 4, 2024

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 3,441 295 Updated Oct 11, 2024

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Python 2,266 164 Updated Aug 7, 2024

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Python 553 60 Updated Oct 4, 2024

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python 2,454 262 Updated Jun 28, 2024

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 17,996 2,782 Updated Jul 26, 2024

Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IF…

C++ 13,116 881 Updated Nov 2, 2024

我的 ComfyUI 工作流合集 | My ComfyUI workflows collection

5,221 489 Updated Oct 30, 2024

A collection of awesome video generation studies.

TeX 341 13 Updated Nov 12, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,223 2,174 Updated Aug 9, 2024

(CVPR 2023) CelebV-Text: A Large-Scale Facial Text-Video Dataset

Python 389 33 Updated Jan 4, 2024

Annotated Flow Matching paper

Jupyter Notebook 131 4 Updated Sep 14, 2024

Infinite Photorealistic Worlds using Procedural Generation

Python 5,399 468 Updated Nov 11, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,802 180 Updated Oct 31, 2024

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,579 206 Updated Sep 8, 2024

[NeurIPS'23] ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding

Python 8 Updated Dec 9, 2023

[CSUR] A Survey on Video Diffusion Models

1,803 90 Updated Nov 8, 2024

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Python 1,064 137 Updated Jul 12, 2024

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Python 896 154 Updated Apr 4, 2024

📖 A curated list of resources dedicated to talking face.

1,328 111 Updated Nov 3, 2024

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 6,639 976 Updated Aug 5, 2024

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Python 1,060 84 Updated Jan 23, 2024

The world's simplest facial recognition api for Python and the command line

Python 53,425 13,488 Updated Aug 21, 2024

WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens

193 4 Updated Jan 19, 2024

[CVPR2024] Make Your Dream A Vlog

Python 415 42 Updated Mar 19, 2024
Next