-
Zhejiang University
- https://yerfor.github.io/en
Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Stars
Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
Zhejiang University Graduation Thesis LaTeX Template
Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
Hackable and optimized Transformers building blocks, supporting a composable construction.
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Real time interactive streaming digital human
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…
💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]
[CVPR 2024] The official repo for FlashAvatar
Fitting 3DMM models to multiview (monocular) video data.
[CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"
Implementation of PyTorch: "GAMBA: MARRY GAUSSIAN SPLATTING WITH MAMBA FOR SINGLE-VIEW 3D RECONSTRUCTION"
TriplaneGaussian: A new hybrid representation for single-view 3D reconstruction.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.