yerfor

🧲

Focusing on new projects. I may be slow to respond.

Zhenhui Ye yerfor

🧲

Focusing on new projects. I may be slow to respond.

A Ph.D. student with many ideas~ Work hard no anxiety. Currently working on speech synthesis and talking face generation.

344 followers · 17 following

Zhejiang University
https://yerfor.github.io/en

Achievements

Highlights

Lists (3)

Sort

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

GTSinger / GTSinger

Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Python 61 6 Updated Oct 11, 2024

TheNetAdmin / zjuthesis

Zhejiang University Graduation Thesis LaTeX Template

TeX 2,574 602 Updated Sep 6, 2024

warmshao / FasterLivePortrait

Bring portraits to life in Real Time！onnx/tensorrt support！实时肖像驱动！

Python 487 46 Updated Sep 12, 2024

FoundationVision / VAR

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 4,059 304 Updated Oct 6, 2024

KwaiVGI / LivePortrait

Bring portraits to life!

Python 12,251 1,295 Updated Oct 7, 2024

BadToBest / EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 2,611 308 Updated Aug 15, 2024

gxyes / CrowdMoGen

CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation

56 Updated Jul 9, 2024

PixArt-alpha / PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,630 78 Updated Aug 5, 2024

PixArt-alpha / PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,729 175 Updated Aug 1, 2024

fudan-generative-vision / hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 9,290 1,279 Updated Sep 14, 2024

aigc-apps / EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,228 92 Updated Oct 11, 2024

henry123-boy / SpaTracker

[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space

Python 696 24 Updated Jun 4, 2024

facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,467 603 Updated Oct 11, 2024

ali-vilab / dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Python 1,575 191 Updated Jan 15, 2024

TMElyralab / MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 2,562 313 Updated Sep 23, 2024

lipku / livetalking

Real time interactive streaming digital human

Python 3,604 514 Updated Oct 5, 2024

Kedreamix / Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…

Python 1,880 310 Updated Sep 27, 2024

Kedreamix / Awesome-Talking-Head-Synthesis

💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩

765 41 Updated Oct 9, 2024