Starred repositories
Code and dataset for photorealistic Codec Avatars driven from audio
Pythonic AI generation of images and videos
[CVPR 2024] Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework.
A working list of recent human video generation methods. This repository focuses on half/full-body human video generation methods. The NeRF, Gaussian splatting, motion pose, and talking head/portrait is n…
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Manipulable audio-driven talking head generation system
[ICCV 2023] Official implementation of "Make Encoder Great Again in 3D GAN Inversion through Geometry and Occlusion-Aware Encoding" in International Conference on Computer Vision (ICCV) 2023.
[ECCV 2022] Flow-Guided Transformer for Video Inpainting
🎓 Automatically Update CV Papers Daily using GitHub Actions (Updates Every 12 Hours)
🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.
[ECCV 2024] EDTalk - Official PyTorch Implementation
[ICLR 2024] Generalizable and Precise Head Avatar from Image(s)
[ICASSP 2024] Adaptive Super Resolution For One-Shot Talking-Head Generation
Contextual Loss (CX) and Contextual Bilateral Loss (CoBi).
Paper 'Transformer based Pluralistic Image Completion with Reduced Information Loss' in TPAMI 2024 and 'Reduce Information Loss in Transformers for Pluralistic Image Inpainting' in CVPR 2022
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Talking avatar heads for the IF_AI tools; integrates DreamTalk in ComfyUI
Preprocessing Scripts for Talking Face Generation
VQ-VAE implementation using Vision Transformers for both the encoder and decoder
3D-Aware Face Editing via Warping-Guided Latent Direction Learning
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation