Block or Report
Block or report YahooKID
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
one for all, Optimal generator with No Exception
The example project of inferencing Pose Estimation using Core ML
Repository for Real-World Blur Dataset for Learning and Benchmarking Deblurring Algorithms
Generative models for conditional audio generation
Character Animation (AnimateAnyone, Face Reenactment)
Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data (CVPR 24); Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer (ECCV 2024)
[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
👀 Eye Tracking library easily implementable to your projects
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
High-Resolution Image Synthesis with Latent Diffusion Models
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
Using Claude Opus to reverse engineer code from MegaPortraits: One-shot Megapixel Neural Head Avatars
Efficient face emotion recognition in photos and videos
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Using Claude Opus to reverse engineer code from VASA white paper - WIP - (this is for La Raza 🎷)
Latent Text-to-Image Diffusion