Block or Report
Block or report 0xKayra
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
Generative models for conditional audio generation
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
a research paper for generative cartoon interpolation
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
[WIP] Layer Diffusion for WebUI (via Forge)
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
[CVPR 2023] DPE: Disentanglement of Pose and Expression for General Video Portrait Editing
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
camenduru / WhisperSpeech
Forked from collabora/WhisperSpeechAn Open Source text-to-speech system built by inverting Whisper.