[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 559 40 Updated Aug 21, 2024

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,609 1,224 Updated Dec 6, 2023

facebookresearch / audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

Python 2,656 249 Updated Jun 24, 2024

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 35,076 4,108 Updated Aug 19, 2024

rsxdalv / tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

TypeScript 1,579 169 Updated Aug 19, 2024

camenduru / MusicGen-colab

Jupyter Notebook 515 63 Updated Jul 25, 2023

shansongliu / M2UGen

This is the official repository for M2UGen

Jupyter Notebook 435 38 Updated May 8, 2024

shansongliu / MU-LLaMA

MU-LLaMA: Music Understanding Large Language Model

Python 221 16 Updated Mar 25, 2024

prophesier / diff-svc

Singing Voice Conversion via diffusion model

Jupyter Notebook 2,614 799 Updated Jul 10, 2023

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell.

Python 28,096 2,752 Updated Aug 21, 2024

HumanAIGC / AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,237 952 Updated Jul 26, 2024

voicepaw / so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Python 8,646 1,149 Updated Aug 21, 2024

bmaltais / kohya_ss

Python 9,185 1,194 Updated Aug 27, 2024

espnet / espnet

End-to-End Speech Processing Toolkit

Python 8,233 2,142 Updated Aug 27, 2024

haoheliu / versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,062 104 Updated May 10, 2024

kuleshov / audio-super-res

Audio super resolution using neural networks

Python 1,143 205 Updated Oct 24, 2023

OpenTalker / SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 11,526 2,155 Updated Jun 26, 2024

OpenTalker / video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 6,312 929 Updated Aug 5, 2024

OpenBMB / ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Shell 24,922 3,116 Updated Aug 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jiyoon No nojiyoon

Block or report nojiyoon

Stars

junyanz / pytorch-CycleGAN-and-pix2pix

f90 / Wave-U-Net

allenai / OLMo

MubertAI / Mubert-Text-to-Music

instantX-research / InstantID

openai / plugins-quickstart

openai / chatgpt-retrieval-plugin

TencentARC / PhotoMaker

Vchitect / Vlogger

ProjectNUWA / DragNUWA

w-okada / voice-changer

ddlBoJack / emotion2vec