Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Jupyter Notebook 7,385 547 Updated Nov 1, 2024

Stability-AI / stable-audio-metrics

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 150 16 Updated Jul 25, 2024

umbertocappellazzo / PETL_AST

This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture…

Python 35 3 Updated Jul 31, 2024

atong01 / conditional-flow-matching

TorchCFM: a Conditional Flow Matching library

Python 1,196 97 Updated Oct 9, 2024

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 4,585 463 Updated Oct 30, 2024

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

678 33 Updated Nov 7, 2024

hzy312 / Awesome-LLM-Watermark

UP-TO-DATE LLM Watermark paper. 🔥🔥🔥

285 18 Updated Jun 14, 2024

spotify / pedalboard

🎛 🔊 A Python library for audio.

C++ 5,223 261 Updated Nov 7, 2024

hugofloresgarcia / vampnet

music generation with masked transformers!

Python 296 35 Updated Oct 8, 2024

hubertsiuzdak / snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 432 26 Updated Oct 22, 2024

gemelo-ai / vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 820 95 Updated Aug 7, 2024

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are presented on

Python 470 40 Updated Jun 9, 2024

Mu Yang Mu-Y

Stars

Berkeley-Speech-Group / sylber

THUDM / GLM-4-Voice

SWivid / F5-TTS

kehanlu / DeSTA2

AlanBaade / SyllableLM

0nutation / SpeechGPT

kyutai-labs / moshi

lukas-blecher / LaTeX-OCR

shinjiwlab / versa

jishengpeng / WavTokenizer

nicolaus625 / FM4Music

jasonppy / VoiceCraft

0nutation / USLM

jaeyeonkim99 / EnCLAP

X-LANCE / SLAM-LLM

mushanshanshan / ESLTTS

metame-ai / awesome-audio-plaza

myshell-ai / OpenVoice

open-mmlab / Amphion