Skip to content
View Mu-Y's full-sized avatar
Block or Report

Block or report Mu-Y

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,270 712 Updated Jun 24, 2024

Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)

Python 125 11 Updated Sep 14, 2023

Official Implementation of EnCLAP (ICASSP 2024)

Python 88 4 Updated Jun 2, 2024

Speech, Language, Audio, Music Processing with Large Language Model

Python 415 33 Updated Jul 24, 2024

ESLTTS dataset

15 1 Updated Jun 21, 2024

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

259 11 Updated Jul 20, 2024

Instant voice cloning by MyShell.

Python 27,595 2,680 Updated Jul 23, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,319 368 Updated Jul 25, 2024

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 118 13 Updated Jul 25, 2024

This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture…

Python 31 1 Updated Jul 16, 2024

TorchCFM: a Conditional Flow Matching library

Python 936 64 Updated Jul 25, 2024

Inference and training library for high-quality TTS models.

Python 2,902 301 Updated Jul 25, 2024

Awesome speech/audio LLMs, representation learning, and codec models

516 26 Updated May 29, 2024

UP-TO-DATE LLM Watermark paper. 🔥🔥🔥

237 16 Updated Jun 14, 2024

🎛 🔊 A Python library for audio.

C++ 5,015 251 Updated Jul 26, 2024

music generation with masked transformers!

Jupyter Notebook 275 35 Updated Jul 20, 2024

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 252 17 Updated Apr 9, 2024

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 714 83 Updated Jul 6, 2024

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 379 32 Updated Jun 9, 2024

ASR text preprocessing utility

Python 20 5 Updated May 1, 2023
Python 12 Updated Mar 1, 2024

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,312 63 Updated Mar 8, 2024

Audio Codec Speech processing Universal PERformance Benchmark

Python 188 22 Updated Jun 19, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,802 803 Updated Jul 1, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,326 299 Updated Jan 4, 2024

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Python 319 28 Updated Jan 25, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 5,021 548 Updated Jul 26, 2024

📄 Awesome CV is LaTeX template for your outstanding job application

TeX 22,414 4,716 Updated Jul 15, 2024
Python 387 57 Updated Jul 11, 2024

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Python 1,743 232 Updated Jul 14, 2024
Next