348 results for source starred repositories

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Python 180 13 Updated Aug 30, 2024

The official GitHub page for the survey paper "Foundation Models for Music: A Survey".

61 Updated Aug 27, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,418 725 Updated Jun 24, 2024

Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" (ICLR 2024)

Python 124 11 Updated Sep 14, 2023

Official Implementation of EnCLAP (ICASSP 2024)

Python 89 5 Updated Jun 2, 2024

Speech, Language, Audio, Music Processing with Large Language Model

Python 459 36 Updated Aug 20, 2024

ESLTTS dataset

15 1 Updated Jun 21, 2024

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

289 11 Updated Aug 20, 2024

Instant voice cloning by MIT and MyShell.

Python 28,164 2,760 Updated Aug 21, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,419 373 Updated Aug 29, 2024

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 131 15 Updated Jul 25, 2024

This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture…

Python 31 2 Updated Jul 31, 2024

TorchCFM: a Conditional Flow Matching library

Python 1,025 77 Updated Aug 21, 2024

Inference and training library for high-quality TTS models.

Python 4,026 398 Updated Aug 19, 2024

Awesome speech/audio LLMs, representation learning, and codec models

559 26 Updated May 29, 2024

UP-TO-DATE LLM Watermark paper. 🔥🔥🔥

249 16 Updated Jun 14, 2024

🎛 🔊 A Python library for audio.

C++ 5,105 260 Updated Aug 26, 2024

music generation with masked transformers!

Jupyter Notebook 288 35 Updated Aug 2, 2024

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 271 18 Updated Apr 9, 2024

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 743 86 Updated Aug 7, 2024

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 404 36 Updated Jun 9, 2024

ASR text preprocessing utility

Python 20 5 Updated Aug 5, 2024
Python 12 Updated Mar 1, 2024

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,334 64 Updated Mar 8, 2024

Audio Codec Speech processing Universal PERformance Benchmark

Python 199 22 Updated Jun 19, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,981 821 Updated Jul 1, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,391 303 Updated Jan 4, 2024

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation, etc.

Python 339 29 Updated Jan 25, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 5,704 618 Updated Aug 27, 2024

📄 Awesome CV is a LaTeX template for your outstanding job application

TeX 22,657 4,743 Updated Aug 8, 2024