Mu-Y

Ph.D. Student at the University of Texas at Dallas. Former USC/ISI @PlusLabNLP. Working on Speech.

24 followers · 63 following

University of Texas at Dallas
https://mu-y.github.io/
@MuYang55

Stars

jasonppy / VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,270 712 Updated Jun 24, 2024

0nutation / USLM

Unified Speech Language Model for the paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" (ICLR 2024)

Python 125 11 Updated Sep 14, 2023

jaeyeonkim99 / EnCLAP

Official Implementation of EnCLAP (ICASSP 2024)

Python 88 4 Updated Jun 2, 2024

X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Python 415 33 Updated Jul 24, 2024

mushanshanshan / ESLTTS

ESLTTS dataset

15 1 Updated Jun 21, 2024

metame-ai / awesome-audio-plaza

Daily tracking of awesome audio papers, including music generation, zero-shot TTS, ASR, and audio generation

259 11 Updated Jul 20, 2024

myshell-ai / OpenVoice

Instant voice cloning by MyShell.

Python 27,595 2,680 Updated Jul 23, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,319 368 Updated Jul 25, 2024

Stability-AI / stable-audio-metrics

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 118 13 Updated Jul 25, 2024

umbertocappellazzo / PETL_AST

This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture…

Python 31 1 Updated Jul 16, 2024

atong01 / conditional-flow-matching

TorchCFM: a Conditional Flow Matching library

Python 936 64 Updated Jul 25, 2024

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 2,902 301 Updated Jul 25, 2024

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

516 26 Updated May 29, 2024

hzy312 / Awesome-LLM-Watermark

UP-TO-DATE LLM Watermark paper. 🔥🔥🔥

237 16 Updated Jun 14, 2024

spotify / pedalboard

🎛 🔊 A Python library for audio.

C++ 5,015 251 Updated Jul 26, 2024

hugofloresgarcia / vampnet

music generation with masked transformers!

Jupyter Notebook 275 35 Updated Jul 20, 2024

hubertsiuzdak / snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 252 17 Updated Apr 9, 2024

gemelo-ai / vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 714 83 Updated Jul 6, 2024

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 379 32 Updated Jun 9, 2024

voidful / asrp

ASR text preprocessing utility

Python 20 5 Updated May 1, 2023

aixplain / NoRefER

Python 12 Updated Mar 1, 2024

XueFuzhao / OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,312 63 Updated Mar 8, 2024

voidful / Codec-SUPERB

Audio Codec Speech processing Universal PERformance Benchmark

Python 188 22 Updated Jun 19, 2024

karpathy / minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,802 803 Updated Jul 1, 2024

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,326 299 Updated Jan 4, 2024

modelscope / FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation, etc.

Python 319 28 Updated Jan 25, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 5,021 548 Updated Jul 26, 2024

posquit0 / Awesome-CV

📄 Awesome CV is a LaTeX template for your outstanding job application

TeX 22,414 4,716 Updated Jul 15, 2024

CNChTu / Diffusion-SVC

Python 387 57 Updated Jul 11, 2024

yxlllc / DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Python 1,743 232 Updated Jul 14, 2024
