cpdu

Chenpeng Du cpdu

59 followers · 10 following

Achievements

Block or Report

Block or report cpdu

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

tianweiy / DMD2

Python 387 23 Updated Jul 10, 2024

QwenLM / Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,356 97 Updated Jul 5, 2024

metavoiceio / metavoice-src

Foundational model for human-like, expressive TTS

Python 3,643 637 Updated Jul 30, 2024

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 35,974 3,777 Updated Jul 28, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,389 373 Updated Aug 4, 2024

francislata / unicats

An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".

Python 21 1 Updated Nov 4, 2023

sony / bigvsan

Pytorch implementation of BigVSAN

Python 195 16 Updated Mar 23, 2024

modelscope / FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Python 335 28 Updated Jan 25, 2024

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,077 98 Updated Jul 11, 2024

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Python 7,508 749 Updated Feb 11, 2024

lifeiteng / vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 1,971 321 Updated Nov 14, 2023

lucidrains / voicebox-pytorch

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Python 579 47 Updated Feb 16, 2024

sp-nitech / diffsptk

A differentiable version of SPTK

Python 158 13 Updated Aug 20, 2024

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,674 1,032 Updated Aug 15, 2024

k2-fsa / icefall

Python 864 282 Updated Aug 17, 2024

SpeechifyInc / Meta-voicebox

Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.

548 31 Updated Jun 19, 2023

WelkinYang / WaveODE

An ODE-based generative neural vocoder using Rectified Flow

Python 57 6 Updated Apr 29, 2023

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,382 4,020 Updated Aug 20, 2024

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 38,502 4,320 Updated Aug 20, 2024

liusongxiang / Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

423 24 Updated Jan 17, 2024

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,276 2,341 Updated Aug 20, 2024

NVIDIA / BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 824 96 Updated Aug 13, 2024

yoyololicon / music-spectrogram-diffusion-pytorch

Python 70 4 Updated Jan 29, 2023

magenta / midi-ddsp

Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)

Python 298 18 Updated Nov 30, 2022

magenta / ddsp

DDSP: Differentiable Digital Signal Processing

Python 2,839 331 Updated Jun 17, 2024

ndkgit339 / spe-dss

Speech Parameter Estimation Using Differentiable Speech Synthesizer

Python 44 5 Updated May 9, 2023

chomeyama / SiFiGAN

Official implementation of the source-filter HiFiGAN vocoder

Python 233 34 Updated Jul 29, 2023

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 631 109 Updated Aug 20, 2024

nnsvs / nnsvs

Neural network-based singing voice synthesis library for research

Python 676 80 Updated Oct 9, 2023

huawei-noah / Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Jupyter Notebook 547 114 Updated Sep 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chenpeng Du cpdu

Achievements

Achievements

Block or report cpdu

Stars

tianweiy / DMD2

QwenLM / Qwen-Audio

metavoiceio / metavoice-src

mlabonne / llm-course

open-mmlab / Amphion

francislata / unicats

sony / bigvsan

modelscope / FunCodec

descriptinc / descript-audio-codec

Plachtaa / VALL-E-X

lifeiteng / vall-e

lucidrains / voicebox-pytorch

sp-nitech / diffsptk

facebookresearch / seamless_communication

k2-fsa / icefall

SpeechifyInc / Meta-voicebox

WelkinYang / WaveODE

microsoft / DeepSpeed

hpcaitech / ColossalAI

liusongxiang / Large-Audio-Models

NVIDIA / NeMo

NVIDIA / BigVGAN

yoyololicon / music-spectrogram-diffusion-pytorch

magenta / midi-ddsp

magenta / ddsp

ndkgit339 / spe-dss

chomeyama / SiFiGAN

wenet-e2e / wespeaker

nnsvs / nnsvs

huawei-noah / Speech-Backbones