Skip to content
View zengchang233's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report zengchang233

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Local realtime voice AI

Python 1,867 86 Updated Nov 11, 2024

provides metadata for chains

Kotlin 8,849 6,626 Updated Nov 15, 2024

a MUSHRA compliant web audio API based experiment software

JavaScript 352 137 Updated Aug 9, 2024

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

Python 170 31 Updated Jul 25, 2024

State-of-the-Art zero-shot voice conversion & singing voice conversion with in context learning

Python 608 66 Updated Nov 15, 2024

Unsupervised Rhythm Modeling for Voice Conversion

Python 80 7 Updated Aug 3, 2023

A sequence-to-sequence voice conversion toolkit.

Python 86 10 Updated Jul 5, 2024

A toolkit for any-to-any encoder-decoder voice conversion systems

Python 81 8 Updated Aug 10, 2023

S3PRL-VC: A Voice Conversion Toolkit based on S3PRL

Python 97 12 Updated Jun 26, 2024

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!

Jupyter Notebook 340 55 Updated Apr 27, 2022

Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)

Python 112 13 Updated Feb 7, 2024

Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

Jupyter Notebook 312 39 Updated May 10, 2024

Lightweight Speech Representation Learning for One-Shot Voice Conversion

Python 16 1 Updated Aug 12, 2024

[WIP] VoiceSmith makes training text to speech models easy.

Python 222 32 Updated Oct 10, 2022

A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications.

986 108 Updated Nov 14, 2024

A curated list of awesome voice conversion, projects and communities.

199 12 Updated Oct 13, 2024

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,432 632 Updated Aug 13, 2024
HTML 11 Updated Oct 3, 2024

The official Python library for the OpenAI API

Python 22,985 3,226 Updated Nov 12, 2024

✨✨Latest Advances on Multimodal Large Language Models

12,652 808 Updated Nov 10, 2024
Python 54 2 Updated Dec 19, 2023

Audio Codec Speech processing Universal PERformance Benchmark

Python 219 22 Updated Nov 1, 2024

A minimal yet resourceful implementation of diffusion models (along with pretrained models + synthetic images for nine datasets)

Python 257 39 Updated Sep 4, 2024

Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering"

Python 44 5 Updated May 19, 2023

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

144 6 Updated Nov 4, 2024

Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS

Python 35 7 Updated Aug 4, 2023

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Python 319 44 Updated Feb 21, 2022

PyTorch Implementation of StyleSinger(AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis

Python 337 37 Updated Nov 9, 2024

Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment

Python 65 6 Updated Jul 5, 2024
Next