Deep Learning | Speech Synthesis | Neural Vocoder
42dot Inc.
- Seoul, Korea
- https://scholar.google.com/citations?authuser=1&user=d3VX0zQAAAAJ
Stars
Official PyTorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)