[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
-
Updated
Sep 30, 2024 - Jupyter Notebook
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
awesome-LLM-controlled-constrained-generation
A fast speech-to-any translation model that supports simultaneous decoding and offers 28× speedup.
[AAAI 2024] GLOP: Learning Global Partition and Local Construction for Solving Large-scale Routing Problems in Real-time
Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
BERT-based pre-trained non-autoregressive sequence-to-sequence model
Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)
M.Sc. thesis on Continual Learning for Non-Autoregressive Neural Machine Translation
[EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
Reparameterized Discrete Diffusion Models for Text Generation
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Add a description, image, and links to the non-autoregressive topic page so that developers can more easily learn about it.
To associate your repository with the non-autoregressive topic, visit your repo's landing page and select "manage topics."