Stars
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
Predict prosody labels for Chinese sentences.
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
This is the GitHub page for publicly available emotional speech data.
Test datasets for the Montreal Forced Aligner
Based on https://github.com/sannawag/TD-PSOLA by Sanna Wager (9/18/19)
Deezer source separation library including pretrained models.
Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"
Audio fingerprinting and recognition in Python
Production First and Production Ready End-to-End Speech Recognition Toolkit
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services
Tacotron text to speech in C++(synthesize only)