Skip to content
View QinHsiu's full-sized avatar
🎯
Focusing
🎯
Focusing
Block or Report

Block or report QinHsiu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Awesome-TTS

some amazing TTS projects
112 repositories
Python 243 35 Updated May 15, 2023

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 886 152 Updated Jul 5, 2023

Contrastive Language-Audio Pretraining

Python 1,290 124 Updated Jul 9, 2024

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 822 95 Updated Aug 13, 2024

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Python 1,135 166 Updated Feb 5, 2024

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1,865 497 Updated Jul 27, 2024

A deep neural network architecture for low-latency audio processing

Python 276 34 Updated Aug 15, 2023

Official Implementation of StyleTTS-VC

Python 156 19 Updated Apr 23, 2023

so-vits-svc fork with realtime support, improved interface and more features.

Python 8,616 1,141 Updated Aug 12, 2024

A simple GUI application that slices audio with silence detection

Python 1,175 160 Updated Jul 29, 2024

SoftVC VITS Singing Voice Conversion

Python 25,101 4,722 Updated Nov 11, 2023
Python 1,351 181 Updated Feb 11, 2024

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 9,936 852 Updated Jul 6, 2024

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 550 79 Updated Dec 27, 2023

List of speech synthesis papers.

982 120 Updated Jul 24, 2023

A Python wrapper for the high-quality vocoder "World"

Cython 718 118 Updated Oct 23, 2023

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 34,578 4,067 Updated Aug 15, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 66,040 7,756 Updated Aug 13, 2024

The deme page of InstructTTS

155 8 Updated Feb 10, 2024

The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"

Python 343 36 Updated Aug 3, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,323 4,011 Updated Aug 15, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 32,712 3,944 Updated Aug 14, 2024

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,357 223 Updated Jun 2, 2024

A library for audio and music analysis, feature extraction.

C 2,648 114 Updated May 24, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,359 301 Updated Jan 4, 2024

SpeechGPT Series: Speech Large Language Models

Python 1,174 75 Updated Jul 22, 2024

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,586 917 Updated Apr 23, 2024

singing voice change based on whisper, and lora for singing voice clone

Python 611 79 Updated Nov 3, 2023

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.

Python 311 45 Updated Feb 9, 2024