QinHsiu

🎯

Focusing

Man proposes, God disposes.

22 followers · 150 following

https://qinhsiu.github.io

Stars

Awesome-TTS

some amazing TTS projects

112 repositories

b04901014 / MQTTS

Python 243 35 Updated May 15, 2023

aliutkus / speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 886 152 Updated Jul 5, 2023

LAION-AI / CLAP

Contrastive Language-Audio Pretraining

Python 1,290 124 Updated Jul 9, 2024

NVIDIA / BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 822 95 Updated Aug 13, 2024

PlayVoice / vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Python 1,135 166 Updated Feb 5, 2024

jik876 / hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1,865 497 Updated Jul 27, 2024

vb000 / Waveformer

A deep neural network architecture for low-latency audio processing

Python 276 34 Updated Aug 15, 2023

yl4579 / StyleTTS-VC

Official Implementation of StyleTTS-VC

Python 156 19 Updated Apr 23, 2023

voicepaw / so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Python 8,616 1,141 Updated Aug 12, 2024

flutydeer / audio-slicer

A simple GUI application that slices audio with silence detection

Python 1,175 160 Updated Jul 29, 2024

svc-develop-team / so-vits-svc

SoftVC VITS Singing Voice Conversion

Python 25,101 4,722 Updated Nov 11, 2023

microsoft / NeuralSpeech

Python 1,351 181 Updated Feb 11, 2024

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 9,936 852 Updated Jul 6, 2024

yangdongchao / AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 550 79 Updated Dec 27, 2023

wenet-e2e / speech-synthesis-paper

List of speech synthesis papers.

982 120 Updated Jul 24, 2023

JeremyCCHsu / Python-Wrapper-for-World-Vocoder

A Python wrapper for the high-quality vocoder "World"

Cython 718 118 Updated Oct 23, 2023

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 34,578 4,067 Updated Aug 15, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 66,040 7,756 Updated Aug 13, 2024

yangdongchao / InstructTTS

The demo page of InstructTTS

155 8 Updated Feb 10, 2024

yangdongchao / Text-to-sound-Synthesis

The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"

Python 343 36 Updated Aug 3, 2023

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,323 4,011 Updated Aug 15, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 32,712 3,944 Updated Aug 14, 2024

haoheliu / AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,357 223 Updated Jun 2, 2024

libAudioFlux / audioFlux

A library for audio and music analysis, feature extraction.

C 2,648 114 Updated May 24, 2024

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,359 301 Updated Jan 4, 2024

0nutation / SpeechGPT

SpeechGPT Series: Speech Large Language Models

Python 1,174 75 Updated Jul 22, 2024

0nutation / awesome-diffusion4speech-papers

9 1 Updated Apr 24, 2023

PlayVoice / whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,586 917 Updated Apr 23, 2024

PlayVoice / lora-svc

singing voice change based on whisper, and lora for singing voice clone

Python 611 79 Updated Nov 3, 2023

Rongjiehuang / GenerSpeech

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.

Python 311 45 Updated Feb 9, 2024
