AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, D…

HTML 710 80 Updated Jul 28, 2024

fishaudio / fish-speech

Brand new TTS solution

Python 6,575 512 Updated Jul 29, 2024

IIEleven11 / StyleTTS2FineTune

Python 142 27 Updated Jul 29, 2024

licungang / amazon-polly-metahumans

C++ 1 Updated Apr 20, 2023

zhaoyun0071 / CosyVoice-windows-GUI

Windows不用搭建环境只要英伟达显卡就行，解压即用！

14 1 Updated Jul 14, 2024

Executedone / Chinese-FastSpeech2

基于标贝数据继续训练，同时对原本的FastSpeech2模型做了改进，引入了韵律表征以及韵律预测模块，使中文发音更生动且富有节奏

Python 230 38 Updated Sep 10, 2023

X-LANCE / VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Python 260 20 Updated Mar 24, 2024

exadel-inc / CompreFace

Leading free and open-source face recognition system

Java 5,016 684 Updated Jul 19, 2024

nii-yamagishilab / ZMM-TTS

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

C 100 8 Updated Mar 6, 2024

yangdongchao / InstructTTS

The deme page of InstructTTS

153 8 Updated Feb 10, 2024

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 4,497 358 Updated Jul 10, 2024

kehanlu / Mandarin-Wav2Vec2

Pre-trained Wav2vec2.0 for Mandarin

33 5 Updated Oct 30, 2022

TencentGameMate / chinese_speech_pretrain

chinese speech pretrained models

Shell 973 84 Updated Mar 11, 2024

ddlBoJack / Awesome-Speech-Pretraining

Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.

195 13 Updated Jan 18, 2024

j20001970 / GDMP-demo

Demo project for GDMP plugin.

C# 15 3 Updated May 15, 2024

rhasspy / piper

A fast, local neural text to speech system

C++ 5,282 374 Updated Jul 23, 2024

liukuangxiangzi / audio2viseme

The code generate phoneme from audio features.

Python 16 3 Updated Jun 15, 2021

k2-fsa / sherpa-ncnn

Real-time speech recognition using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Raspberry Pi, VisionFive2, LicheePi4A etc.

C++ 917 142 Updated Jul 11, 2024

espeak-ng / espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

C 3,951 853 Updated Jul 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CasonTsai

Achievements

Achievements

Block or report CasonTsai

Stars

ShadowfallStudios / ALS-Community

mingyuan-zhang / MotionDiffuse

GuyTevet / motion-diffusion-model

ChenFengYe / motion-latent-diffusion

EricGuo5513 / text-to-motion

RodinHD / RodinHD

SaltedSlark / Fast-3D-Talking-Face

endink / Mediapipe4u-plugin

metavoiceio / metavoice-src

Artrajz / vits-simple-api

manmay-nakhashi / tortoise-tts-fastest

erew123 / alltalk_tts