Skip to content
View CasonTsai's full-sized avatar
Block or Report

Block or report CasonTsai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Replicated and optimized community version of Advanced Locomotion System V4 for Unreal Engine 5.4 with additional features & bug fixes

C++ 2,144 578 Updated Jul 10, 2024

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Python 818 73 Updated Jul 19, 2024

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Python 2,992 323 Updated Jul 9, 2024

[CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model

Python 550 47 Updated Jul 11, 2023

Official implementation for "Generating Diverse and Natural 3D Human Motions from Texts (CVPR2022)."

Python 429 38 Updated Jan 12, 2024
54 Updated Jul 8, 2024

Drive your metahuman to speak within 1 second.

Python 5 1 Updated Jul 24, 2024
Dockerfile 348 49 Updated Jun 20, 2024

Foundational model for human-like, expressive TTS

Python 3,573 624 Updated Jul 21, 2024

A simple VITS HTTP API, developed by extending Moegoe with additional features.

Python 748 116 Updated Jul 29, 2024

Faster Tortoise inference then Tortoise Fast Fork

Jupyter Notebook 121 7 Updated Apr 21, 2024

AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, D…

HTML 710 80 Updated Jul 28, 2024

Brand new TTS solution

Python 6,575 512 Updated Jul 29, 2024

Windows不用搭建环境只要英伟达显卡就行,解压即用!

14 1 Updated Jul 14, 2024

基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏

Python 230 38 Updated Sep 10, 2023

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Python 260 20 Updated Mar 24, 2024

Leading free and open-source face recognition system

Java 5,016 684 Updated Jul 19, 2024

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

C 100 8 Updated Mar 6, 2024

The deme page of InstructTTS

153 8 Updated Feb 10, 2024

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 4,497 358 Updated Jul 10, 2024

Pre-trained Wav2vec2.0 for Mandarin

33 5 Updated Oct 30, 2022

chinese speech pretrained models

Shell 973 84 Updated Mar 11, 2024

Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.

195 13 Updated Jan 18, 2024

Demo project for GDMP plugin.

C# 15 3 Updated May 15, 2024

A fast, local neural text to speech system

C++ 5,282 374 Updated Jul 23, 2024

The code generate phoneme from audio features.

Python 16 3 Updated Jun 15, 2021

Real-time speech recognition using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Raspberry Pi, VisionFive2, LicheePi4A etc.

C++ 917 142 Updated Jul 11, 2024

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

C 3,951 853 Updated Jul 29, 2024
Next