Zero-shot voice conversion and singing voice conversion with in-context learning

Python 266 27 Updated Sep 24, 2024

A plugin for multilingual translation of ComfyUI. It translates the resident menu bar, search bar, right-click context menu, nodes, etc.

JavaScript 1,477 119 Updated Sep 26, 2024

ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…

JavaScript 6,150 765 Updated Sep 27, 2024

Workflow sharing from a fun-loving programmer born in the 1980s

57 11 Updated Sep 26, 2024

Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.

Python 30 6 Updated Jun 12, 2024

g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese

Python 237 30 Updated Jul 10, 2019

A powerful tool for automatic recognition and segmentation of mixed multilingual (97 languages) text content, intended for TTS.

Python 90 8 Updated Sep 7, 2024

A pipeline covering dataset gathering, data annotation, model training, and model evaluation for viseme (visual speech phoneme) classification

Python 12 2 Updated Jan 19, 2021

Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"

Python 16 1 Updated Jun 21, 2023

Improves the accuracy of pypinyin using g2pW

Python 75 7 Updated Jun 24, 2023

An open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.

Python 2,679 250 Updated Sep 25, 2024

g2p: English Grapheme To Phoneme Conversion

Python 796 128 Updated Jan 5, 2023

Chinese and English bilingual G2P

Python 19 3 Updated Jul 16, 2023

Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin (Bopomofo) or Pinyin (INTERSPEECH 2022)

Python 277 38 Updated Jun 16, 2024

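A PyTorch-based speech synthesis project that uses VITS. VITS is an end-to-end speech synthesis method that is very simple to use: it needs no complicated steps such as text alignment and supports one-click training and generation, which greatly lowers the barrier to entry.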

Python 32 5 Updated Aug 30, 2024

FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry

813 64 Updated Aug 27, 2024

A guide to grapheme-to-phoneme conversion and the phoneme list for the ACE singing voice synthesis engine

Python 31 7 Updated Oct 13, 2023

Central China Normal University IoT Association, Algorithm Group: Natural Language Processing Team

Python 1 Updated Oct 13, 2023

Transformer-based OCR, extended to official seal (stamp) recognition

Python 132 24 Updated Jun 20, 2024

Python Package for Airborne RGB machine learning

Python 489 172 Updated Sep 26, 2024

Separately maintained Chinese TTS

Python 35 6 Updated Oct 28, 2022

PyTorch reimplementation of audio-driven face mesh or blendshape models, including Audio2Mesh, VOCA, etc.

Python 9 2 Updated Sep 6, 2024

PaddleSpeech TTS cpp

Python 35 12 Updated Mar 8, 2023

Multi-speaker Speech Synthesis Using VITS (KO, JA, EN, ZH)

Python 74 7 Updated Feb 28, 2024

phoneme toolkit

Python 3 29 Updated Feb 19, 2020

vits chinese, tts chinese, tts mandarin: billed as the easiest-to-train speech synthesis system with the best audio quality to date

Python 209 72 Updated Sep 27, 2021

A fully working PyTorch implementation of NaturalSpeech (Tan et al., 2022)

Python 465 67 Updated Feb 7, 2024