CasonTsai

CasonTsai

6 followers · 27 following

Achievements

Stars

Plachtaa / seed-vc

zero-shot voice conversion & singing voice conversion with in context learning

Python 266 27 Updated Sep 24, 2024

AIGODLIKE / AIGODLIKE-ComfyUI-Translation

A plugin for multilingual translation of ComfyUI，This plugin implements translation of resident menu bar/search bar/right-click context menu/node, etc

JavaScript 1,477 119 Updated Sep 26, 2024

ltdrdata / ComfyUI-Manager

ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…

JavaScript 6,150 765 Updated Sep 27, 2024

amao2001 / ganloss-latent-space

有趣的80后程序员的工作流分享

57 11 Updated Sep 26, 2024

stefantaubert / pinyin-to-ipa

Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.

Python 30 6 Updated Jun 12, 2024

Kyubyong / g2pC

g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese

Python 237 30 Updated Jul 10, 2019

yzhou359 / VisemeNet_tensorflow

Python 190 59 Updated Jul 15, 2021

chdzq / ARPAbetAndIPAConvertor

Python 63 14 Updated Dec 18, 2022

juntaosun / LangSegment

It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool. 强大的TTS多语言（97种语言）混合文本内容自动分词工具。

Python 90 8 Updated Sep 7, 2024

Magicboomliu / Viseme-Classification

A pipeline from Dataset Gathering,Data annotations, Model training,Model Evaluation for viseme (visual sound phoneme) classification

Python 12 2 Updated Jan 19, 2021

YUCHEN005 / UniVPM

Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"

Python 16 1 Updated Jun 21, 2023

mozillazg / pypinyin-g2pW

基于 g2pW 提升 pypinyin 的准确性

Python 75 7 Updated Jun 24, 2023

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,679 250 Updated Sep 25, 2024