Skip to content
View hcy71o's full-sized avatar
Block or Report

Block or report hcy71o

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official inference repo for FLUX.1 models

Python 8,245 510 Updated Aug 16, 2024

Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))

Python 19 Updated Aug 6, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,875 514 Updated May 31, 2024

Simple text to phones converter for multiple languages

Python 1,175 163 Updated Aug 1, 2024

Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)

Python 56 4 Updated Apr 4, 2024

Spectral Analysis in Python

Python 333 88 Updated Dec 23, 2023

vits2 backbone with multilingual-bert(한국어 지원)

Python 24 1 Updated Apr 6, 2024

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Python 656 76 Updated Jul 27, 2024

vits2 backbone with multilingual-bert

Python 7,671 1,088 Updated Aug 15, 2024
Python 11 2 Updated Jul 16, 2023
Python 92 39 Updated Mar 24, 2023

DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.

Python 47 4 Updated Sep 25, 2023
JavaScript 7 3 Updated Jul 5, 2022

Official implementation of the source-filter HiFiGAN vocoder

Python 233 34 Updated Jul 29, 2023

Easy-to-Use Speech MOS predictors

Python 197 13 Updated Oct 24, 2023

The official implementation of HierSpeech++

Python 1,147 134 Updated Feb 20, 2024

Unofficial implementation of NVIDIA P-Flow TTS paper

Python 204 30 Updated Jul 1, 2024
Jupyter Notebook 31 7 Updated Jan 30, 2023

リアルタイムボイスチェンジャー Realtime Voice Changer

Python 15,713 1,693 Updated Aug 7, 2024

E2E TTS using Conditional Flow Matching (Experimental*)

Jupyter Notebook 61 5 Updated Nov 10, 2023

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,589 917 Updated Apr 23, 2024

Collect Voice Conversion researches

TypeScript 89 7 Updated Aug 13, 2024

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Python 129 19 Updated Oct 16, 2023

Bilingual-TTS (Japanese and Korean)

Jupyter Notebook 25 5 Updated Jul 1, 2023

Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)

Python 73 7 Updated Feb 28, 2024

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilingual Cleaners

Python 61 7 Updated Nov 21, 2022

VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai

Python 29 17 Updated Mar 19, 2024

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Python 292 35 Updated Jul 22, 2024

unofficial vits2-TTS implementation in pytorch

Python 468 84 Updated Mar 28, 2024

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

C 3,997 853 Updated Aug 17, 2024
Next