Highlights
- Pro
Block or Report
Block or report jottr
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusetts
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Foundational Models for State-of-the-Art Speech and Text Translation
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
An unofficial PyTorch implementation of the audio LM VALL-E
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
An Open Source text-to-speech system built by inverting Whisper.