- Sony
- Tokyo
- https://www.linkedin.com/in/junki/
- @jojonki
Stars
A Visual Studio Code extension with support for the Ruff linter.
ttslearn: Library for the book Pythonで学ぶ音声合成 (Text-to-speech with Python)
Faster Whisper transcription with CTranslate2
Neural Spline Flow, RealNVP, Autoregressive Flow, 1x1Conv in PyTorch.
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
A tool for exploring each layer in a docker image
Foundational model for human-like, expressive TTS
DALL·E Mini - Generate images from a text prompt
🐸 - A general purpose model trainer, as flexible as it gets
A multi-voice TTS system trained with an emphasis on quality
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
JSON conversion and parsing for VBA
🔮 ChatGPT Desktop Application (Mac, Windows and Linux)
Layer-wise analysis of self-supervised pre-trained speech representations
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
[CVPR24 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation
This is the GitHub page for publicly available emotional speech data.
CURRENNNT codes and scripts
Official implementation of the source-filter HiFiGAN vocoder
UT-Sarulab MOS prediction system using SSL models