A Visual Studio Code extension with support for the Ruff linter.

TypeScript 953 45 Updated Jul 11, 2024

Easy-to-Use Speech MOS predictors

Python 180 12 Updated Oct 24, 2023

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Jupyter Notebook 247 36 Updated Mar 7, 2023

ESPnet Model Zoo

Python 243 41 Updated Jul 9, 2023
Python 15 1 Updated Jun 5, 2024

Faster Whisper transcription with CTranslate2

Python 10,229 859 Updated Jul 10, 2024

Neural Spline Flow, RealNVP, Autoregressive Flow, 1x1Conv in PyTorch.

Python 269 38 Updated Dec 14, 2023

Code for Neural Spline Flows paper

Python 253 42 Updated Jun 20, 2020

LLM training in simple, raw C/CUDA

Cuda 21,535 2,342 Updated Jul 11, 2024

vits2 backbone with multilingual-bert

Python 7,457 1,062 Updated Jul 8, 2024

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Jupyter Notebook 421 44 Updated Sep 11, 2023

A tool for exploring each layer in a docker image

Go 44,610 1,695 Updated Jun 16, 2024

Foundational model for human-like, expressive TTS

Python 3,503 616 Updated Jul 10, 2024

DALL·E Mini - Generate images from a text prompt

Python 14,701 1,193 Updated Nov 9, 2023

🐸 - A general purpose model trainer, as flexible as it gets

Python 179 104 Updated Mar 7, 2024

Japanese to romaji converter in Python

Python 277 20 Updated Jul 3, 2024

Instant voice cloning by MyShell.

Python 27,216 2,639 Updated Jul 6, 2024

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 12,435 1,738 Updated Jun 27, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 31,939 3,830 Updated Jul 8, 2024

JSON conversion and parsing for VBA

Visual Basic 1,724 562 Updated Mar 14, 2024

Official implementation of MelHuBERT

Python 56 4 Updated Jul 9, 2024

🔮 ChatGPT Desktop Application (Mac, Windows and Linux)

Rust 51,704 5,807 Updated Jun 19, 2024

Layer-wise analysis of self-supervised pre-trained speech representations

Python 83 15 Updated Mar 6, 2024
Python 153 20 Updated Jul 25, 2022

NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis

Python 139 11 Updated Feb 11, 2023

[CVPR24 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation

Python 71 1 Updated Jul 6, 2024

This is the GitHub page for publicly available emotional speech data.

305 22 Updated Jan 6, 2022

CURRENNNT codes and scripts

Cuda 77 11 Updated Aug 31, 2020

Official implementation of the source-filter HiFiGAN vocoder

Python 233 35 Updated Jul 29, 2023

UT-Sarulab MOS prediction system using SSL models

Python 144 14 Updated Apr 11, 2024