106 stars written in Python

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 35,500 4,333 Updated Aug 16, 2024

Faster Whisper transcription with CTranslate2

Python 12,524 1,048 Updated Nov 17, 2024

Manipulate audio with a simple and easy high level interface

Python 8,956 1,046 Updated Jul 25, 2024

Python library for audio and music analysis

Python 7,180 965 Updated Oct 8, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,876 1,265 Updated Dec 6, 2023

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,377 428 Updated Nov 13, 2024

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Python 3,483 257 Updated Nov 21, 2022

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

Python 3,398 676 Updated Dec 14, 2022

Noise suppression using deep filtering

Python 2,538 237 Updated Oct 17, 2024

This library provides common speech features for ASR including MFCCs and filterbank energies.

Python 2,376 617 Updated Oct 20, 2021

WaveRNN Vocoder + TTS

Python 2,143 698 Updated Jul 2, 2022

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,874 191 Updated Nov 13, 2024

CLI task management & automation tool

Python 1,871 175 Updated Jul 4, 2024

AI powered speech denoising and enhancement

Python 1,432 142 Updated Nov 5, 2024

Easily serialize Data Classes to and from JSON

Python 1,386 154 Updated Aug 8, 2024

Command line utility for forced alignment using Kaldi

Python 1,343 248 Updated Nov 11, 2024

Simple text to phones converter for multiple languages

Python 1,232 174 Updated Sep 26, 2024

Persian NLP Toolkit

Python 1,209 179 Updated Jul 16, 2024

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Python 1,187 101 Updated Oct 1, 2024

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Python 1,131 227 Updated May 3, 2024

A fast local neural text to speech engine for Mycroft

Python 1,075 103 Updated Dec 8, 2023

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.

Python 1,073 339 Updated Jun 8, 2024

🔉 Play and Record Sound with Python 🐍

Python 1,054 149 Updated Nov 1, 2024

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 962 88 Updated Nov 8, 2024

Create highly reproducible python environments

Python 862 106 Updated May 20, 2024

Modules to convert numbers to words. 42 --> forty-two

Python 824 501 Updated Oct 2, 2024

g2p: English Grapheme To Phoneme Conversion

Python 811 128 Updated Jan 5, 2023

🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.

Python 807 84 Updated Aug 20, 2024

Open STT

Python 783 81 Updated Mar 11, 2022

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Python 775 113 Updated Mar 26, 2024