-
Nabu Casa
- United States
- https://synesthesiam.com
- @rhasspy
- @[email protected]
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Faster Whisper transcription with CTranslate2
Manipulate audio with a simple and easy high level interface
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
Noise supression using deep filtering
This library provides common speech features for ASR including MFCCs and filterbank energies.
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
AI powered speech denoising and enhancement
Easily serialize Data Classes to and from JSON
Command line utility for forced alignment using Kaldi
Simple text to phones converter for multiple languages
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
A fast local neural text to speech engine for Mycroft
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
🔉 Play and Record Sound with Python 🐍
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
Create highly reproducible python environments
Modules to convert numbers to words. 42 --> forty-two
🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.