Starred repositories
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
Chat with your database (SQL, CSV, pandas, polars, MongoDB, NoSQL, etc.). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
Pitch Estimating Neural Networks (PENN)
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Inference and training library for high-quality TTS models.
CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus
A Python package to analyze and compare voices with deep learning
Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages
CodeSwitch is an NLP tool for language identification, POS tagging, named entity recognition, and sentiment analysis of code-mixed data.
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Hunspell UTF8 dictionaries. These work with Sublime Text. [Spell check]
🇳🇱🇧🇪🇸🇷 Dutch word list by OpenTaal
A multi-voice TTS system trained with an emphasis on quality
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
Simple text to phones converter for multiple languages
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
The PyTorch-based audio source separation toolkit for researchers
A PyTorch implementation of Google's Non-Attentive Tacotron.