Michael Hansen synesthesiam

Computer/cognitive science PhD, open source voice assistant enthusiast.

532 followers · 2 following

Lists (1)

🔮 Future ideas

1 repository

Stars

106 stars written in Python

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 35,500 4,333 Updated Aug 16, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 12,524 1,048 Updated Nov 17, 2024

jiaaro / pydub

Manipulate audio with a simple and easy high level interface

Python 8,956 1,046 Updated Jul 25, 2024

librosa / librosa

Python library for audio and music analysis

Python 7,180 965 Updated Oct 8, 2024

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,876 1,265 Updated Dec 6, 2023

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,377 428 Updated Nov 13, 2024

kuprel / min-dalle

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Python 3,483 257 Updated Nov 21, 2022

minimaxir / gpt-2-simple

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

Python 3,398 676 Updated Dec 14, 2022

Rikorose / DeepFilterNet

Noise suppression using deep filtering

Python 2,538 237 Updated Oct 17, 2024

jameslyons / python_speech_features

This library provides common speech features for ASR including MFCCs and filterbank energies.

Python 2,376 617 Updated Oct 20, 2021

fatchord / WaveRNN

WaveRNN Vocoder + TTS

Python 2,143 698 Updated Jul 2, 2022

iver56 / audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,874 191 Updated Nov 13, 2024

pydoit / doit

CLI task management & automation tool

Python 1,871 175 Updated Jul 4, 2024

resemble-ai / resemble-enhance

AI powered speech denoising and enhancement

Python 1,432 142 Updated Nov 5, 2024

lidatong / dataclasses-json

Easily serialize Data Classes to and from JSON

Python 1,386 154 Updated Aug 8, 2024

MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Python 1,343 248 Updated Nov 11, 2024

bootphon / phonemizer

Simple text to phones converter for multiple languages

Python 1,232 174 Updated Sep 26, 2024

roshan-research / hazm

Persian NLP Toolkit

Python 1,209 179 Updated Jul 16, 2024

bheinzerling / bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Python 1,187 101 Updated Oct 1, 2024

as-ideas / TransformerTTS

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Python 1,131 227 Updated May 3, 2024

MycroftAI / mimic3

A fast local neural text to speech engine for Mycroft

Python 1,075 103 Updated Dec 8, 2023

alumae / kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.

Python 1,073 339 Updated Jun 8, 2024

spatialaudio / python-sounddevice

🔉 Play and Record Sound with Python 🐍

Python 1,054 149 Updated Nov 1, 2024

asteroid-team / torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 962 88 Updated Nov 8, 2024

DavHau / mach-nix

Create highly reproducible python environments

Python 862 106 Updated May 20, 2024

savoirfairelinux / num2words

Modules to convert numbers to words. 42 --> forty-two

Python 824 501 Updated Oct 2, 2024

Kyubyong / g2p

g2p: English Grapheme To Phoneme Conversion

Python 811 128 Updated Jan 5, 2023

nipunsadvilkar / pySBD

🐍💯 pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detector that works out of the box.

Python 807 84 Updated Aug 20, 2024

snakers4 / open_stt

Open STT

Python 783 81 Updated Mar 11, 2022

lmnt-com / diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Python 775 113 Updated Mar 26, 2024
