synesthesiam

Michael Hansen synesthesiam

Computer/cognitive science PhD, open source voice assistant enthusiast.

472 followers · 2 following

Lists (1)

🔮 Future ideas

1 repository

Stars

brian-smith-github / ch32v003_stt

Simple Speech-To-Text on the '10 cents' CH32V003 Microcontroller

C 208 8 Updated May 23, 2024

domesticatedviking / TextyMcSpeechy

Easily create text-to-speech models in any voice for rhasspy/piper. Make a text-to-speech model with your own voice recordings, or use thousands of RVC voices. Works offline on a Raspberry pi. Rapi…

Shell 177 3 Updated Jun 11, 2024

evuraan / mintPiper

Make Linux speak what's on the screen: clearly and securely.

Python 10 1 Updated Apr 6, 2024

StuartIanNaylor / 2ch_delay_sum

2 channel delay sum beamformer

C 5 Updated Feb 9, 2023

greg-kennedy / p5-NRL-TextToPhoneme

Perl implementation of the Naval Research Laboratory text-to-phoneme algorithm, described by Elovitz et al (1976)

Perl 12 2 Updated May 7, 2020

Lord-Nightmare / NRL_TextToPhonemes

C 2 Updated Jun 25, 2024

acon96 / home-llm

A Home Assistant integration & Model to control your smart home using a Local LLM

Python 473 53 Updated Jun 16, 2024

ser / wyoming-whisper-api-client

Wyoming protocol server for the Whisper API speech to text system

Python 17 3 Updated May 21, 2024

bookbot-hive / babygruut

Forked from rhasspy/gruut

A tokenizer, text cleaner, and phonemizer for many human languages.

Python 2 Updated Mar 14, 2023

MallocArray / airgradient_esphome

ESPHome definition for an AirGradient DIY device to send data to HomeAssistant and AirGradient servers

199 29 Updated Jun 23, 2024

kahrendt / microWakeWord

A TensorFlow based wake word detection training framework using synthetic sample generation suitable for certain microcontrollers.

Python 136 11 Updated Jun 15, 2024

ClaimCompass / num2cyrillic

Python class for converting numbers into Bulgarian cyrillic words

Python 5 Updated Aug 11, 2018

nothings / stb

stb single-file public domain libraries for C/C++

C 25,695 7,662 Updated Jun 20, 2024

resemble-ai / resemble-enhance

AI powered speech denoising and enhancement

Python 1,063 100 Updated Jun 21, 2024

nsu-ai / russian_g2p

Accentor and transcriptor for Russian language

Python 118 24 Updated Jun 19, 2022

kahrendt / esphome-on-device-wake-word

Detect wake words for ESPHome's voice assistant component on the device

PureBasic 28 3 Updated Feb 21, 2024

czyzi0 / the-mc-speech-dataset

Free speech dataset consisting of 24018 short audio clips of a single speaker reading sentences in Polish

4 Updated Dec 29, 2023

ewan-xu / LibrosaCpp

LibrosaCpp is a C++ implementation of librosa to compute short-time Fourier transform coefficients, mel spectrogram, or MFCC

C++ 170 42 Updated Dec 28, 2020

Wataru-Nakata / miipher

Unofficial implementation of miipher

Python 92 14 Updated Apr 19, 2024

jiaaro / pydub

Manipulate audio with a simple and easy high level interface

Python 8,534 1,012 Updated May 17, 2024

fwartner / home-assistant-wakewords-collection

Community Collection of Wake-Words for Home Assistant

255 42 Updated Jun 25, 2024

mush42 / libtashkeel

Add diacritics to Arabic text with ease

Rust 15 1 Updated May 10, 2024

elazarg / nakdimon

Hebrew Diacritizer

Jupyter Notebook 24 5 Updated May 21, 2024

NTT123 / light-speed

A modified VITS that utilizes phoneme duration's ground truth for better robustness

Python 101 25 Updated Aug 27, 2023

rhasspy / piper

A fast, local neural text to speech system

C++ 4,874 338 Updated Jun 24, 2024

voithru / voice-activity-detection

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

Python 142 25 Updated Oct 26, 2021

mush42 / hareef

State-of-the-art models for diacritics restoration for the Arabic language

Python 7 3 Updated May 3, 2024

shahizat / jetsonGPT

Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech

Python 90 10 Updated Jun 15, 2023

roatienza / efficientspeech

PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.

Jupyter Notebook 142 25 Updated Mar 18, 2024

asteroid-team / torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 896 86 Updated Apr 4, 2024