Production First and Production Ready End-to-End Speech Recognition Toolkit
-
Updated
Nov 8, 2024 - Python
Production First and Production Ready End-to-End Speech Recognition Toolkit
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
OpenAI Whisper ASR Webservice API
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Open STT
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
End-to-end ASR/LM implementation with PyTorch
On-device streaming speech-to-text engine powered by deep learning
This is a list of features, scripts, blogs and resources for better using Kaldi ( https://kaldi-asr.org/ )
一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1
On-device speech-to-text engine powered by deep learning
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
The dataset of Speech Recognition
🔉 Youtube Videos Transcription with OpenAI's Whisper
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Add a description, image, and links to the automatic-speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the automatic-speech-recognition topic, visit your repo's landing page and select "manage topics."