- Netease Game AI Lab
- Guangzhou
Stars
MuseScore is free, open source music notation software. For support, contributions, and bug reports, visit MuseScore.org. Fork and make pull requests!
Amazon Kinesis Video Streams WebRTC SDK lets developers install and customize real-time communication between devices and enables secure streaming of video and audio to Kinesis Video Streams.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A fast parallel implementation of RNN Transducer.
Tools for handling speech data in machine learning projects.
medbar / kaldi
Forked from kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Open-Unmix - Music Source Separation for PyTorch
Large Scale Chinese Corpus for NLP
Open tools and data for cloudless automatic speech recognition
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
kaldi-asr/kaldi is the official location of the Kaldi project.
TensorFlow implementation of the paper "Learning to learn by gradient descent by gradient descent" (https://arxiv.org/abs/1606.04474)
Example scripts that illustrate how to use Kaldi+CNTK for speech recognition.