Skip to content
View chenxy12's full-sized avatar
Block or Report

Block or report chenxy12

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch CTC Decoder bindings

C++ 816 240 Updated Apr 4, 2024

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Python 27,553 3,310 Updated Jul 19, 2024

Hydra is a framework for elegantly configuring complex applications

Python 8,433 608 Updated Jul 17, 2024

PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)

Python 129 16 Updated Nov 22, 2022

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python 668 115 Updated Oct 23, 2023

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 919 173 Updated Dec 22, 2023

Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。

Python 569 100 Updated Jul 17, 2024

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 24,807 3,926 Updated Jun 22, 2024

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Python 287 44 Updated Aug 5, 2021

CUDA-Warp RNN-Transducer

Python 211 41 Updated Feb 22, 2023

Streaming 가능한 RNN Transducer 모델을 PyTorch Lightning으로 구현해본다.

Python 6 Updated Dec 20, 2022

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python 34 7 Updated Oct 18, 2021

ASRT:一个基于CTC的流式语言识别框架

Python 7 Updated Feb 28, 2023

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Python 736 177 Updated Jul 6, 2023

PyTorch Implementations for End-to-End Automatic Speech Recognition

Python 126 27 Updated Jun 10, 2019

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.

Python 1,174 319 Updated Dec 19, 2020

End-to-end ASR/LM implementation with PyTorch

Python 589 140 Updated Aug 30, 2021

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 13,959 5,294 Updated Jun 29, 2024

⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

Python 916 242 Updated Jul 15, 2024

A framework for automatic speech recognition

Python 46 7 Updated Apr 1, 2023

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 10,658 1,810 Updated Jul 19, 2024

Conformer RNN-Transducer

Python 13 1 Updated May 25, 2022

RNN-Transducer for korean

Python 38 3 Updated Oct 31, 2020

A fast parallel implementation of RNN Transducer.

C++ 306 124 Updated Jun 7, 2023

End-to-End Speech Processing Toolkit

Python 8,141 2,138 Updated Jul 18, 2024

🔥 ASR教程: https://dataxujing.github.io/ASR-paper/

19 4 Updated Jul 1, 2024

Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)

Python 30 2 Updated Feb 19, 2021

A pytorch_lightning reimplementation of the Transducer module from ESPnet.

Python 75 17 Updated Mar 11, 2021

主要参考李宏毅老师2020年人类语言处理课程资料整理,包括代码和ppt

31 4 Updated May 25, 2021

使用python进行语音识别

Python 126 541 Updated Feb 16, 2022
Next