Stars
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]
中国程序员容易发音错误的单词
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
This repo gives the code for the official implementation of RCT.
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
A pytorch template for beginners based on pytorch_lightning
A library built for easier audio self-supervised training, downstream tasks evaluation
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
Single Shot MultiBox Detector in TensorFlow
A PyTorch Implementation of Single Shot MultiBox Detector
Python sample codes for robotics algorithms.
📡 Simple and ready-to-use tutorials for TensorFlow
Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html