FnoY0723

FnoY FnoY0723

2 followers · 2 following

https://orcid.org/0009-0003-8767-1172

Stars

24 results for source starred repositories

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 27,120 3,071 Updated Aug 12, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 6,959 739 Updated Nov 15, 2024

Audio-WestlakeU / UMA-ASR

This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).

Shell 16 3 Updated Oct 29, 2024

HqWu-HITCS / Awesome-Chinese-LLM

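A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are inexpensive to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.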

15,993 1,477 Updated Sep 19, 2024

Audio-WestlakeU / RealMAN

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

Python 93 11 Updated Oct 12, 2024

shimohq / chinese-programmer-wrong-pronunciation

Words that Chinese programmers commonly mispronounce

JavaScript 22,284 1,584 Updated Aug 16, 2024

Audio-WestlakeU / FS-EEND

The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

Python 84 4 Updated Oct 17, 2024

espnet / espnet

End-to-End Speech Processing Toolkit

Python 8,503 2,185 Updated Nov 14, 2024

Audio-WestlakeU / RCT

This repo gives the code for the official implementation of RCT.

Python 12 1 Updated Jun 28, 2022

Audio-WestlakeU / McNet

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023

Python 108 13 Updated Mar 24, 2023

Audio-WestlakeU / pytorch_lightning_template_for_beginners

A pytorch template for beginners based on pytorch_lightning

Python 36 5 Updated Feb 1, 2024

Audio-WestlakeU / audiossl

A library built for easier audio self-supervised training and downstream task evaluation

Python 106 10 Updated Aug 27, 2024

Audio-WestlakeU / FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python 549 155 Updated Aug 19, 2023

Audio-WestlakeU / FN-SSL

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization

Python 90 10 Updated Nov 13, 2024

Audio-WestlakeU / NBSS

The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

Python 232 26 Updated Nov 4, 2024

Audio-WestlakeU / RVAE-EM

Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]

Python 42 4 Updated Mar 20, 2024