yfyeung

Yifan Yang yfyeung

Speech recognition

64 followers · 67 following

Shanghai Jiao Tong University
Beijing
https://yfyeung.github.io/

Achievements

x3 x2

Achievements

x3 x2

Highlights

Organizations

Block or Report

Block or report yfyeung

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,718 505 Updated May 31, 2024

pengsida / learning_research

本人的科研经验

4,939 306 Updated Jun 1, 2024

csukuangfj / kaldi-native-fbank

Kaldi-compatible online fbank extractor without external dependencies

C++ 70 15 Updated Jun 25, 2024

CLUEbenchmark / SuperCLUE

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

2,854 92 Updated May 23, 2024

k2-fsa / icefall

Python 843 273 Updated Jul 18, 2024

pytorch / torchtune

A Native-PyTorch Library for LLM Fine-tuning

Python 3,632 298 Updated Jul 19, 2024

karpathy / deep-vector-quantization

VQVAEs, GumbelSoftmaxes and friends

Jupyter Notebook 509 42 Updated Nov 20, 2021

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 12,466 1,108 Updated Jul 19, 2024

GitYCC / g2pW

Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)

Python 247 34 Updated Jun 16, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 2,313 212 Updated Jul 18, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 1,454 129 Updated Jul 19, 2024

yt-dlp / yt-dlp

A feature-rich command-line audio/video downloader

Python 77,268 6,062 Updated Jul 19, 2024

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 374 32 Updated Jun 9, 2024

0nutation / SpeechGPT

SpeechGPT Series: Speech Large Language Models

Python 1,053 64 Updated Mar 28, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 27,965 3,035 Updated Jul 20, 2024

datawhalechina / leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 11,235 2,667 Updated Jul 11, 2024

SpeechColab / GigaSpeech2

An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement

Python 82 4 Updated Jun 22, 2024

rasbt / LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 23,140 2,385 Updated Jul 19, 2024

webdataset / webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,116 162 Updated Jul 11, 2024

BlinkDL / RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,016 826 Updated Jul 18, 2024

reazon-research / ReazonSpeech

Massive open Japanese speech corpus

Python 206 14 Updated Jul 17, 2024

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 5,556 731 Updated Jul 14, 2024

JusperLee / Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

724 134 Updated Apr 18, 2022

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 10,977 2,291 Updated Jul 20, 2024