-
Computer of Science and Technology Beijing
Highlights
- Pro
Stars
分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
A feature-rich command-line audio/video downloader
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A generative speech model for daily dialogue.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Deezer source separation library including pretrained models.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Convert Machine Learning Code Between Frameworks
Faster Whisper transcription with CTranslate2
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Ongoing research training transformer models at scale
Manipulate audio with a simple and easy high level interface
Official release of InternLM2.5 base and chat models. 1M context support
Official repo for consistency models.
Flax is a neural network library for JAX that is designed for flexibility.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Production First and Production Ready End-to-End Speech Recognition Toolkit
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Multilingual Voice Understanding Model
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
The PyTorch-based audio source separation toolkit for researchers
Self-Supervised Speech Pre-training and Representation Learning Toolkit
中文语音识别; Mandarin Automatic Speech Recognition;