-
University of Texas at Dallas
- https://mu-y.github.io/
- @MuYang55
Highlights
- Pro
Stars
An opinionated list of awesome Python frameworks, libraries, software and resources.
All Algorithms implemented in Python
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Clone a voice in 5 seconds to generate arbitrary speech in real-time
2023年最新总结,阿里,腾讯,百度,美团,头条等技术面试题目,以及答案,专家出题人分析汇总。
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Instant voice cloning by MIT and MyShell.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
SoftVC VITS Singing Voice Conversion
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep lear…
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
State-of-the-Art Text Embeddings
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Python package built to ease deep learning on graph, on top of existing DL frameworks.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Train transformer language models with reinforcement learning.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.