Starred repositories
Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL
The repo provides information about KeSpeech dataset.
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
LangChain 的中文入门教程
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Easy to use vocal separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Implementation for the paper "Can Language Models Learn to Listen?"
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021
GUI for a Vocal Remover that uses Deep Neural Networks.
Real-time Automatic Piano Transcription using PyTorch with Web Visualization
Cola-Ace / Python-ncmdump
Forked from allenfrostline/pyNCMDUMP使用Python3编写的脚本,可将网易云音乐下载的.ncm格式转换成.flac和.mp3
Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.
ONSETS&VELOCITIES real-time piano detection - PyTorch training [EUSIPCO2023]
A Collection of Variational Autoencoders (VAE) in PyTorch.
Pytorch Implementation of "Neural Discrete Representation Learning"
[CVPR 2023] CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
NVIDIA's Deep Imagination Team's PyTorch Library
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".
Production First and Production Ready End-to-End Speech Recognition Toolkit
Simple GUI for ByteDance's Piano Transcription with Pedals
Automatic Chord Recognition tools - ISMIR2021 Late-Breaking Demo presentation
A generative speech model for daily dialogue.
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"