Block or Report
Block or report Dream-High
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
A Pytorch-Lightning Implementation of Transformer Network
Transformer: PyTorch Implementation of "Attention Is All You Need"
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
关于机器学习,深度学习,自然语言处理等各种算法的实现、示例,与博客文章配套,论文复现等
Official Implementation of "Multitrack Music Transformer" (ICASSP 2023)
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
openvpi / DiffSinger
Forked from MoonInTheRiver/DiffSingerAn advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.19180)
Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation
Self-supervised learning for fast pitch estimation
Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.04729)
SoftVC VITS Singing Voice Conversion
Unofficial download repository for MusicCaps
Download the MusicCaps dataset for music captioning
million song dataset split for extended clean tag & artist-level stratified
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
TAPE: An End-to-End Timbre-Aware Pitch Estimator
Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.