Skip to content
View Dream-High's full-sized avatar
Block or Report

Block or report Dream-High

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 18 1 Updated Apr 22, 2024

A Pytorch-Lightning Implementation of Transformer Network

Python 10 1 Updated Oct 22, 2020

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 2,638 405 Updated Aug 6, 2024

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

280 11 Updated Aug 14, 2024

LLM&VLM Tutorial

Python 1,228 793 Updated Aug 15, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 843 32 Updated Aug 13, 2024

关于机器学习,深度学习,自然语言处理等各种算法的实现、示例,与博客文章配套,论文复现等

Jupyter Notebook 186 35 Updated Sep 4, 2022

The official source code of UniAudio

Python 80 6 Updated Mar 29, 2024

Official Implementation of "Multitrack Music Transformer" (ICASSP 2023)

Python 133 23 Updated Mar 14, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,420 2,055 Updated Jul 18, 2024

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Python 2,651 274 Updated Aug 4, 2024
Python 195 34 Updated Jan 25, 2024

This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.

Jupyter Notebook 670 61 Updated Oct 17, 2023

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 7,740 983 Updated Jun 27, 2024

Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.19180)

Python 26 2 Updated Jan 19, 2024
Python 178 20 Updated Apr 24, 2024

Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation

Python 72 10 Updated Nov 15, 2023

Self-supervised learning for fast pitch estimation

Python 168 15 Updated Aug 9, 2024

Full models and training code for PESTO

Python 48 12 Updated Jun 12, 2024

Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.04729)

Python 50 8 Updated Jan 18, 2024
Python 243 35 Updated May 15, 2023

SoftVC VITS Singing Voice Conversion

Python 25,099 4,722 Updated Nov 11, 2023

Unofficial download repository for MusicCaps

Python 38 2 Updated Apr 21, 2023

Download the MusicCaps dataset for music captioning

Jupyter Notebook 96 8 Updated Aug 9, 2024

million song dataset split for extended clean tag & artist-level stratified

Jupyter Notebook 46 2 Updated Aug 12, 2023
Python 21 3 Updated Apr 4, 2024

LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]

Python 263 32 Updated Apr 8, 2024

TAPE: An End-to-End Timbre-Aware Pitch Estimator

Jupyter Notebook 18 Updated Nov 25, 2023

Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.

Jupyter Notebook 285 23 Updated May 30, 2024
Next