Lists (1)
Sort Name ascending (A-Z)
Stars
An Open-source Streaming High-fidelity Neural Audio Codec
PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing
Multitrack music mixing style transfer given a reference song using differentiable mixing console.
Machine Learning Engineering Open Book
[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs
Types and functions that make it a little easier to work with Core ML in Swift.
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, L…
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.
Unaligned Supervision for Automatic Music Transcription in The Wild
♬ A JavaScript library which provides an API for programmatically generating and creating expressive multi-track MIDI files and JSON.
The official GitHub page for the survey paper "Foundation Models for Music: A Survey".
A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
ISMIR 24 Supplementary Material
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A python package to build AI-powered real-time audio applications
Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, accepted in 2024 ICASSP
The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.
Fine-tune Stable Audio Open with DiT ControlNet.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Port of Now Playing from Pixels to other Android devices
Joint Embedding Predictive Architecture for Musical Stem Compatibility Estimation
Searching for Music Mixing Graphs: A Pruning Approach