Stars
SALMONN: Speech Audio Language Music Open Neural Network
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Simple package for binding functions to CLI or config files.
Dora is an experiment management framework. It expresses grid searches as pure python files as part of your repo. It identifies experiments with a unique hash signature. Scale up to hundreds of exp…
Zstandard - Fast real-time compression algorithm
TorchCFM: a Conditional Flow Matching library
Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)
Python packaging and dependency management made easy
EVAR ~ Evaluation package for Audio Representations
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
Wwise plugin that runs RAVE models, enabling real-time timbre transfer via neural audio synthesis in a game audio setting
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
🔊 Text-Prompted Generative Audio Model
Python 3.8+ toolbox for submitting jobs to Slurm
Muzic: Music Understanding and Generation with Artificial Intelligence
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Object-oriented handling of audio data, with GPU-powered augmentations, and more.
PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight