Stars
💿 Free software that works great, and also happens to be open-source Python.
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
This is the repository for the distill web framework
A simple way to keep track of an Exponential Moving Average (EMA) version of your pytorch model
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
🔊 Text-Prompted Generative Audio Model
Paper List for a new paradigm of NLP: Interactive NLP (https://arxiv.org/abs/2305.13246) 🔥
Audio generation using diffusion models, in PyTorch.
[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis
DALL·E Mini - Generate images from a text prompt
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
[NeurIPS 2022] Denoising Diffusion Restoration Models -- Official Code Repository
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
A collection of resources and papers on Diffusion Models
Simple, extendable, easy to understand Glow implementation in PyTorch