Denoising Diffusion Implicit Models
-
Updated
Jun 15, 2022 - Python
Denoising Diffusion Implicit Models
Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)
Turn your words into music! Describe a sound (e.g., happy, spooky) and this app generates a short piece based on your text.
The mel spectrogram generator using conditional WGAN-GP. For the mel spectrogram inverter, look up HiFi-GAN
Various projects utilizing diverse generative AI techniques to produce audio, code, images, text, and Streamlit applications.
Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
MIDI generator for chord progressions.
The service is used to query text-to-audio AI models from the Hugging Face inference API.
Site for sharing Bark voices
ai audio processing methods
This repository is a comprehensive guide and toolkit for music generation, featuring diverse algorithms, deep learning models, and creative techniques to inspire and assist in the composition of unique musical pieces.
Code implementation for the paper "Relating Human Perception of Musicality to Prediction in a Predictive Coding Model"
BeeBrain is your personal chatbot. Use tools, generate images, run code and so much more!
Text To Audio (Voice, Music) -Support Chat-GPT
Experiments in neural networks for audio generation.
ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
Image Captioning and Text-to-Speech
Knowledge Distillation of different DDSP Decoders for audio signal generation
Docker image for stable-audio-tools: Generative models for conditional audio generation
Add a description, image, and links to the audio-generation topic page so that developers can more easily learn about it.
To associate your repository with the audio-generation topic, visit your repo's landing page and select "manage topics."