Highlights
Lists (1)
Sort Name ascending (A-Z)
Stars
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
[CVPR 2024🔥] EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection
PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
Music Source Separation Training Inference Webui, besides, we packed UVR together!
Performs the entire AI cover generation process with UI
CSGO: Content-Style Composition in Text-to-Image Generation 🔥
Chai-1, SOTA model for biomolecular structure prediction
Official implementation of "Separate Anything You Describe"
simplest & fastest way to transfer files between computers via WireGuard
TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor contents easier.
A Beautiful Private and Secure Desktop Investment Tracking Application
🚀 Strapi is the leading open-source headless CMS. It’s 100% JavaScript/TypeScript, fully customizable, and developer-first.
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
FlagGems is an operator library for large language models implemented in Triton Language.
Scaling Diffusion Transformers with Mixture of Experts
Text-to-Music Generation with Rectified Flow Transformers
Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
High-resolution models for human tasks.
Fine-tune Stable Audio Open with DiT ControlNet.
Official mirror of Rubber Band Library, an audio time-stretching and pitch-shifting library.