Highlights
- Pro
Stars
ControlLoRA Version 2: A Lightweight Neural Network To Control Stable Diffusion Spatial Information Version 2
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research
High-resolution models for human tasks.
set prompt to divided region
Geometric Computer Vision Library for Spatial AI
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD, TMLR 2024)
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
llama3.np is a pure NumPy implementation for Llama 3 model.
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
Implementation for for "L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors"
Stable Diffusion web UI
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…
Foundational model for human-like, expressive TTS
Mixture-of-Experts for Large Vision-Language Models
A language for constraint-guided and efficient LLM programming.
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)
A quick guide (especially) for trending instruction finetuning datasets