-
University of Rochester
- Rochester, NY
-
06:37
(UTC -04:00) - yeates.github.io
Highlights
- Pro
Stars
A latent text-to-image diffusion model
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
🔊 Text-Prompted Generative Audio Model
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Natural Language Processing Tutorial for Deep Learning Researchers
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
High-Resolution Image Synthesis with Latent Diffusion Models
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Using Low-rank adaptation to quickly fine-tune diffusion models.
Official Code for Stable Cascade
Inpaint anything using Segment Anything and inpainting models.
Taming Transformers for High-Resolution Image Synthesis
Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation" (CVPR 2021) presenting the pixel2style2pixel (pSp) framework
Chess reinforcement learning by AlphaGo Zero methods.
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
Concept Sliders for Precise Control of Diffusion Models
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models
[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR
[SIGGRAPH Asia 2022] IDE-3D: Interactive Disentangled Editing For High-Resolution 3D-aware Portrait Synthesis
Remove unwanted objects and restore images without prompts, powered by ControlNet.