---
title: GLIGEN
emoji: 👁
colorFrom: red
colorTo: green
sdk: gradio
sdk_version: 3.15.0
app_file: app.py
pinned: false
---

Gradio App Demo for GLIGEN

🎶 Introduction

This folder contains the source code of our Gradio app demo for GLIGEN. The app automatically downloads and loads our checkpoints hosted on Hugging Face.
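For reference, the snippet below is a minimal sketch of how a checkpoint hosted on Hugging Face can be fetched and loaded with huggingface_hub; the repo_id and filename are illustrative placeholders, not the demo's actual values.

import torch
from huggingface_hub import hf_hub_download

# Download (and cache) a checkpoint file from the Hugging Face Hub.
# repo_id and filename are placeholders for illustration only.
ckpt_path = hf_hub_download(
    repo_id="your-org/your-gligen-checkpoints",
    filename="checkpoint_generation_text.pth",
)
state_dict = torch.load(ckpt_path, map_location="cpu")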

NOTE: You may notice slight implementation differences in the pipeline between this code base and the main GLIGEN repo, although the functionality and the checkpoints are the same. We will replace this pipeline with Diffusers once the integration is finished.

🧰 Installation

To install the GLIGEN demo with CUDA support, create a conda environment:

conda env create -f environment.yaml

If you don't have a CUDA-enabled GPU, you can run the demo on a CPU, though it will be very slow. On MacBooks with Apple Silicon (M1), MPS is supported for some speedup (much faster than CPU, slower than CUDA). To use the MacBook GPU, make sure you install conda via Miniforge for the arm64 architecture (Mambaforge is recommended):

mamba env create -f environment_cpu_mps.yaml
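When choosing a device at runtime, a PyTorch pipeline would typically prefer CUDA, then MPS, then CPU. The snippet below is a minimal sketch of that selection logic, not the demo's exact code.

import torch

# Pick the best available device: CUDA first, then Apple MPS, then CPU.
if torch.cuda.is_available():
    device = torch.device("cuda")
elif torch.backends.mps.is_available():
    device = torch.device("mps")
else:
    device = torch.device("cpu")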

📓 Usage

Activate the environment with:

conda activate gligen_demo

By default, the app only loads the base text-box generation pipeline to save memory. You will see an error in the UI if you try to run a pipeline that has not been loaded. Modify the command-line arguments to enable or disable specific pipelines:

python app.py \
    --load-text-box-generation=True \
    --load-text-box-inpainting=False \
    --load-text-image-box-generation=False
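
For example, to additionally load the inpainting pipeline, set its flag to True:

python app.py \
    --load-text-box-generation=True \
    --load-text-box-inpainting=True \
    --load-text-image-box-generation=False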

❓ How do you draw bounding boxes with the Gradio sketchpad?

Gradio does not natively support drawing bounding boxes in its sketchpad. In this repo, we use a simple workaround: users draw their boxes with the freeform brush, and the backend computes the minimum and maximum stroke coordinates along the x and y axes to "guess" a bounding box; a sketch of this step is shown below. The interpreted boxes are visualized alongside the sketchpad for a better user experience.
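The following is a minimal sketch of that guessing step, assuming the stroke arrives as a binary NumPy mask; the function name and mask format are illustrative, not the demo's actual helper.

import numpy as np

def guess_box(mask: np.ndarray):
    # Return (x_min, y_min, x_max, y_max) covering all drawn pixels, or None.
    ys, xs = np.nonzero(mask)  # coordinates of every drawn pixel
    if xs.size == 0:
        return None            # nothing was drawn
    return int(xs.min()), int(ys.min()), int(xs.max()), int(ys.max())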

We hope Gradio will add native support for drawing bounding boxes soon! 🥳

❄️ TODO

  • Use diffusers as the inference pipeline
  • Refactor code base

📖 Citation

@article{li2023gligen,
  title={GLIGEN: Open-Set Grounded Text-to-Image Generation},
  author={Li, Yuheng and Liu, Haotian and Wu, Qingyang and Mu, Fangzhou and Yang, Jianwei and Gao, Jianfeng and Li, Chunyuan and Lee, Yong Jae},
  journal={CVPR},
  year={2023}
}

Disclaimer

The original GLIGEN was partly implemented and trained during an internship at Microsoft. This repo re-implements GLIGEN in PyTorch on university GPUs after the internship. Despite the minor implementation differences, this repo aims to reproduce the results and observations in the paper for research purposes.