Stable Diffusion with Core ML on Apple Silicon


Mobile Stable Diffusion with DreamDrawer

This is the official repo for the paper "DreamDrawer: Ultra-fast Open-sourced Text-to-Image Generation on Mobile Devices"

We present a new iOS mobile application, DreamDrawer, capable of on-device text-to-image generation in under 3 seconds.

*Sample images generated within 3 seconds on an iPhone 13 Pro.

*Note: Safety checking is enforced by default in our mobile application. If you require it to be disabled for debugging purposes, please pull the iOS Mobile App code from the repo and run a new instance of the application on your device.

The code for the mobile application, the weights, and the model training are all open-sourced and provided via the links below.

All users are welcome to try the application via our TestFlight link. We recommend using the application with target devices having equivalent or better hardware specifications than iPhone 13 Pro to achieve a similar experience.

List of Modules Provided

Deploying, Compiling, or Running Inference with CoreML

Directly deploying our pre-trained and compiled weights

Our converted checkpoints can be found on HuggingFace via this link: https://huggingface.co/davidw0311/sd-coreml.

How to convert your own custom PyTorch model to CoreML model weights

The text encoder and VAE encoder/decoder can be converted to CoreML following Apple's official repository https://github.com/apple/ml-stable-diffusion.

To convert the UNet, we provide a custom script, torch2coreml_custom.py, tailored specifically to converting a distilled model with a new UNet. To perform the conversion, first set the following environment variables:

export PYTORCH_UNET_PATH=<path_to_pytorch_unet>
export COREML_UNET_SAVEPATH=<path_to_save_coreml_unet>

replacing <path_to_pytorch_unet> with the path to the PyTorch UNet checkpoint, and <path_to_save_coreml_unet> with the location where the converted model will be saved. Then, execute:

python -m python_coreml_stable_diffusion.torch2coreml_custom --convert-unet --model-version "lykon/absolutereality" -o $COREML_UNET_SAVEPATH --unet-path $PYTORCH_UNET_PATH --compute-unit CPU_AND_NE --quantize-nbits 6 --attention-implementation SPLIT_EINSUM

After the model has been converted into a CoreML .mlpackage, run the following to compile it to a .mlmodelc file:

xcrun coremlcompiler compile <path_to_mlpackage> <output_dir>

replacing <path_to_mlpackage> with the path to the .mlpackage file and <output_dir> with the directory where the compiled .mlmodelc will be written.
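The --quantize-nbits 6 flag in the conversion command compresses the UNet weights to 6 bits, i.e. at most 64 representable values per tensor. As a rough, framework-independent illustration of what that means (the real pipeline uses coremltools' own compression, not this code), here is a minimal NumPy sketch of uniform 6-bit quantization:

```python
import numpy as np

def quantize_nbits(weights, nbits=6):
    """Uniformly quantize a weight tensor to 2**nbits discrete levels.

    Conceptual illustration only: the actual conversion relies on
    coremltools' built-in weight compression, not this function.
    """
    levels = 2 ** nbits
    w_min, w_max = weights.min(), weights.max()
    scale = (w_max - w_min) / (levels - 1)
    # Snap each weight to the nearest of the 2**nbits levels, then dequantize.
    indices = np.round((weights - w_min) / scale).astype(np.int32)
    return w_min + indices * scale, indices

rng = np.random.default_rng(0)
w = rng.standard_normal((128, 128)).astype(np.float32)
w_q, idx = quantize_nbits(w, nbits=6)

print(len(np.unique(idx)))           # at most 64 distinct levels survive
print(float(np.abs(w - w_q).max()))  # max error bounded by scale / 2
```

Halving the bit width roughly halves the on-disk UNet size, at the cost of a small reconstruction error per weight.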

Implementing the LCM Scheduler for our new model or your own LCM

The Latent Consistency Scheduler is added to the implementation in Scheduler.swift.

Building on Apple's framework, we can now directly use the lcmScheduler option to select LCM as the scheduler.
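Conceptually, LCM sampling differs from standard DDIM-style denoising: at each step the distilled model predicts the clean latent in one shot, and the scheduler re-noises that prediction down to the next timestep, which is why only a handful of steps are needed. A minimal NumPy sketch of this loop, with a hypothetical consistency_fn standing in for the UNet and a toy noise schedule (not the Scheduler.swift implementation):

```python
import numpy as np

def lcm_sample(consistency_fn, shape, timesteps, rng):
    """Sketch of Latent Consistency Model sampling.

    consistency_fn(x_t, t) is a stand-in for the distilled UNet: it maps a
    noisy latent directly to a denoised estimate. The loop below mirrors the
    high-level structure of LCM sampling (predict clean latent, then re-noise
    to the next timestep), with a deliberately simplified noise schedule.
    """
    x = rng.standard_normal(shape)           # start from pure noise
    for i, t in enumerate(timesteps):
        x0 = consistency_fn(x, t)            # one-shot denoised prediction
        if i < len(timesteps) - 1:
            t_next = timesteps[i + 1]
            sigma = t_next / timesteps[0]    # toy noise level in [0, 1]
            noise = rng.standard_normal(shape)
            x = np.sqrt(1 - sigma**2) * x0 + sigma * noise
        else:
            x = x0                           # final step returns the estimate
    return x

rng = np.random.default_rng(123)
fake_model = lambda x, t: 0.5 * x            # hypothetical stand-in network
out = lcm_sample(fake_model, (4, 8, 8), [999, 759, 499, 259], rng)
print(out.shape)
```

With only four timesteps, this structure matches the --step-count 4 inference setting used below.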

Using the CoreML models for inference

First, export the path to the compiled CoreML models:

export COREML_MODELS_PATH=<path_to_models>

replacing <path_to_models> with the absolute path to the models folder.

Then generate an image using:

swift run StableDiffusionSample "a cat" --resource-path $COREML_MODELS_PATH --seed 123456 --disable-safety --compute-units cpuAndNeuralEngine --step-count 4 --output-path images --scheduler lcm --guidance-scale 1.0

On a MacBook, loading the model for the first time may take a few minutes; subsequent image generations should take only a few seconds.

The generated images will appear in the images folder, tagged with the prompt name and random seed.

Model Training

Our training for a new smaller UNet model can be found at this repo.

Our LCM fine-tuning script for fewer inference steps can be found at this repo.

Model Evaluation

Our evaluation script for a new model using Human Preference Score V2 can be found at this repo, or you can refer to the original HPS V2 repo.
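Human Preference Score evaluations assign each generated image a scalar preference score, and models are then compared by aggregates such as mean score or win rate over a shared prompt set. As an illustration of that comparison step only (the scores below are hypothetical, and this is not the HPS V2 evaluation code), a minimal sketch:

```python
import numpy as np

def preference_win_rate(scores_a, scores_b):
    """Fraction of prompts where model A's image outscores model B's.

    scores_a / scores_b are per-prompt preference scores (e.g. as produced
    by an HPS-style scorer); this only illustrates how such scores are
    typically aggregated, not the official HPS V2 pipeline.
    """
    scores_a = np.asarray(scores_a, dtype=float)
    scores_b = np.asarray(scores_b, dtype=float)
    return float(np.mean(scores_a > scores_b))

# Hypothetical per-prompt scores for two models on the same five prompts.
a = [0.291, 0.305, 0.280, 0.312, 0.298]
b = [0.288, 0.301, 0.295, 0.299, 0.302]
print(preference_win_rate(a, b))  # -> 0.6
```

A win rate above 0.5 indicates that the candidate model's outputs are preferred on a majority of prompts.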

Acknowledgments
