
Colab: Balances the data distribution #1710

Merged
merged 3 commits into lllyasviel:develop on Mar 11, 2024
Conversation

Peppe289
Contributor

Peppe289 commented Jan 2, 2024

Analyses

After conducting several tests on Google Colab and reviewing the documentation, I found that launching Fooocus with specific flags enables the use of the cloud GPU's 15GB of VRAM. This significantly reduces system RAM consumption, speeds up processing, and prevents premature process termination.

What the user notices

By default, the VRAM remains unused and the program runs solely on system memory (12GB). The program crashes when attempting to use more than one image in the prompt; the process is terminated with ^C.

After this commit

Using the extra 15GB of VRAM allows you to make the most of the program on Colab. Memory use is well balanced, and the program manages to process 4 images in the prompt.

Thanks for your work, best regards.

enables the use of VRAM so as not to saturate the system RAM
@mashb1t
Collaborator

mashb1t commented Jan 2, 2024

Thank you for the contribution. While I totally agree with adding --always-high-vram, we should not set --disable-offload-from-vram by default, as switching models will cause them to also stay in VRAM, which may cause problems.
Did you test using Fooocus on Colab, incl. switching models?

@Peppe289
Contributor Author

Peppe289 commented Jan 2, 2024

Thank you for the contribution. While I totally agree with adding --always-high-vram, we should not set --disable-offload-from-vram by default, as switching models will cause them to also stay in VRAM, which may cause problems. Did you test using Fooocus on Colab, incl. switching models?

If you mean Advanced -> Model, no. I kept the default settings. As for the two flags, I kept both and didn't notice any problems.

@mashb1t
Collaborator

mashb1t commented Jan 2, 2024

FYI: if you do not offload from VRAM and switch models, every additional model will also be kept in VRAM, which is fine.
Just checked the code; this shouldn't be an issue, as in

    if not ALWAYS_VRAM_OFFLOAD:
        if get_free_memory(device) > memory_required:
            break

models are unloaded when more VRAM is required than is free.
Please nevertheless test with multiple models and provide your results to make sure everything is working as expected. Looking forward to it :)
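The quoted check can be read as part of a loop that frees models one at a time until the pending request fits. Below is a minimal, self-contained sketch of that idea; `unload_models_until_fits`, the model dicts, and the gigabyte figures are hypothetical stand-ins for Fooocus's actual model-management internals, not its real API.

```python
# Sketch of free-memory-driven model eviction, with stand-in values.
ALWAYS_VRAM_OFFLOAD = False

def unload_models_until_fits(loaded_models, free_vram, memory_required):
    """Evict loaded models until the pending request fits in VRAM."""
    unloaded = []
    for model in list(loaded_models):
        # The quoted check: stop evicting once enough VRAM is free,
        # unless offloading is forced.
        if not ALWAYS_VRAM_OFFLOAD and free_vram > memory_required:
            break
        loaded_models.remove(model)
        unloaded.append(model)
        free_vram += model["size"]  # freed VRAM in GB (illustrative)
    return unloaded, free_vram

models = [{"name": "sdxl_base", "size": 6}, {"name": "refiner", "size": 4}]
unloaded, free = unload_models_until_fits(models, free_vram=3, memory_required=8)
print([m["name"] for m in unloaded], free)  # ['sdxl_base'] 9
```

With offloading forced (ALWAYS_VRAM_OFFLOAD = True), the break never triggers and every model is evicted, which matches the intent of always offloading from VRAM.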

@Peppe289
Contributor Author

Peppe289 commented Jan 5, 2024

FYI: if you do not offload from VRAM and switch models, every additional model will also be kept in VRAM, which is fine. Just checked the code; this shouldn't be an issue, as in

    if not ALWAYS_VRAM_OFFLOAD:
        if get_free_memory(device) > memory_required:
            break

models are unloaded when more VRAM is required than is free.
Please nevertheless test with multiple models and provide your results to make sure everything is working as expected. Looking forward to it :)

Sorry for the delay. I have tested the models and everything seems to work correctly. Regarding the flags, I still have to test further, but even with both set they don't seem to cause any problems.

@mashb1t
Collaborator

mashb1t commented Feb 26, 2024

The solution in this PR has been referenced repeatedly and has already helped countless users run Fooocus on Colab.
A final test should be conducted, and a decision needs to be made on whether the changes will be merged to main.

@Peppe289
Contributor Author

The solution in this PR has been referenced repeatedly and has already helped countless users run Fooocus on Colab. A final test should be conducted, and a decision needs to be made on whether the changes will be merged to main.

okay, thanks for considering this change. best regards.

@mashb1t
Collaborator

mashb1t commented Mar 11, 2024

Here are my extensive testing results. Tests were conducted on Colab with a T4 instance (free tier) using 2 IP images (ImagePrompt) and a positive prompt at 1152×896, default model, default styles (irrelevant for the test).

default (only --share)

Process ran out of memory

[Screenshot 2024-03-11 at 18 30 47]

--attention-split

Process ran out of memory

[Screenshot 2024-03-11 at 18 33 46]

--always-high-vram

Process did NOT run out of memory

[Screenshot 2024-03-11 at 18 36 54]

--always-high-vram --disable-offload-from-vram

Process did NOT run out of memory for first generation, but DID run out of memory when using upscale or different adapters afterwards

[Screenshot 2024-03-11 at 18 39 44]

--always-high-vram --attention-split

Process did NOT run out of memory for first generation, but DID run out of memory when using upscale or different adapters afterwards

[Screenshot 2024-03-11 at 18 46 38]

--always-high-vram --disable-offload-from-vram --attention-split

Process did NOT run out of memory for first generation (but overall slower), but DID run out of memory when using upscale or different adapters afterwards

[Screenshot 2024-03-11 at 18 46 38]

--disable-offload-from-vram

Process did NOT run out of memory

[Screenshot 2024-03-11 at 18 49 40]

Learnings:

  • --always-high-vram is overall beneficial, as it shifts load from RAM to the much faster VRAM
  • --disable-offload-from-vram allows for faster loading when doing the same type of generation multiple times, but causes the instance to crash (due to not offloading) when using different adapters or functionalities.
  • --attention-split is overall beneficial and lowers both RAM and VRAM usage, but at the cost of performance

=> using --always-high-vram achieves the overall best balance between performance, flexibility, and stability.
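For reference, the test matrix above can also be expressed as data. A quick sketch (flag tuples and outcomes copied from the results listed in this comment; `True` means the session ran out of memory at some point, whether on the first generation or during a later upscale/adapter swap):

```python
# Outcomes of the Colab T4 test runs above; True = ran out of memory
# at some point in the session.
results = {
    (): True,
    ("--attention-split",): True,
    ("--always-high-vram",): False,
    ("--always-high-vram", "--disable-offload-from-vram"): True,
    ("--always-high-vram", "--attention-split"): True,
    ("--always-high-vram", "--disable-offload-from-vram", "--attention-split"): True,
    ("--disable-offload-from-vram",): False,
}

# Flag sets that never ran out of memory in these tests.
stable = [flags for flags, oom in results.items() if not oom]
print(stable)  # [('--always-high-vram',), ('--disable-offload-from-vram',)]
```

Of the two stable sets, --always-high-vram alone remains the recommendation, since --disable-offload-from-vram was flagged earlier in the thread as risky once multiple models accumulate in VRAM.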

@mashb1t mashb1t changed the base branch from main to develop March 11, 2024 18:57
@mashb1t mashb1t merged commit 532401d into lllyasviel:develop Mar 11, 2024
@poor7

poor7 commented Mar 18, 2024

@mashb1t I'll add that the flags --vae-in-fp16 --unet-in-fp16 --all-in-fp16 further increase generation speed by ~10-20% and reduce memory consumption by ~10-20%, improving overall performance and stability.
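The memory half of that claim follows from storage size alone: fp16 stores each weight in 2 bytes instead of fp32's 4. A back-of-envelope sketch (the 2.6B parameter count is an assumed round figure for illustration only, not a measured Fooocus value):

```python
# Half-precision halves per-parameter storage: 2 bytes vs 4 bytes.
params = 2_600_000_000  # assumed parameter count, for illustration only

fp32_gb = params * 4 / 1024**3  # bytes -> GiB
fp16_gb = params * 2 / 1024**3

print(f"fp32: {fp32_gb:.1f} GiB, fp16: {fp16_gb:.1f} GiB")
# fp32: 9.7 GiB, fp16: 4.8 GiB
```

The speed gain comes from lower memory bandwidth per weight and faster half-precision math on the GPU; the exact percentage will vary per card and workload.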
