
Conversation

mnslarcher (Contributor)

What does this PR do?

As discussed in issue #4736, this PR tackles the following tasks:

  • Increases the minimum required version of accelerate from 0.16.0 to 0.22.0. This prevents unnecessary peaks in GPU memory consumption during checkpointing with mixed precision.
  • Removes the cast to float32 before saving, avoiding an unnecessary increase in GPU memory consumption.
  • Removes the redundant re-instantiation of the VAE at the end (one is already defined earlier in the script).
  • Deletes unused models before creating the final pipeline (see the sketch after this list).
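
For illustration, a minimal sketch of the cleanup pattern in the last point, assuming the training-only models are held in local variables (the helper and variable names are hypothetical, not the exact PR diff):

```python
import gc

import torch


def release_gpu_memory():
    """Collect dropped references and release cached CUDA blocks so the
    final pipeline can be created without a memory spike."""
    gc.collect()
    if torch.cuda.is_available():
        torch.cuda.empty_cache()


# Hypothetical usage: drop the training-only models, then release memory.
# del unet, text_encoder_one, text_encoder_two
# release_gpu_memory()
```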

The command `make test-examples` completed successfully.

The following images were generated using the weights saved in fp32:

[image: image_fp32]

The following images were generated using the same weights as above, but saved in fp16:

[image: image_fp16]

Previously, peak memory consumption on an RTX 4090 was borderline during checkpointing and final weight saving. Now, it stays consistently below 75%.

Saving and loading also work with bf16.
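
As a quick sanity check of loading, a hedged sketch (the model ID and path are illustrative; `load_lora_weights` is the usual diffusers entry point):

```python
import torch

from diffusers import StableDiffusionXLPipeline

# The saved LoRA weights load the same way whether they were written in
# fp32, fp16, or bf16; the pipeline dtype is chosen independently here.
pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.bfloat16
)
pipe.load_lora_weights("path/to/output_dir")
```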

Fixes #4736


Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@sayakpaul

```python
# Final inference
# Load previous pipeline
vae = AutoencoderKL.from_pretrained(
```
Contributor:

is this already defined?

mnslarcher (Contributor, Author):

Yes, we never `del` the one defined here, so it is still there even at the end. And... it works without it :)
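
For context, a rough sketch of what "already defined" means here (model IDs are illustrative, not necessarily the script's defaults):

```python
import torch

from diffusers import AutoencoderKL, StableDiffusionXLPipeline

# The vae is already instantiated earlier in the script...
vae = AutoencoderKL.from_pretrained(
    "madebyollin/sdxl-vae-fp16-fix", torch_dtype=torch.float16
)

# ...so final inference can reuse it rather than creating a second copy:
pipeline = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    vae=vae,
    torch_dtype=torch.float16,
)
```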

@patrickvonplaten (Contributor) left a comment:

Generally looks good to me - wdyt @sayakpaul?

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

```diff
@@ -1,4 +1,4 @@
-accelerate>=0.16.0
+accelerate>=0.22.0
```
Member:

Is this necessary?

mnslarcher (Contributor, Author):

Unfortunately, without this change there is still a spike in GPU memory during checkpointing :/ From Zach's message on the issue, my understanding is that there was a bug when saving with the accelerator while using mixed precision.
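
For reference, a minimal sketch of the checkpointing path in question (assumes a CUDA GPU; the directory name is illustrative, and `Accelerator.save_state` is the standard Accelerate API):

```python
import torch

from accelerate import Accelerator

# Mirrors the training setup: fp16 mixed precision requires a GPU.
accelerator = Accelerator(mixed_precision="fp16")
model = accelerator.prepare(torch.nn.Linear(8, 8))

# With accelerate < 0.22.0, saving state under mixed precision could
# spike GPU memory; bumping to >= 0.22.0 avoids it.
accelerator.save_state("checkpoint-0")
```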

@sayakpaul (Member):
Looks amazing actually. Thanks for delving deep here and for the fixes!

@sayakpaul merged commit 87ae330 into huggingface:main on Aug 28, 2023.
AmericanPresidentJimmyCarter pushed a commit to AmericanPresidentJimmyCarter/diffusers that referenced this pull request on Apr 26, 2024:
…4791)

* Increase min accelerate ver to avoid OOM when mixed precision

* Rm re-instantiation of VAE

* Rm casting to float32

* Del unused models and free GPU

* Fix style
Development

Successfully merging this pull request may close these issues.

CUDA out of memory and invalid value encountered in cast with train_text_to_image_lora_sdxl.py