Add Option to use Target Model in LCM-LoRA Scripts#6537

Open
dg845 wants to merge 4 commits into huggingface:main from dg845:lcm-lora-scripts-enable-target-model

Conversation

Collaborator

@dg845 dg845 commented Jan 11, 2024

What does this PR do?

This PR enables a target model to be optionally used in the LCM-LoRA distillation scripts via the --use_target_model argument.

Follow up to #6505.

Before submitting

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@patrickvonplaten
@patil-suraj
@sayakpaul
@jon-chuang
@shuminghu

# Checks if the accelerator has performed an optimization step behind the scenes
if accelerator.sync_gradients:
    # 12. If using a target model, update its parameters via EMA.
    update_ema(target_unet.parameters(), unet.parameters(), args.ema_decay)
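For context on what such an update does, here is a minimal, hypothetical sketch of an EMA parameter update; the actual `update_ema` helper in the script may differ in its details:

```python
import torch

@torch.no_grad()
def update_ema(target_params, source_params, decay):
    # Hypothetical minimal EMA sketch:
    # target <- decay * target + (1 - decay) * source
    for target, source in zip(target_params, source_params):
        target.mul_(decay).add_(source, alpha=1.0 - decay)
```

With a decay close to 1.0, the target parameters change slowly, which is the smoothing effect being discussed as a way to stabilize distillation.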

Are you sure this works? I had errors with the LoRA parameters.


I have a workaround in the issue


Both are LoRA weights, so they should work?

Collaborator Author

@dg845 dg845 Jan 18, 2024


I haven't been able to test this fully yet; it's possible that this runs into the errors mentioned in #6505 (comment).

Collaborator Author


I believe the current implementation is correctly updating the LoRA parameters.

# ----Latent Consistency Distillation (LCD) Specific Arguments----
parser.add_argument(
    "--use_target_model",
    action="store_true",

Should this default to false so existing users are not surprised?

Collaborator Author


The target model will be used only if the --use_target_model flag is specified (so existing script calls should work as before).
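To illustrate the point about defaults with a standalone sketch (not the script itself): with `action="store_true"`, argparse sets the value to False unless the flag is passed, so existing invocations keep the old behavior.

```python
import argparse

parser = argparse.ArgumentParser()
# store_true flags default to False, so the feature is strictly opt-in.
parser.add_argument("--use_target_model", action="store_true")

print(parser.parse_args([]).use_target_model)                      # False
print(parser.parse_args(["--use_target_model"]).use_target_model)  # True
```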


Thanks. My bad.

Contributor

@patil-suraj patil-suraj left a comment


The code looks good to me. Could you explain the reasoning behind this? Do you have any experiments that demonstrate the use of this?

Collaborator Author

dg845 commented Jan 18, 2024

#6505 (comment) reports seeing a lot of training instability when training using the current LCM-LoRA script. @jon-chuang, would you be willing to share more details about the training instability?

@dg845 dg845 changed the title [WIP] Add Option to use Target Model in LCM-LoRA Scripts Add Option to use Target Model in LCM-LoRA Scripts Jan 21, 2024
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@jon-chuang

I don't have conclusive evidence that this fix will mitigate the issue, but what I observed was some divergent results on LCM-LoRA training runs.

Anyway, I think it's a zero-cost opt-in feature that may produce better results for some users.

I will definitely try the EMA once it is merged and can report on further results.

@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions Bot added the stale Issues that haven't received updates label Feb 23, 2024
@sayakpaul
Member

@patil-suraj a gentle ping.

@yiyixuxu yiyixuxu added training and removed stale Issues that haven't received updates labels Feb 23, 2024
@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions Bot added the stale Issues that haven't received updates label Mar 19, 2024

Labels

stale Issues that haven't received updates training

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants