Fast LoRA initialization by skipping redundant linearizations #4980

isidentical · 2023-09-11T17:11:28Z

What does this PR do?

Part of #4975, along with #4976 and #4979. This PR uses accelerate's init_empty_weights (as diffusers already does automatically for certain models, such as the VAE for SD) when we initialize the LoRA weights for the first time. This gives a significant speedup by avoiding computations whose results would be overwritten a moment later. On top of #4976 and #4979, this PR makes the load_lora_weights() + unload_lora_weights() cycle take around a second (1.05 seconds to be precise), down from ~6 seconds without any of them. The relative effect on top of the first two PRs (combined) is about 1.33x.
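To illustrate the idea, here is a minimal sketch of skipping weight initialization for layers whose values are about to be replaced by a checkpoint. It is not the code from this PR: it uses PyTorch's meta device directly rather than accelerate's context manager, and the `LoRALinear` class, the layer sizes, and the `checkpoint` dict are hypothetical stand-ins.

```python
import torch
from torch import nn


class LoRALinear(nn.Module):
    """Hypothetical minimal LoRA layer: a rank-r down/up projection pair."""

    def __init__(self, in_features, out_features, rank=4, device=None):
        super().__init__()
        self.down = nn.Linear(in_features, rank, bias=False, device=device)
        self.up = nn.Linear(rank, out_features, bias=False, device=device)

    def forward(self, x):
        return self.up(self.down(x))


# Stand-in for tensors read from a LoRA checkpoint (assumed shapes).
checkpoint = {
    "down.weight": torch.randn(4, 320),
    "up.weight": torch.randn(320, 4),
}

# Build the layer on the meta device: parameters carry shape and dtype only,
# so the usual random initialization does no real work.
layer = LoRALinear(320, 320, rank=4, device="meta")
assert layer.down.weight.is_meta

# Allocate uninitialized storage on the CPU, then fill it from the checkpoint.
layer.to_empty(device="cpu")
layer.load_state_dict(checkpoint)
```

The saving comes from never computing initial values that `load_state_dict` would overwrite anyway; how much this matters in practice depends on the number and size of the LoRA layers.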

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.

patrickvonplaten · 2023-09-12T16:47:43Z

I think this is already fixed by: #4994

isidentical · 2023-09-12T16:50:22Z

Yep, #4994 fixes this!

Fast LoRA initialization by skipping redundant linearizations

be17c8d

isidentical marked this pull request as ready for review September 11, 2023 17:12

isidentical closed this Sep 12, 2023
