[MM]Fix bug in offload_unlocked_models() call #6450

lstein · 2024-05-28T00:06:16Z

Summary

When lazy offloading is false, the model locker's unlock() method was asking offload_unlocked_models to free up VRAM sufficient to load the model that is being unlocked. This was incorrect. The method should be called with a size of 0, which will have the effect of removing models until VRAM is less than or equal to max_vram_cache.

Related Issues / Discussions

Please see #6439 (review) for @RyanJDick 's discovery of this bug.

QA Instructions

Merge Plan

Merge when approved.

Checklist

The PR has a short but descriptive title, suitable for a changelog
Tests added / updated (if applicable)
Documentation added / updated (if applicable)

…imit only

when unlocking models, offload_unlocked_models should prune to vram l…

e3a5dc1

…imit only

lstein requested review from blessedcoolant, brandonrising, RyanJDick and hipsterusername as code owners May 28, 2024 00:06

github-actions bot added python PRs that change python files backend PRs that change backend files labels May 28, 2024

lstein mentioned this pull request May 28, 2024

LoRA patching optimization #6439

Merged

3 tasks

RyanJDick approved these changes May 28, 2024 •

edited

Loading

View reviewed changes

lstein enabled auto-merge (squash) May 29, 2024 02:52

Merge branch 'main' into lstein/bugfix/offload-unlocked-models-bug

cab4a62

lstein merged commit 21a60af into main May 29, 2024
14 checks passed

lstein deleted the lstein/bugfix/offload-unlocked-models-bug branch May 29, 2024 03:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MM]Fix bug in offload_unlocked_models() call #6450

[MM]Fix bug in offload_unlocked_models() call #6450

lstein commented May 28, 2024 •

edited

Loading

[MM]Fix bug in offload_unlocked_models() call #6450

[MM]Fix bug in offload_unlocked_models() call #6450

Conversation

lstein commented May 28, 2024 • edited Loading

Summary

Related Issues / Discussions

QA Instructions

Merge Plan

Checklist

lstein commented May 28, 2024 •

edited

Loading