Conversation

@piEsposito
Contributor

Solves #767.

In accelerate.cpu_offload and accelerate.disk_offload, moves the AlignDevicesHook with io_same_device so it is added before the attach_align_device_hook call.

That way we keep the changes to the forward method for the whole module without deleting the hook we want to keep: the one that holds the execution device and the configuration for moving tensors between devices.
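
Roughly, the reordering described here amounts to the sketch below. It uses the accelerate.hooks primitives (AlignDevicesHook, add_hook_to_module, attach_align_device_hook) but is not the actual diff; the function name and signature are abbreviated for illustration.

```python
import torch
from accelerate.hooks import AlignDevicesHook, add_hook_to_module, attach_align_device_hook

def cpu_offload_sketch(model, execution_device, state_dict=None):
    # Illustrative only: a simplified stand-in for accelerate.cpu_offload.
    if state_dict is None:
        state_dict = {name: param.to("cpu") for name, param in model.state_dict().items()}
    # The io_same_device hook is added first (the reordering this PR makes),
    # so it is not overwritten below and inputs/outputs stay on the caller's device.
    add_hook_to_module(model, AlignDevicesHook(io_same_device=True))
    # Then attach the hooks that move weights to the execution device on the fly.
    attach_align_device_hook(
        model,
        execution_device=execution_device,
        offload=True,
        weights_map=state_dict,
    )
    return model
```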

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Oct 17, 2022

The documentation is not available anymore as the PR was closed or merged.

@sgugger
Collaborator

sgugger commented Oct 18, 2022

The fix is not exactly right: by doing so, the hook that ensures the input and output of the model are on the same device is now erased. In your code sample in #767, since x is on the CPU, net(x) should also be on the CPU. This is not the case with your PR. The solution would be to write a util function that will:

  • just add the hook if none is present
  • extract the current hook if one is present and chain it with this hook using a SequentialHook.

This is slightly more advanced than the current PR, so let me know if you'd prefer for me to do it :-)
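
For reference, a minimal sketch of the util function described above, assuming accelerate's existing SequentialHook, add_hook_to_module, and remove_hook_from_module, plus the private `_hf_hook` attribute accelerate uses to store the attached hook. The helper name is hypothetical; the real implementation may differ.

```python
from accelerate.hooks import SequentialHook, add_hook_to_module, remove_hook_from_module

def add_or_chain_hook(module, hook):
    # Hypothetical helper, not part of accelerate's public API.
    old_hook = getattr(module, "_hf_hook", None)
    if old_hook is None:
        # No hook present: just add the new one.
        return add_hook_to_module(module, hook)
    # A hook is already attached: detach it and chain it with the new one
    # so both run, in order, on every forward pass.
    remove_hook_from_module(module)
    return add_hook_to_module(module, SequentialHook(old_hook, hook))
```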

@piEsposito
Contributor Author

@sgugger I would like to try, if that's ok with you.

What do you think of creating an append_if_needed flag on add_hook_to_module that does what you just said?

@sgugger
Collaborator

sgugger commented Oct 18, 2022

That works for me, though the name of the argument could simply be append :-)
Thanks for diving into this!

@piEsposito
Contributor Author

@sgugger append it is.
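
A minimal sketch of how the proposed `append` flag would be used, following the discussion above (the exact behavior is whatever lands in accelerate.hooks; the module here is a placeholder):

```python
import torch
from accelerate.hooks import AlignDevicesHook, add_hook_to_module

net = torch.nn.Linear(4, 4)  # placeholder module for illustration

# First hook: keep inputs/outputs on the device the caller used.
add_hook_to_module(net, AlignDevicesHook(io_same_device=True))

# Second hook: with append=True the existing hook is not replaced but chained
# into a SequentialHook, so both hooks run on every forward pass.
add_hook_to_module(net, AlignDevicesHook(execution_device="cpu"), append=True)
```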

@piEsposito
Contributor Author

@sgugger it is ready for review; I've also added the tests.
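
A hypothetical shape for such a test (not necessarily the one added in this PR), checking that `append=True` chains hooks via SequentialHook instead of replacing the first one:

```python
import torch
from accelerate.hooks import AlignDevicesHook, SequentialHook, add_hook_to_module

def test_add_hook_to_module_with_append():
    net = torch.nn.Linear(4, 4)
    add_hook_to_module(net, AlignDevicesHook(io_same_device=True))
    add_hook_to_module(net, AlignDevicesHook(), append=True)
    # The two hooks should now be chained rather than the second replacing the first
    # (attribute names reflect accelerate's internals and may differ across versions).
    assert isinstance(net._hf_hook, SequentialHook)
    assert len(net._hf_hook.hooks) == 2
```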

Collaborator

@sgugger left a comment

Very nice, thanks! Left a couple of nits, and I think you should still put the hook with io_same_device first: I just tested locally and we still have the same issue of net(x) ending up on the wrong device, since that hook runs second and the input has already been moved.

piEsposito requested a review from sgugger on October 18, 2022.
@piEsposito
Contributor Author

piEsposito commented Oct 18, 2022

> Very nice, thanks! Left a couple of nits, and I think you should still put the hook with io first: just tested locally and we still have the same issue of net(x) being on the wrong device since it runs second and the input was already moved.

@sgugger I've just addressed your nits and moved the io hook to the top. I've tested it locally on the snippet from the bug report and it brings the tensor back to the CPU after inference. Thanks for the review!
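
For illustration, an approximation of the scenario being verified (the exact snippet lives in #767): after cpu_offload, a CPU input should come back as a CPU output even though the forward pass runs on the execution device.

```python
import torch
from accelerate import cpu_offload

net = torch.nn.Sequential(torch.nn.Linear(8, 8), torch.nn.ReLU(), torch.nn.Linear(8, 2))
device = torch.device("cuda") if torch.cuda.is_available() else torch.device("cpu")
cpu_offload(net, execution_device=device)

x = torch.randn(2, 8)  # input created on the CPU
y = net(x)
print(y.device)        # expected: cpu, thanks to the io_same_device hook
```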

Collaborator

@sgugger left a comment

Perfect, thanks! Re-tested locally and got the expected results for the code sample you shared in the issue.

@piEsposito
Contributor Author

piEsposito commented Oct 18, 2022

@sgugger, there is a test step that failed due to an HTTP error while installing a library. I've created an empty commit to try running it again.
