Save tensors in context of memory_efficient_linear #3413

tohtana · 2023-04-30T09:18:22Z

By default, torch.nn.functional.linear is replaced with LinearFunctionForZeroStage3. However, LinearFunctionForZeroStage3 causes memory leak in some usecases.

In PEFT's LoRA mentioned in #3002, the weight is passed after 'transpose'.

result = F.linear(x, transpose(self.weight, self.fan_in_fan_out), bias=self.bias)

LinearFunctionForZeroStage3 saves the weight in a map and the key is the object's ID. But a new transposed weight is created and the ID changes for each iteration. So the saved weights will increase through iterations.

This PR simply saves weight and bias in the context instead of the IDs.
I don't understand the intention of using IDs to store the weight. If saving IDs instead of tensors is a crucial part of this module, we need another approach to fix this.

tohtana · 2023-05-01T23:20:44Z

@tjruwase Thank you for merging this PR!

As I mentioned, I didn't understand the intention of saving tensors in a global map.
I would be happy to fix again if we find any problem with this PR.

tjruwase · 2023-05-01T23:54:46Z

@tohtana, I think your solution is the correct one.

tohtana · 2023-05-02T00:23:17Z

Thank you for your reviewing, @tjruwase!

save tensors in context of memory_efficient_linear

ff20562

tohtana requested review from jeffra, tjruwase, samyam and mrwyattii as code owners April 30, 2023 09:18

tjruwase approved these changes May 1, 2023

View reviewed changes

Merge branch 'master' into tohtana/leak_mem_efficient_linear

a417425

tjruwase mentioned this pull request May 1, 2023

[BUG] Peft Training with Zero.Init() and Zero3 will increase GPU memory every forward step #3002

Closed

tjruwase merged commit 42858a9 into master May 1, 2023

tohtana deleted the tohtana/leak_mem_efficient_linear branch May 1, 2023 23:20

This was referenced May 5, 2023

Zero 3 init ReadME update dumpmemory/peft#39

Closed

Zero 3 init ReadME update huggingface/peft#399

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Save tensors in context of memory_efficient_linear #3413

Save tensors in context of memory_efficient_linear #3413

tohtana commented Apr 30, 2023

tohtana commented May 1, 2023

tjruwase commented May 1, 2023

tohtana commented May 2, 2023

Save tensors in context of memory_efficient_linear #3413

Save tensors in context of memory_efficient_linear #3413

Conversation

tohtana commented Apr 30, 2023

tohtana commented May 1, 2023

tjruwase commented May 1, 2023

tohtana commented May 2, 2023