Improve memory usage for stablecode-completion-alpha-3b #1019

IvanYashchuk · 2024-08-22T09:49:30Z

Running this model on DGX-H100 with FSDP was giving an OOM error. With the proposed change peak memory usage is now 67.79GB.

This works by fusing RoPE and GELU into one fusion region and allowing the rematerialization pass to recompute GELU in the backward fusion region.

More details in #246 (comment).

The following command was used to check the OOM error.

torchrun --nproc_per_node=8 thunder/benchmarks/benchmark_litgpt.py --model_name stablecode-completion-alpha-3b --compile=thunder --distributed_mode=fsdp

cc @apaz-cli

Running this model on DGX-H100 with FSDP was giving OOM error. With the proposed change peak memory usage is now 67.79GB. This works by fusing RoPE and GELU into one fusion region and this allows rematerialization pass to recompute GELU in backward fusion region. More details in #246 (comment)

t-vi

Thank you @IvanYashchuk

IvanYashchuk added torch.compile memory use labels Aug 22, 2024

IvanYashchuk requested review from mruberry, lantiga and t-vi as code owners August 22, 2024 09:49

IvanYashchuk mentioned this pull request Aug 22, 2024

Thunder + Inductor gives OOM for stablecode-completion-alpha-3b model from LitGPT #246

Open

t-vi approved these changes Aug 22, 2024

View reviewed changes

t-vi merged commit 2caf5df into main Aug 22, 2024
39 checks passed

t-vi deleted the improve-stablecode-memory-use branch August 22, 2024 17:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve memory usage for stablecode-completion-alpha-3b #1019

Improve memory usage for stablecode-completion-alpha-3b #1019

IvanYashchuk commented Aug 22, 2024 •

edited by github-actions bot

Loading

t-vi left a comment

Improve memory usage for stablecode-completion-alpha-3b #1019

Improve memory usage for stablecode-completion-alpha-3b #1019

Conversation

IvanYashchuk commented Aug 22, 2024 • edited by github-actions bot Loading

t-vi left a comment

Choose a reason for hiding this comment

IvanYashchuk commented Aug 22, 2024 •

edited by github-actions bot

Loading