Avoid graph break by removing redundant requires_grad attr change by deepcharm · Pull Request #7158 · deepspeedai/DeepSpeed

deepcharm · 2025-03-20T15:49:50Z

This PR is a continuation of the efforts to improve DeepSpeed performance when using PyTorch compile.

Dynamo breaks the graph because the assignment `flat_tensor.requires_grad = False`:

* Is a side-effecting operation on tensor metadata
* Occurs in a context where Dynamo expects static tensor properties for tracing

The `flat_tensor.requires_grad = False` assignment is redundant and can be safely removed because:

* The `_allgather_params()` function is already decorated with `@torch.no_grad()`, which ensures the desired property
* `flat_tensor` is created with `torch.empty()`, which sets `requires_grad=False` by default
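The two reasons above can be checked with a small standalone sketch. This is not the DeepSpeed code: `make_flat_buffer` and `copy_into_buffer` are hypothetical stand-ins for the buffer handling inside `_allgather_params()`, and the shapes are arbitrary. It only illustrates that a tensor from `torch.empty()` has `requires_grad=False` without any explicit assignment, and that it stays that way when filled under `@torch.no_grad()`.

```python
import torch


@torch.no_grad()
def make_flat_buffer(numel: int, dtype: torch.dtype = torch.float32) -> torch.Tensor:
    # Hypothetical stand-in for the flat buffer allocation.
    # torch.empty() defaults to requires_grad=False, so no explicit
    # `flat_tensor.requires_grad = False` assignment is needed here.
    flat_tensor = torch.empty(numel, dtype=dtype)
    return flat_tensor


@torch.no_grad()
def copy_into_buffer(param: torch.Tensor) -> torch.Tensor:
    # Hypothetical stand-in for copying a parameter into the flat buffer.
    # Under no_grad, the in-place copy is not recorded by autograd,
    # so the buffer does not start requiring grad.
    buf = torch.empty(param.numel(), dtype=param.dtype)
    buf.copy_(param.view(-1))
    return buf


flat_tensor = make_flat_buffer(8)
param = torch.ones(4, requires_grad=True)
buf = copy_into_buffer(param)

print(flat_tensor.requires_grad)
print(buf.requires_grad)
```

Both prints report that the buffers do not require gradients, even though `param` does, which is why dropping the assignment should not change behavior in this simplified setting.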


Signed-off-by: Max Kovalenko <mkovalenko@habana.ai>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: Hongwei Chen <33092912+hwchen2017@users.noreply.github.com>
Signed-off-by: Logan Adams <loadams@microsoft.com>


…7263) This PR is a follow-up to [PR #7158](#7158) that handles the same issue in another place. See [PR #7158](#7158) for details.

Signed-off-by: Max Kovalenko <mkovalenko@habana.ai>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: Hongwei Chen <33092912+hwchen2017@users.noreply.github.com>


deepcharm requested review from tjruwase and tohtana as code owners March 20, 2025 15:49

deepcharm added 3 commits March 20, 2025 18:01

Small fix

052ef85

Signed-off-by: Max Kovalenko <mkovalenko@habana.ai>

Revert "Small fix"

cf18d95

This reverts commit 11612773b3d68aa5b8d72bad1de4b1714ea1193a. Signed-off-by: Max Kovalenko <mkovalenko@habana.ai>

deepcharm force-pushed the master branch from 5bc28e7 to cf18d95 Compare March 20, 2025 16:01

tohtana approved these changes Mar 20, 2025


Merge branch 'master' into master

0919cb2

loadams enabled auto-merge March 20, 2025 19:46

Merge branch 'master' into master

475be04

loadams added this pull request to the merge queue Mar 22, 2025

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 23, 2025

tjruwase added this pull request to the merge queue Mar 24, 2025

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Mar 24, 2025

loadams added this pull request to the merge queue Mar 24, 2025

Merged via the queue into deepspeedai:master with commit d40cf46 Mar 24, 2025
11 checks passed

deepcharm added a commit to deepcharm/DeepSpeed that referenced this pull request Apr 29, 2025

Remove yet another redundant requires_grad attr change

8412565

This PR is a follow-up to PR (deepspeedai#7158) handling the same issue. Signed-off-by: Max Kovalenko <mkovalenko@habana.ai>

deepcharm mentioned this pull request Apr 29, 2025

Avoid graph break by removing another redundant requires grad false #7263

Merged

loadams merged 5 commits into deepspeedai:master from deepcharm:master
