Synchronize gradients in manual optimization with DDPStrategy(static_graph=True) #21251
Conversation
Synchronize gradients in manual optimization with DDPStrategy(static_graph=True). Ensure gradients are reduced correctly when using manual optimization and DDP with static_graph enabled.
The failures seem unrelated to this PR. Can you please review it, @Borda @SkafteNicki? Thanks!
@Sohaib-Ahmed21 yes, CI is down at the moment, so there is nothing to do for now.
@Borda @SkafteNicki all tests are passing now. Can you please review this PR? Thanks!
@SkafteNicki @Borda @lantiga, gentle ping on this PR. I believe it's ready for review whenever you have a moment. Thanks!
Apologies, I am not a code owner anymore...
LGTM :)
What does this PR do?
Fixes gradient synchronization when using manual optimization with DDPStrategy(static_graph=True).
- Calls reducer._delay_all_reduce() in post_backward to ensure gradients are properly reduced in the first iteration (see the first sketch below).
- Adds a test (test_ddp_gradients_synced) that checks gradient synchronization across the following combinations (see the second sketch below):
  - automatic_optimization=True/False
  - static_graph=True/False

Fixes #18086
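How the fix works, as a minimal sketch: with static_graph=True, DDP postpones the gradient all-reduce during the first backward pass while it records the graph, so under manual optimization the delayed reduction has to be flushed explicitly before the first optimizer step. The snippet below is a simplified illustration of that idea, not the exact Lightning source; note that static_graph and num_iterations are DDP attributes, and reducer._delay_all_reduce() is a private PyTorch API that may change between releases.

```python
# Simplified sketch of the idea behind the fix; not the exact Lightning source.
from torch.nn.parallel import DistributedDataParallel


def post_backward(model: DistributedDataParallel) -> None:
    # With static_graph=True, DDP delays the gradient all-reduce in the
    # first iteration. Under manual optimization, flush it explicitly so
    # the first optimizer step sees synchronized gradients.
    # `reducer._delay_all_reduce` is a private API and may change.
    if model.static_graph and model.num_iterations == 1:
        model.reducer._delay_all_reduce()
```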
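And a hedged sketch of what the gradient-synchronization test could look like; the module, trainer flags, and assertions below are illustrative assumptions, not the PR's actual test_ddp_gradients_synced:

```python
# Hedged sketch of a gradient-synchronization test; the module, trainer
# flags, and assertions are illustrative assumptions, not the PR's code.
import torch
from lightning.pytorch import Trainer
from lightning.pytorch.demos.boring_classes import BoringModel
from lightning.pytorch.strategies import DDPStrategy


class ManualBoringModel(BoringModel):
    def __init__(self):
        super().__init__()
        self.automatic_optimization = False  # exercise manual optimization

    def training_step(self, batch, batch_idx):
        opt = self.optimizers()
        opt.zero_grad()
        loss = self(batch).sum()
        self.manual_backward(loss)
        # After manual_backward, gradients should already be all-reduced,
        # even in the first iteration with static_graph=True.
        grads = self.all_gather(self.layer.weight.grad)
        assert torch.allclose(grads[0], grads[-1]), "gradients differ across ranks"
        opt.step()


def test_ddp_gradients_synced():
    trainer = Trainer(
        accelerator="cpu",
        devices=2,
        strategy=DDPStrategy(static_graph=True),
        max_steps=2,
        logger=False,
        enable_checkpointing=False,
    )
    trainer.fit(ManualBoringModel())
```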
PR review
Anyone in the community is welcome to review the PR.
Before you start reviewing, make sure you have read the review guidelines.
📚 Documentation preview 📚: https://pytorch-lightning--21251.org.readthedocs.build/en/21251/