[annotation] Skip copying custom meta for gradient accumulation nodes; tag with is_gradient_acc=True #167572
+67 −4
The seq_nr doesn't always increment for gradient accumulation nodes, so they can end up copying annotations from forward nodes.
Instead, skip copying the custom meta for any gradient accumulation node and give it a special tag, e.g. `node.meta["is_gradient_acc"] = True`.
Example repro for deepseek torchtitan (without using DTensor): https://gist.github.com/yushangdi/aae13ea382732f31d0fdfb3ffeda12c8
(Side note: two more hints for identifying these gradient accumulation nodes: 1) their op is `torch.ops.aten.add.Tensor`, not `add.default`; 2) they have the highest seq_nr(s). A hedged sketch of such a check is below.)
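For illustration only, here is a minimal sketch of what such a pass could look like over an FX graph. It is not the code in this PR: `tag_gradient_acc_nodes` and `copy_custom_meta` are hypothetical names, and the detection below uses only hint 1 (the `add.Tensor` check); a real pass would also consult `seq_nr` (hint 2) so that ordinary `add.Tensor` nodes are not tagged.

```python
import torch
import torch.fx


def _is_gradient_acc_node(node: torch.fx.Node) -> bool:
    # Hint 1 from above: gradient accumulation nodes call
    # torch.ops.aten.add.Tensor (not add.default). A real implementation
    # would also use seq_nr (hint 2) to disambiguate ordinary adds.
    return node.op == "call_function" and node.target is torch.ops.aten.add.Tensor


def tag_gradient_acc_nodes(graph: torch.fx.Graph, copy_custom_meta) -> None:
    # copy_custom_meta is a hypothetical stand-in for the existing
    # logic that copies custom meta from forward nodes.
    for node in graph.nodes:
        if _is_gradient_acc_node(node):
            # Skip the custom-meta copy and tag the node instead.
            node.meta["is_gradient_acc"] = True
        else:
            copy_custom_meta(node)
```

Downstream consumers can then branch on `node.meta.get("is_gradient_acc", False)` instead of relying on annotations that may have been incorrectly copied from forward nodes.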
cc @ezyang @EikanWang @jgong5 @wenzhe-nrv @voznesenskym @penguinwu @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @jiayisunx @chenyang78 @kadeng @chauhang @amjames @Lucaskabela