[hop] support local_map + SAC #163322
Conversation
[ghstack-poisoned]
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163322
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 72c863b with merge base 607489f.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
[ghstack-poisoned]
hold on, updating test to use aot eager backend
    or op == torch.ops.aten._scaled_dot_product_efficient_attention.default
):
    # NOTE: we can't save nondeterministic_seeded ops, the run with rng wrapper is not traceable yet
    return torch.utils.checkpoint.CheckpointPolicy.PREFER_SAVE
Prefer save is completely ignored by the compiler btw, so generally it's recommended to use MUST_SAVE, but probably fine if we're only testing eager.
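For context, a minimal sketch of an SAC policy along the lines suggested above (the op set is taken from the diff; the wrapper and helper names are illustrative assumptions, not this PR's test code):

```python
# Sketch only: MUST_SAVE the attention ops so the compiler honors the policy,
# and let everything else default to recompute.
import functools
import torch
from torch.utils.checkpoint import (
    CheckpointPolicy,
    checkpoint,
    create_selective_checkpoint_contexts,
)

_SAVE_OPS = {
    torch.ops.aten._scaled_dot_product_flash_attention.default,
    torch.ops.aten._scaled_dot_product_efficient_attention.default,
}

def policy_fn(ctx, op, *args, **kwargs):
    if op in _SAVE_OPS:
        return CheckpointPolicy.MUST_SAVE
    return CheckpointPolicy.PREFER_RECOMPUTE

def run_with_sac(fn, *inputs):
    # Hypothetical wrapper: run fn under selective activation checkpointing.
    context_fn = functools.partial(create_selective_checkpoint_contexts, policy_fn)
    return checkpoint(fn, *inputs, use_reentrant=False, context_fn=context_fn)
```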
    op == torch.ops.aten._scaled_dot_product_flash_attention.default
    or op == torch.ops.aten._scaled_dot_product_efficient_attention.default
):
    # NOTE: we can't save nondeterministic_seeded ops, the run with rng wrapper is not traceable yet
Wait, do you mean "cannot recompute RNG ops"?
oh ignore this, this is autoparallel frontend specific
[ghstack-poisoned]
[ghstack-poisoned]
Need more PR description
""", | ||
) | ||
|
||
@requires_cuda_and_triton |
Why does this test require CUDA?
nice it works on cpu now
@requires_cuda_and_triton
@unittest.skipIf(
    not torch.distributed.is_available(), "Torch distributed not available."
)
Ditto this, doesn't seem like you need distributed
i need it to import local_map/device mesh
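For context, the imports in question live under the distributed namespace, roughly like this (a sketch of the dependency, not the test's exact import list):

```python
# Both local_map and DeviceMesh come from torch.distributed, which is why the
# test is skipped when torch.distributed is unavailable.
import torch
from torch.distributed.device_mesh import init_device_mesh
from torch.distributed.tensor import Replicate, Shard
from torch.distributed.tensor.experimental import local_map
```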
oh ok, please look forward to https://www.internalfb.com/diff/D82283623
):
    out = torch.compile(model, backend=backend)(*inputs)
    out.sum().backward()
except AttributeError as e:
What's going on here?
local_map HOP currently only works for AP-style compile, which interprets the nodes and directly accesses their target. But the graph's codegen is currently wrong; it should be torch._higher_order_ops.local_map.<locals>.call_local_map. I'll fix this later as it's not needed for AP.
I guess I'd prefer to make it obvious which code is live and which code is dead, with a comment saying what you said here.
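Something along these lines, perhaps (a sketch of the suggestion only, reusing the names from the snippet above; not the final test code):

```python
import torch

def compile_and_maybe_backward(model, inputs, backend):
    # Sketch: make it explicit which path is live today and which is dead.
    try:
        out = torch.compile(model, backend=backend)(*inputs)
        # Dead code for now on backends that execute the generated Python:
        # fx codegen emits the wrong target for the local_map HOP, so the
        # compiled call raises AttributeError before reaching backward().
        out.sum().backward()
    except AttributeError:
        # Expected today; AP-style compile interprets graph nodes directly
        # and therefore does not hit the codegen issue.
        pass
```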
# actual == torch.utils.checkpoint.CheckpointPolicy.MUST_RECOMPUTE
# ):
# # can still be in fw_outs for post-graph bytecode
# self.assertFalse(node.name in bw_ins)
Is there a reason we can't use an expect test here?
we could, but we'd need to manually check for these properties whenever we update the expecttest anyway
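For reference, the kind of property check being discussed looks roughly like this (helper names and the forward/backward graph arguments are assumptions based on the snippet above, not this PR's test):

```python
import torch.fx

def backward_input_names(bw_gm: torch.fx.GraphModule) -> set[str]:
    # Placeholder names of the backward graph, i.e. what actually got saved.
    return {n.name for n in bw_gm.graph.nodes if n.op == "placeholder"}

def assert_not_saved(node_name: str, bw_gm: torch.fx.GraphModule) -> None:
    # A node tagged MUST_RECOMPUTE may still appear among the forward outputs
    # (e.g. for post-graph bytecode), but it should never be a backward input.
    assert node_name not in backward_input_names(bw_gm)
```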
The actual impl changes are plausible
[ghstack-poisoned]
Some ops, like local_map hop's deferred mode, are not desugared by make_fx. This means that when we apply SAC tags, we need to define dispatch rules for the SAC torch dispatch modes, as pointed out here: #162246 (comment). This PR adds those rules. Additionally, it fixes a pre-existing issue where we weren't coercing tangent layout (as AOTAutograd typically does) when partitioning the HOP joint. [ghstack-poisoned]
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Some ops, like local_map hop's deferred mode, are not desugared by make_fx. This means that when we apply SAC tags, we need to define dispatch rules for the SAC torch dispatch modes, as pointed out here: #162246 (comment). This PR adds those rules. Additionally, it fixes a pre-existing issue where we weren't coercing tangent layout (as AOTAutograd typically does) when partitioning the HOP joint. Pull Request resolved: #163322 Approved by: https://github.com/ezyang
Stack from ghstack (oldest at bottom):
Some ops, like local_map hop's deferred mode, are not desugared by make_fx. This means that when we apply SAC tags, we need to define dispatch rules for the SAC torch dispatch modes, as pointed out here: #162246 (comment). This PR adds those rules.
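For readers unfamiliar with the mechanism, here is a rough sketch of what such a rule looks like. The HOP, its body signature, and the rule bodies below are simplified illustrations (and the SAC mode classes are private internals whose names may change); this is not this PR's actual implementation:

```python
# Sketch: a HOP that make_fx leaves opaque must say how it behaves under the
# SAC dispatch modes; otherwise dispatch raises "no rule registered" for it.
import torch
from torch._ops import HigherOrderOperator
from torch.utils.checkpoint import (
    _CachedTorchDispatchMode,
    _CachingTorchDispatchMode,
)

class MyHop(HigherOrderOperator):
    # Hypothetical HOP standing in for local_map's deferred mode.
    def __init__(self):
        super().__init__("my_hop")

my_hop = MyHop()

@my_hop.py_impl(_CachingTorchDispatchMode)
def _my_hop_sac_record(mode, body, *operands):
    # SAC forward pass: run the body with the caching mode re-enabled so the
    # policy function sees (and can decide to save) the ops inside it.
    with mode:
        return body(*operands)

@my_hop.py_impl(_CachedTorchDispatchMode)
def _my_hop_sac_replay(mode, body, *operands):
    # SAC recompute pass: replay the body so previously saved values are reused.
    with mode:
        return body(*operands)
```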
Additionally, it fixes a pre-existing issue where we weren't coercing tangent layout (as AOTAutograd typically does) when partitioning the HOP joint.
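The tangent coercion referred to here is, in spirit, something like the following (a deliberately minimal stand-in, not AOTAutograd's actual logic, which also handles memory formats and tensor subclasses):

```python
import torch

def coerce_runtime_tangent(tangent: torch.Tensor) -> torch.Tensor:
    # The backward graph is traced assuming strided, contiguous tangents, so
    # an incoming runtime tangent with a different layout is converted first.
    if tangent.layout != torch.strided:
        tangent = tangent.to_dense()
    if not tangent.is_contiguous():
        tangent = tangent.contiguous()
    return tangent
```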