Add regional aot eager support to AOTAutogradCacheEntry #166650

jamesjwu · 2025-10-30T16:30:25Z

Stack from ghstack (oldest at bottom):

This PR does two things:

- It genericizes `BundledAOTAutogradCacheEntry` to support *any* outputcode, not just CompiledFxGraphs
- It adds a brand new OutputCode for the `aot_eager_regional_inductor` backend, i.e. a graph module that has regional inductor components in it.

This allows BundledAOTAutogradCache to just integrate nicely with inductor out of the box, but more importantly, it allows the result of aot_autograd to be fully serializable when using `aot_eager_regional_inductor`. This will allow us to AOT precompile cases where we have an eager graph that has scooped up inductor bits.

It's a bit unfortunate that the naming makes BundledAOTAutogradCacheEntry sound like its primary use is for caching, but really the more common use is going to be as an AOTAutogradOutput. It may be worth revisiting how to refactor/rename these in a later PR:

- AOTAutogradCacheEntry -> AOTAutogradResult
- BundledAOTAutogradCacheEntry -> BundledAOTAutogradResult
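To illustrate the idea of a bundled entry that is generic over its output code and stays serializable, here is a minimal, self-contained sketch. It does not use PyTorch and is not the implementation in this PR: every name in it (`ToyOutputCode`, `ToyCompiledGraph`, `ToyRegionalGraph`, `ToyBundledResult`, `REGISTRY`) is hypothetical, and JSON is used only as a stand-in serialization format.

```python
import json
from dataclasses import dataclass
from typing import Any, ClassVar, Generic, Protocol, TypeVar


class ToyOutputCode(Protocol):
    """Hypothetical interface: callable on inputs and convertible to a dict."""

    kind: ClassVar[str]

    def __call__(self, inputs: list[float]) -> list[float]: ...

    def to_dict(self) -> dict[str, Any]: ...


@dataclass
class ToyCompiledGraph:
    """Stand-in for a fully compiled graph; here it only scales its inputs."""

    kind: ClassVar[str] = "compiled"
    scale: float

    def __call__(self, inputs: list[float]) -> list[float]:
        return [x * self.scale for x in inputs]

    def to_dict(self) -> dict[str, Any]:
        return {"scale": self.scale}

    @classmethod
    def from_dict(cls, data: dict[str, Any]) -> "ToyCompiledGraph":
        return cls(scale=data["scale"])


@dataclass
class ToyRegionalGraph:
    """Stand-in for an eager graph that contains one compiled region."""

    kind: ClassVar[str] = "regional"
    bias: float
    region: ToyCompiledGraph

    def __call__(self, inputs: list[float]) -> list[float]:
        # Run the compiled region first, then the "eager" part (adding a bias).
        return [y + self.bias for y in self.region(inputs)]

    def to_dict(self) -> dict[str, Any]:
        return {"bias": self.bias, "region": self.region.to_dict()}

    @classmethod
    def from_dict(cls, data: dict[str, Any]) -> "ToyRegionalGraph":
        return cls(bias=data["bias"], region=ToyCompiledGraph.from_dict(data["region"]))


# Maps the serialized "kind" tag back to the class that can rebuild it.
REGISTRY = {cls.kind: cls for cls in (ToyCompiledGraph, ToyRegionalGraph)}

T = TypeVar("T", bound=ToyOutputCode)


@dataclass
class ToyBundledResult(Generic[T]):
    """A bundled entry that works with any output code, not one fixed type."""

    output: T

    def serialize(self) -> str:
        return json.dumps({"kind": self.output.kind, "payload": self.output.to_dict()})

    @staticmethod
    def deserialize(blob: str) -> "ToyBundledResult[Any]":
        data = json.loads(blob)
        return ToyBundledResult(REGISTRY[data["kind"]].from_dict(data["payload"]))

    def __call__(self, inputs: list[float]) -> list[float]:
        return self.output(inputs)
```

In this sketch the bundled entry never needs to know which kind of output code it holds. It stores a tag next to the payload and lets the registry pick the right class on load, so a new kind of output code can be added without changing the entry itself.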

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @Lucaskabela

[ghstack-poisoned]

pytorch-bot · 2025-10-30T16:30:30Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166650

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit f9c21a2 with merge base 3f18247:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.


pytorchmergebot · 2025-10-31T18:47:47Z

Starting merge as part of PR stack under #166656

This PR renames AOTAutogradCacheEntry to AOTAutogradResult and BundledAOTAutogradCacheEntry to BundledAOTAutogradResult. It also moves the corresponding code into a new file, `aot_autograd_result`, which is analogous to `output_code.py` from Inductor. Calling all of these cache entries made sense when caching was their only use, but now that AOT compile uses BundledAOTAutogradCacheEntry, we want a more general naming scheme. This is a no-op change, and all existing tests should pass.

Pull Request resolved: #166656
Approved by: https://github.com/zhxchen17
ghstack dependencies: #166650


Add regional aot eager support to AOTAutogradCacheEntry

50f8199

[ghstack-poisoned]

jamesjwu requested a review from bdhirsh as a code owner October 30, 2025 16:30

jamesjwu mentioned this pull request Oct 30, 2025

export flex attention with kwargs #166649

Closed

pytorch-bot bot added ciflow/inductor module: dynamo module: inductor labels Oct 30, 2025

jamesjwu added a commit that referenced this pull request Oct 30, 2025

Add regional aot eager support to AOTAutogradCacheEntry

d93adee

ghstack-source-id: 7f9bc5d Pull Request resolved: #166650

Update on "Add regional aot eager support to AOTAutogradCacheEntry"

6ceb428


jamesjwu added a commit that referenced this pull request Oct 30, 2025

Add regional aot eager support to AOTAutogradCacheEntry

8b46796

ghstack-source-id: d9cd537 Pull Request resolved: #166650

jamesjwu added the topic: not user facing topic category label Oct 30, 2025

jamesjwu requested review from anijain2305 and zhxchen17 October 30, 2025 16:40

jamesjwu added a commit that referenced this pull request Oct 30, 2025

Add regional aot eager support to AOTAutogradCacheEntry

9f630ee

ghstack-source-id: 16c9332 Pull Request resolved: #166650

jamesjwu mentioned this pull request Oct 30, 2025

Refactor AOTAutogradCacheEntry into AOTAutogradResult #166656

Closed

jamesjwu added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 30, 2025

jamesjwu requested a review from oulgen October 30, 2025 19:24

zhxchen17 approved these changes Oct 31, 2025


pytorchmergebot closed this in 30157d3 Oct 31, 2025

pytorchmergebot added the Merged label Oct 31, 2025
