Conversation

@eellison
Contributor

@eellison eellison commented Apr 17, 2023

Stack from ghstack (oldest at bottom):

Fix for #99286. There were a couple of locations where we were instantiating new fake modes instead of grabbing the correct one from the current tracing context/inputs.
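In code terms, the change amounts to the following pattern. Here is a minimal, torch-free sketch; the names `FakeMode`, `FakeTensor`, `detect_fake_mode`, and `_MODE_STACK` are hypothetical stand-ins for the PyTorch machinery, not the real API:

```python
# Torch-free sketch of the fix: prefer the currently active fake mode (or one
# attached to the inputs) over constructing a brand-new one.

_MODE_STACK = []  # innermost active mode is last


class FakeMode:
    def __enter__(self):
        _MODE_STACK.append(self)
        return self

    def __exit__(self, *exc):
        _MODE_STACK.pop()


class FakeTensor:
    def __init__(self, mode):
        self.fake_mode = mode  # fake tensors remember their creating mode


def detect_fake_mode(inputs=()):
    """Return the ambient mode, else a mode found on an input, else None."""
    if _MODE_STACK:
        return _MODE_STACK[-1]
    for t in inputs:
        if isinstance(t, FakeTensor):
            return t.fake_mode
    return None


def get_mode_buggy(inputs):
    # The bug: always instantiates a fresh mode, even mid-trace.
    return FakeMode()


def get_mode_fixed(inputs):
    # The fix: reuse the current tracing mode; only mint one at top level.
    return detect_fake_mode(inputs) or FakeMode()
```

With an active mode, `get_mode_fixed` returns that same mode; only when nothing is ambient and no input carries a mode does it construct a fresh one.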

cc @soumith @voznesenskym @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @desertfire

[ghstack-poisoned]
@pytorch-bot

pytorch-bot bot commented Apr 17, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/99377

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 Failures

As of commit 2e98e29:

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base ab08284:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@eellison changed the title from "Fix hf bart inference failure" to "Grab Current Tracing Fake Mode in a couple spots" Apr 17, 2023
@eellison eellison requested review from ezyang and voznesenskym April 17, 2023 22:49
eellison added a commit that referenced this pull request Apr 17, 2023
ghstack-source-id: 078a15d
Pull Request resolved: #99377
@eellison
Contributor Author

@pytorchbot rebase

@pytorchmergebot
Collaborator

@pytorchbot successfully started a rebase job. Check the current status here

@pytorchmergebot
Collaborator

Successfully rebased gh/eellison/432/orig onto refs/remotes/origin/viable/strict; please pull locally before adding more changes (for example, via ghstack checkout https://github.com/pytorch/pytorch/pull/99377)

pytorchmergebot pushed a commit that referenced this pull request Apr 18, 2023
ghstack-source-id: 1470e6b
Pull Request resolved: #99377
from .fuse_attention import _sfdp_init

-with FakeTensorMode():
+with detect_fake_mode() or FakeTensorMode():
Contributor
This is wrong: the patterns here are shared globally, so you shouldn't arbitrarily pick the fake mode from a particular run. I actually have a more correct fix on a branch I am working on.

Contributor Author
Okay - I'm going to close and re-assign to you then.
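The reviewer's objection (globally shared patterns must not capture the fake mode of one particular run) can be made concrete with a small, torch-free sketch. All names here are hypothetical illustrations, not PyTorch API:

```python
# Hypothetical sketch of the cache-pollution problem: patterns are traced
# lazily and cached globally, so capturing the fake mode of whichever
# compilation happened to trigger the init ties the shared pattern to one
# unrelated run.

class FakeMode:
    def __init__(self, name):
        self.name = name
        self.created = []  # tensors this mode has produced / tracks

    def make_tensor(self):
        t = {"mode": self}
        self.created.append(t)
        return t


_PATTERN_CACHE = {}


def init_pattern(name, ambient_mode):
    # Lazy init: whichever compilation gets here first traces the pattern
    # under *its* mode, and the result is cached for everyone.
    if name not in _PATTERN_CACHE:
        _PATTERN_CACHE[name] = ambient_mode.make_tensor()
    return _PATTERN_CACHE[name]


run_a = FakeMode("run_a")
run_b = FakeMode("run_b")

p = init_pattern("sfdp", run_a)  # first compilation initializes the pattern
q = init_pattern("sfdp", run_b)  # a later compilation reuses the cache...
```

Here `q` is still tied to `run_a`'s mode, and `run_a`'s mode now tracks pattern tensors that have nothing to do with its own compilation.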

@eellison eellison closed this Apr 18, 2023
ezyang added a commit that referenced this pull request Apr 18, 2023
This bug was discovered by a stronger assert (which I will be posting
in a follow-up PR.)

The explanation for this change is a bit long and windy, and I am not
sure I entirely understand the situation myself.  But here's what I
think is going on.

jansel's joint graph pattern matcher does something fairly unusual:
in order to initialize the pattern in question, it lazily runs an
aot_function invocation to trace out what the joint graph of a given
pattern looks like (we ought not to use aot_function, but we can't
really do this until bdhirsh lands AOT Autograd export properly).
However, this lazy initialization occurs within the context of a
separate compilation, which has its own tracing context and,
importantly, its own fake tensor mode.

What we would like is for the pattern matcher's lazy-initialization
fake tensor mode to be unrelated to whatever the ambient fake tensor
mode of the graph we are actually compiling happens to be. We want
these to be independent, because we don't really care what the current
compiled graph is; this is a lazy init function, and it could have been
initialized during any compilation. It just happens to be initialized
on this one.

To prevent us from picking up the ambient fake mode, we have to do two
things: we have to remove the tracing context (which stores a fake
mode), and we have to also disable the ambiently active fake mode.

In #99377 eellison proposed an
alternative approach, where we reuse the fake mode.  While this probably
won't cause any errors, it's morally not the right thing to do, because
you'll end up polluting the enclosing fake tensor mode with tensors that
have nothing to do with the mode itself.

This might fix #99286
but it's also possible that #99320
fixed it already.

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

ghstack-source-id: f572909
Pull Request resolved: #99391
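The two-step isolation described above (remove the tracing context, disable the ambient fake mode) can be sketched with a hypothetical mode stack. `no_ambient_mode`, `FakeMode`, and `_MODE_STACK` stand in for the PyTorch mechanisms and are not the real API:

```python
from contextlib import contextmanager

_MODE_STACK = []  # hypothetical stand-in for the ambient fake-mode stack


class FakeMode:
    def __enter__(self):
        _MODE_STACK.append(self)
        return self

    def __exit__(self, *exc):
        _MODE_STACK.pop()


@contextmanager
def no_ambient_mode():
    # Stand-in for "remove the tracing context and disable the active fake
    # mode": hide whatever is on the stack for the duration of the block.
    saved = _MODE_STACK[:]
    _MODE_STACK.clear()
    try:
        yield
    finally:
        _MODE_STACK[:] = saved


def current_mode():
    return _MODE_STACK[-1] if _MODE_STACK else None


def lazy_init_pattern():
    # Pattern init deliberately runs under a fresh, independent mode,
    # regardless of which compilation happened to trigger it.
    with no_ambient_mode():
        with FakeMode() as fresh:
            return fresh
```

Whatever compilation is ambient when `lazy_init_pattern` fires, the init sees only its own fresh mode, and the ambient mode is restored untouched afterwards.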
ezyang added a commit that referenced this pull request Apr 18, 2023
pytorchmergebot pushed a commit that referenced this pull request Apr 18, 2023
Pull Request resolved: #99391
Approved by: https://github.com/bdhirsh
@facebook-github-bot facebook-github-bot deleted the gh/eellison/432/head branch June 8, 2023 16:21