
fix inference_mode with torch.compile #101219

Closed · 9 commits

Conversation

@bdhirsh (Contributor) commented May 11, 2023

It looks like inference_mode wasn't playing well with functionalization.

If you run torch.compile on a function whose inputs are tensors created outside of inference mode, then when we create functional tensor wrappers for those inputs during compilation, the wrappers need to properly mirror whether or not the original tensor is an inference tensor.

Hopefully fixes #101151

Stack from ghstack (oldest at bottom):

@pytorch-bot bot commented May 11, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/101219

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit a469373:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

bdhirsh added a commit that referenced this pull request May 11, 2023
ghstack-source-id: 96a941f43bbc85d927166fd541d45fb57dc416d0
Pull Request resolved: #101219
// E.g. when running torch.compile under inference mode, we need to make sure that
// any inputs created outside of inference mode (which are not inference tensors)
// get functional wrappers that are also not inference tensors.
version_counter_ = value_.unsafeGetTensorImpl()->version_counter();
Collaborator:

Wouldn't this access to the version_counter raise an error on inference Tensors?

@bdhirsh (Contributor, Author):

Talked offline - we're not accessing the version counter, just straight up copying the struct onto the wrapper.

Also - we copy the dispatch keyset from the inner tensor onto the wrapper, so if the inner tensor has the Autograd dispatch key (because it was created outside of inference mode), then the wrapper will as well (even though it was created in inference mode).
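The user-visible property the wrapper has to mirror can be checked with `Tensor.is_inference()` (an illustrative sketch, not code from this PR):

```python
import torch

# a tensor created outside inference mode carries Autograd dispatch keys
outside = torch.ones(2)
assert not outside.is_inference()

# a tensor created inside inference mode is an inference tensor:
# no version counter tracking, no autograd metadata
with torch.inference_mode():
    inside = torch.ones(2)
assert inside.is_inference()

# merely *using* `outside` under inference mode does not make it an
# inference tensor; that is the property the functional wrapper must copy
with torch.inference_mode():
    assert not outside.is_inference()
```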

@albanD (Collaborator) left a comment:
Sounds ok!

bdhirsh added a commit that referenced this pull request May 12, 2023
ghstack-source-id: 7c24b4ff106383a8840c86c6a24a57eb92a0676f
Pull Request resolved: #101219
@ezyang (Contributor) left a comment:
Thank you for fixing this!!! What a pain haha

(@ezyang posted the same review comment three more times.)

@Chillee (Contributor) commented May 15, 2023

Wow Ed is really thankful for this fix.

bdhirsh added a commit that referenced this pull request May 17, 2023
ghstack-source-id: 5bc15b0c2ec4cae3db00f095e875fbb0bee21ddc
Pull Request resolved: #101219
@bdhirsh (Contributor, Author) commented May 17, 2023

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label May 17, 2023
@pytorchmergebot (Collaborator):

Merge failed

Reason: This PR needs a label
If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.


@pytorchmergebot (Collaborator):

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team


@PaliC (Contributor) commented May 19, 2023

@pytorchbot revert -c "nosignal" -m "breaking inductor tests"
Add ciflow/inductor to run the tests on the pull request.

@pytorchmergebot (Collaborator):

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

@pytorchmergebot (Collaborator):

@bdhirsh your PR has been successfully reverted.

pytorchmergebot added a commit that referenced this pull request May 19, 2023
This reverts commit 11f7ae1.

Reverted #101219 on behalf of https://github.com/PaliC due to breaking inductor tests ([comment](#101219 (comment)))
@bdhirsh bdhirsh reopened this May 19, 2023
@github-actions github-actions bot requested review from albanD and ezyang May 19, 2023 19:36
pytorchmergebot added a commit that referenced this pull request May 19, 2023
#100570)"

This reverts commit 1fabee3.

Reverted #100570 on behalf of https://github.com/PaliC due to breaking inductor tests along with #101219 ([comment](#100570 (comment)))
pytorchmergebot referenced this pull request May 19, 2023
Fixes #100977

This will hopefully fix this error (from [issue](#99616))

This PR fixes an internal model: we were running an inductor inference graph, but `torch.is_grad_enabled()` was True, which caused an error inside the inference graph when we encountered an out= operator.

I haven't been able to create a smaller repro - before landing this, I want to create a smaller repro to convince myself of why we need to separate out these guards.

Pull Request resolved: #100570
Approved by: https://github.com/ezyang
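For context, `torch.is_grad_enabled()` is the guard state in question, and it is distinct from tensor inference-ness (an illustrative sketch, not code from the referenced PR):

```python
import torch

# default: grad mode is on
assert torch.is_grad_enabled()

# no_grad disables grad mode, but tensors created inside it
# are still ordinary (non-inference) tensors
with torch.no_grad():
    assert not torch.is_grad_enabled()
    a = torch.ones(2)
assert not a.is_inference()

# inference_mode also disables grad mode, and additionally makes
# newly created tensors inference tensors
with torch.inference_mode():
    assert not torch.is_grad_enabled()
    b = torch.ones(2)
assert b.is_inference()
```

A compiled graph specialized for one of these states can misbehave if run under another, which is why the guards need to distinguish them.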
bdhirsh added a commit that referenced this pull request May 22, 2023
ghstack-source-id: fc724bccbfeb8315dc90599b7bd5e8299a11a652
Pull Request resolved: #101219
bdhirsh added a commit that referenced this pull request May 23, 2023
ghstack-source-id: 4aef060e6ca8055b49b354d2f8f1ace49f962a01
Pull Request resolved: #101219
bdhirsh added a commit that referenced this pull request May 24, 2023
ghstack-source-id: 2061c74c08bd4a763a31ab973b1e113a639bd840
Pull Request resolved: #101219
@bdhirsh (Contributor, Author) commented May 24, 2023

@pytorchbot merge

@pytorchmergebot (Collaborator):

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team


@facebook-github-bot facebook-github-bot deleted the gh/bdhirsh/416/head branch June 8, 2023 15:48
Labels
ciflow/inductor · ciflow/trunk · Merged · release notes: composability · Reverted