Skip debug asserts for mixed dense, subclass views in autograd_not_implemented_fallback #128057
…plemented_fallback [ghstack-poisoned]
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/128057
Note: Links to docs will display an error until the docs builds have been completed.
✅ You can merge normally! (1 Unrelated Failure)
As of commit 30bf882 with merge base edb45dc:
FLAKY - The following job failed but was likely due to flakiness present on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
I guess this fixes the test under DEBUG=1. Sounds good!
…grad_not_implemented_fallback" Fixes #125503 [ghstack-poisoned]
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 Hours).
Learn more about merging in the wiki.
Questions? Feedback? Please reach out to the PyTorch DevX Team
Merge failed
Reason: 1 jobs have failed, first few of them are: Check mergeability of ghstack PR / ghstack-mergeability-check
Details for Dev Infra team: Raised by workflow job
…grad_not_implemented_fallback" Fixes #125503 [ghstack-poisoned]
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 Hours).
Learn more about merging in the wiki.
Questions? Feedback? Please reach out to the PyTorch DevX Team
…plemented_fallback (pytorch#128057)
Fixes pytorch#125503
Pull Request resolved: pytorch#128057
Approved by: https://github.com/albanD, https://github.com/soulitzer
ghstack dependencies: pytorch#127007
Idea: close over min / max sequence length in the main NJT view func (`_nested_view_from_jagged`) so that view replay during fake-ification propagates these correctly in torch.compile.

For dynamic shapes support for min / max sequence length, this PR uses a hack that stores the values in `(val, 0)` shaped tensors.

**NB: This PR changes SDPA to operate on real views instead of using `buffer_from_jagged()` / `ViewNestedFromBuffer`, which may impact the internal FIRST model. That is, it undoes the partial revert from #123215 alongside a fix to the problem that required the partial revert. We need to verify that there are no regressions there before landing.**

Differential Revision: [D55448636](https://our.internmc.facebook.com/intern/diff/D55448636)
Pull Request resolved: #122836
Approved by: https://github.com/soulitzer
ghstack dependencies: #127007, #128057
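To illustrate the "close over" idea from the commit message above, here is a toy sketch in plain Python. It is not PyTorch code: `ToyJagged` and `make_view_func` are hypothetical names invented for this example, and lists stand in for tensors. The only point is that metadata captured in the view function's closure is reproduced whenever the view is replayed on a new base.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class ToyJagged:
    """Hypothetical stand-in for a jagged view; not a PyTorch type."""
    values: List[float]
    offsets: List[int]
    min_seqlen: Optional[int] = None
    max_seqlen: Optional[int] = None


def make_view_func(
    offsets: List[int], min_seqlen: int, max_seqlen: int
) -> Callable[[List[float]], ToyJagged]:
    """Build a view function that closes over the sequence-length metadata."""

    def view_func(new_base: List[float]) -> ToyJagged:
        # Replaying the view on a different base reuses the captured
        # offsets and min / max sequence lengths instead of losing them.
        return ToyJagged(list(new_base), offsets, min_seqlen, max_seqlen)

    return view_func


# Two sequences of lengths 2 and 3 packed into one flat buffer.
offsets = [0, 2, 5]
lengths = [end - start for start, end in zip(offsets, offsets[1:])]
replay = make_view_func(offsets, min(lengths), max(lengths))

# Replay the view on a fresh base, e.g. a placeholder buffer of the same size.
replayed = replay([0.0] * 5)
```

In this toy version, a replayed view carries the same `min_seqlen` and `max_seqlen` as the original because they live in the closure rather than on the base. How the real `_nested_view_from_jagged` captures and propagates these values is not shown here.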
…plemented_fallback
ghstack-source-id: d3a6a41688bb4a7cb546b0bbdc68d546c92adb52
Pull Request resolved: pytorch/pytorch#128057
Stack from ghstack (oldest at bottom):
Fixes #125503