Conversation


@jon-chuang jon-chuang commented Sep 30, 2023

~~`TensorMetadata` assumes a shape is a `torch.Size`/tuple, but some scheduler node groups use a bare `int`, so convert it to a tuple.~~

The root cause is actually the `foreach` scheduler node having a silently erroneous group of `int`, when it ought to be an opaque `foreach` marker.

**Previously:** silent error / confusing shape of `(0,)`
![image](https://github.com/pytorch/pytorch/assets/9093549/5bc2a3c7-151f-4433-bbf8-044c7b03e989)

**Now:** it is clear that this is a `foreach` kernel, which does not have a well-defined shape:
![image](https://github.com/pytorch/pytorch/assets/9093549/8373080d-4519-4e74-8a3b-da463e9968da)

~~An alternative would be to build a list of shapes for each of its subnodes. Actually, for debuggability's sake, I might prefer that. We could ensure that the recursive generation of this string happens only in a debug code path; alternatively, computing it incrementally when the `ForeachKernel` is initialized may also be feasible.~~ This is quite infeasible for 100s of params.
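The idea can be sketched roughly as follows. This is a minimal illustration with a hypothetical helper name, not the actual Inductor code: a `foreach` group that arrives as a bare `int` gets an opaque marker instead of being misread as a tuple shape.

```python
from typing import Any, Tuple, Union

# Hypothetical helper illustrating the fix; the real change lives in
# Inductor's scheduler/debug-formatting code.
def normalize_group_shape(
    group: Union[int, Tuple[Any, ...]]
) -> Union[Tuple[Any, ...], str]:
    if isinstance(group, int):
        # A bare int would previously surface as a confusing shape like (0,);
        # a foreach kernel has no single well-defined shape, so mark it opaque.
        return "foreach"
    # Ordinary groups are normalized to a plain tuple, as TensorMetadata expects.
    return tuple(group)
```

With this, graph-debug output shows `foreach` for such nodes rather than a bogus tuple shape.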

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler

@pytorch-bot pytorch-bot bot added the release notes: fx release notes category label Sep 30, 2023

pytorch-bot bot commented Sep 30, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/110336

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 874d400 with merge base 86196bf:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@jon-chuang jon-chuang changed the title fix(fx-graph-debug): shape is not always well-typed fix(inductor): shape can be int for graph debug Oct 1, 2023
@jon-chuang jon-chuang changed the title fix(inductor): shape can be int for graph debug fix(inductor): ForeachKernelSchedulerNode group shape should be opaque for graph debug Oct 1, 2023
@jon-chuang jon-chuang force-pushed the jon-chuang/fix-graph-debug branch from 212719c to 979da14 Compare October 1, 2023 07:42
@jon-chuang jon-chuang force-pushed the jon-chuang/fix-graph-debug branch from 979da14 to f81e60b Compare October 1, 2023 07:48
@colesbury colesbury requested a review from eellison October 3, 2023 23:06
@colesbury colesbury added the triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module label Oct 3, 2023
@eellison eellison requested review from mlazos and removed request for eellison October 10, 2023 20:01
Contributor

@mlazos mlazos left a comment


Could you possibly list all of the shapes? That would be really cool. If not, this is fine too.

@jon-chuang
Collaborator Author

> Could you possibly list all of the shapes? That would be really cool.

I think let's leave it opaque for now. It would be quite difficult to read when listing all the shapes.
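If listing the shapes were revisited later, readability could be preserved by truncating the summary. A hedged sketch (hypothetical helper, not part of this PR):

```python
from typing import Sequence, Tuple

def summarize_subnode_shapes(
    shapes: Sequence[Tuple[int, ...]], max_items: int = 3
) -> str:
    # Printing hundreds of param shapes is unreadable; show the first few
    # subnode shapes and collapse the rest into a count.
    shown = ", ".join(str(s) for s in shapes[:max_items])
    extra = len(shapes) - max_items
    suffix = f", ... +{extra} more]" if extra > 0 else "]"
    return f"foreach[{shown}{suffix}"
```

This keeps the debug string bounded regardless of how many params the foreach kernel covers.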

@jon-chuang
Collaborator Author

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 31, 2023
@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@pytorchmergebot
Collaborator

Merge failed

Reason: 1 jobs have failed, first few of them are: trunk / linux-focal-rocm5.6-py3.8 / test (default, 1, 3, linux.rocm.gpu)

Details for Dev Infra team: raised by workflow job

@jon-chuang
Copy link
Collaborator Author

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

xuhancn pushed a commit to xuhancn/pytorch that referenced this pull request Nov 7, 2023
…que for graph debug (pytorch#110336)

~~Shape is assumed by `TensorMetadata` to be torch.Shape/tuple, however, some of the scheduler node groups utilize `int`, so convert to tuple.~~

Root cause is actually `foreach` scheduler node having silent-error group of int, when in fact it ought to be opaque `foreach`.

**Previously:** silent error / confusing shape of (0,)
![image](https://github.com/pytorch/pytorch/assets/9093549/5bc2a3c7-151f-4433-bbf8-044c7b03e989)

**Now:** clear that it is foreach which does not have well-defined shape:
![image](https://github.com/pytorch/pytorch/assets/9093549/8373080d-4519-4e74-8a3b-da463e9968da)

~~Alternate might be to create list of shapes for each of its subnodes. Actually, for debuggability sake, I may prefer this. We can ensure that the recursive generation of this string is only done dynamically in a debug code path. Else, incrementally computing it on initialization of ForeachKernel may also be feasible.~~ This is quite infeasible for 100s of params.

Pull Request resolved: pytorch#110336
Approved by: https://github.com/mlazos
Skylion007 pushed a commit to Skylion007/pytorch that referenced this pull request Nov 14, 2023
…que for graph debug (pytorch#110336)


Labels

- ciflow/inductor
- ciflow/trunk (Trigger trunk jobs on your pull request)
- Merged
- module: inductor
- open source
- release notes: fx (release notes category)
- triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
