[inductor] fix test_linear_binary_dynamic_shapes_cpp_wrapper #139942

shunting314 · 2024-11-07T00:12:34Z

Stack from ghstack (oldest at bottom):

-> [inductor] fix test_linear_binary_dynamic_shapes_cpp_wrapper #139942

I recently added a new pattern here #139136 to remove pointless view/permute pairs. At that PR, I've already updated the matched pattern/node count in test_linear_binary to account for the new pattern. But it looks like with cpp wrapper, one more pattern will be matched.

7 patterns without cpp-wrapper:

========== pattern matched <code object pointless_view at 0x7f6d25c67aa0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l
ine 568> =======
========== pattern matched <code object pointless_view_pair at 0x7f6d25c67b50, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.p
y", line 581> =======
========== pattern matched <code object pointless_view at 0x7f6d25c67aa0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l
ine 568> =======
========== pattern matched <code object pointless_view at 0x7f6d25c67aa0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l
ine 568> =======
========== pattern matched <code object linear at 0x7f6d176e5dc0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/mkldnn_fusion.py", line 11
21> =======
========== pattern matched <code object reshape_linear_reshape_pattern at 0x7f6d176e5210, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/mk
ldnn_fusion.py", line 732> =======
========== pattern matched <code object fn at 0x7f6d176d3ec0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/mkldnn_fusion.py", line 476> =
======

8 patterns with cpp wrapper:
========== pattern matched <code object pointless_view at 0x7f8e78bf07c0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l
ine 568> =======
========== pattern matched <code object pointless_view_pair at 0x7f8e78bf0870, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.p
y", line 581> =======
========== pattern matched <code object pointless_view at 0x7f8e78bf07c0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l
ine 568> =======
========== pattern matched <code object pointless_view at 0x7f8e78bf07c0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l
ine 568> =======
========== pattern matched <code object pointless_view at 0x7f8e78bf07c0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l
ine 568> =======
========== pattern matched <code object linear at 0x7f8e59c04190, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/mkldnn_fusion.py", line 11
21> =======
========== pattern matched <code object reshape_linear_reshape_pattern at 0x7f8e59dfb520, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/mk
ldnn_fusion.py", line 732> =======
========== pattern matched <code object fn at 0x7f8e59dfa290, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/mkldnn_fusion.py", line 476> =
======

I fixed this test by +1 to the expected number if cpp wrapper is enabled. But I think fundamentally can we not assert for the total number of patterns matched in the test? I think that makes the test very fragile. People adding new patterns may keep breaking these 'un-related' tests. One possible way to improve is, we have a counter for each specific pattern, in the tests, instead of check the total number of patterns matched, just check the match count for the RELEVANT patterns. That should reduce false-positive for broken tests. cc possible test creator @jgong5

Fixes #139812 (we need to have this to run this disabled test on your PR)

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov

[ghstack-poisoned]

pytorch-bot · 2024-11-07T00:12:38Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/139942

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit b4a6dbf with merge base 8f077b8 ():

NEW FAILURES - The following jobs have failed:

slow / linux-focal-py3.9-clang10 / test (slow, 1, 2, linux.2xlarge) (gh)
inductor/test_cpu_cpp_wrapper.py::TestCppWrapper::test_linear_binary_cpp_wrapper
slow / linux-jammy-py3.10-clang15-asan / test (slow, 3, 3, linux.4xlarge) (gh)
inductor/test_cpu_cpp_wrapper.py::TestCppWrapper::test_linear_binary_cpp_wrapper

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 9dc2d99 Pull Request resolved: #139942

huydhn

Thank you for the fix!

leslie-fang-intel · 2024-11-07T00:33:15Z

One possible way to improve is, we have a counter for each specific pattern, in the tests, instead of check the total number of patterns matched, just check the match count for the RELEVANT patterns. That should reduce false-positive for broken tests.

I think we have added specific count for oneDNN quantization pattern matcher such as:

pytorch/test/inductor/test_mkldnn_pattern_matcher.py

Line 1637 in e675c67

counters["inductor"]["qlinear_weight_prepack_matcher_count"], 2

So, we probably also need to add some other specific count for the other oneDNN pattern matchers.

jgong5

So, we probably also need to add some other specific count for the other oneDNN pattern matchers.

@leslie-fang-intel Can we add a BE task to revise existing inductor cpp tests?

leslie-fang-intel · 2024-11-07T05:18:07Z

Track this task: #139970 @Valentine233 could you help on this?

shunting314 · 2024-11-07T20:15:12Z

@pytorchbot merge

pytorchmergebot · 2024-11-07T20:17:17Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

huydhn · 2024-11-08T01:54:01Z

@pytorchbot revert -m 'Sorry for revert this, but I think we miss running the test and it is now failing in trunk' -c nosignal

inductor/test_cpu_cpp_wrapper.py::TestCppWrapper::test_linear_binary_cpp_wrapper GH job link HUD commit link

It's one of those slow tests, so need ciflow/slow to run them

pytorchmergebot · 2024-11-08T01:55:42Z

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot · 2024-11-08T01:55:51Z

@shunting314 your PR has been successfully reverted.

…139942)" This reverts commit 0618c7f. Reverted #139942 on behalf of https://github.com/huydhn due to Sorry for revert this, but I think we miss running the test and it is now failing in trunk ([comment](#139942 (comment)))

shunting314 · 2024-11-08T21:34:01Z

Interesting, I think they pass on my dev gpu. Let me double check

shunting314 · 2024-11-08T21:57:05Z

hmm, this is just another evidence how fragile these tests are.

Reverting my PR can make those previously failed test pass now. I think maybe something recently changed in cpp-wrapper and make the different number of patterns being matched. @huydhn I'll close this PR. I can look more if you still see this failure on trunk. In a bit longer term, I think Intel folks agreed to improves these tests

@jgong5

…#139942) I recently added a new pattern here pytorch#139136 to remove pointless view/permute pairs. At that PR, I've already updated the matched pattern/node count in `test_linear_binary` to account for the new pattern. But it looks like with cpp wrapper, one more pattern will be matched. ``` 7 patterns without cpp-wrapper: ========== pattern matched <code object pointless_view at 0x7f6d25c67aa0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l ine 568> ======= ========== pattern matched <code object pointless_view_pair at 0x7f6d25c67b50, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.p y", line 581> ======= ========== pattern matched <code object pointless_view at 0x7f6d25c67aa0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l ine 568> ======= ========== pattern matched <code object pointless_view at 0x7f6d25c67aa0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l ine 568> ======= ========== pattern matched <code object linear at 0x7f6d176e5dc0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/mkldnn_fusion.py", line 11 21> ======= ========== pattern matched <code object reshape_linear_reshape_pattern at 0x7f6d176e5210, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/mk ldnn_fusion.py", line 732> ======= ========== pattern matched <code object fn at 0x7f6d176d3ec0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/mkldnn_fusion.py", line 476> = ====== 8 patterns with cpp wrapper: ========== pattern matched <code object pointless_view at 0x7f8e78bf07c0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l ine 568> ======= ========== pattern matched <code object pointless_view_pair at 0x7f8e78bf0870, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.p y", line 581> ======= ========== pattern matched <code object pointless_view at 0x7f8e78bf07c0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l ine 568> ======= ========== pattern matched <code object pointless_view at 0x7f8e78bf07c0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l ine 568> ======= ========== pattern matched <code object pointless_view at 0x7f8e78bf07c0, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/joint_graph.py", l ine 568> ======= ========== pattern matched <code object linear at 0x7f8e59c04190, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/mkldnn_fusion.py", line 11 21> ======= ========== pattern matched <code object reshape_linear_reshape_pattern at 0x7f8e59dfb520, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/mk ldnn_fusion.py", line 732> ======= ========== pattern matched <code object fn at 0x7f8e59dfa290, file "/home/shunting/ws/pytorch/torch/_inductor/fx_passes/mkldnn_fusion.py", line 476> = ====== ``` I fixed this test by +1 to the expected number if cpp wrapper is enabled. But I think fundamentally can we not assert for the total number of patterns matched in the test? I think that makes the test very fragile. People adding new patterns may keep breaking these 'un-related' tests. One possible way to improve is, we have a counter for each specific pattern, in the tests, instead of check the total number of patterns matched, just check the match count for the ***RELEVANT*** patterns. That should reduce false-positive for broken tests. cc possible test creator @jgong5 Fixes pytorch#139812 (we need to have this to run this disabled test on your PR) Pull Request resolved: pytorch#139942 Approved by: https://github.com/huydhn, https://github.com/jgong5

…ytorch#139942)" This reverts commit 0618c7f. Reverted pytorch#139942 on behalf of https://github.com/huydhn due to Sorry for revert this, but I think we miss running the test and it is now failing in trunk ([comment](pytorch#139942 (comment)))

[inductor] fix test_linear_binary_dynamic_shapes_cpp_wrapper

b4a6dbf

[ghstack-poisoned]

pytorch-bot bot added module: inductor topic: not user facing topic category labels Nov 7, 2024

shunting314 added a commit that referenced this pull request Nov 7, 2024

[inductor] fix test_linear_binary_dynamic_shapes_cpp_wrapper

587abca

ghstack-source-id: 9dc2d99 Pull Request resolved: #139942

shunting314 requested review from EikanWang, huydhn, jgong5 and leslie-fang-intel November 7, 2024 00:19

huydhn approved these changes Nov 7, 2024

View reviewed changes

jgong5 approved these changes Nov 7, 2024

View reviewed changes

leslie-fang-intel mentioned this pull request Nov 7, 2024

[Inductor] Revise existing Inductor mkldnn pattern matcher tests #139970

Closed

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 7, 2024

pytorchmergebot added the merging label Nov 7, 2024

pytorchmergebot added the Merged label Nov 7, 2024

pytorchmergebot closed this in 0618c7f Nov 7, 2024

pytorchmergebot removed the merging label Nov 7, 2024

huydhn added the ciflow/slow label Nov 8, 2024

pytorchmergebot added Reverted ci-no-td Do not run TD on this PR labels Nov 8, 2024

pytorchmergebot reopened this Nov 8, 2024

shunting314 closed this Nov 8, 2024

github-actions bot deleted the gh/shunting314/185/head branch December 9, 2024 02:14

[inductor] fix test_linear_binary_dynamic_shapes_cpp_wrapper #139942

[inductor] fix test_linear_binary_dynamic_shapes_cpp_wrapper #139942

Uh oh!

Conversation

shunting314 commented Nov 7, 2024 • edited by huydhn Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Nov 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/139942

❌ 2 New Failures

Uh oh!

huydhn left a comment

Choose a reason for hiding this comment

Uh oh!

leslie-fang-intel commented Nov 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jgong5 left a comment

Choose a reason for hiding this comment

Uh oh!

leslie-fang-intel commented Nov 7, 2024

Uh oh!

shunting314 commented Nov 7, 2024

Uh oh!

pytorchmergebot commented Nov 7, 2024

Merge started

Uh oh!

huydhn commented Nov 8, 2024

Uh oh!

pytorchmergebot commented Nov 8, 2024

Uh oh!

pytorchmergebot commented Nov 8, 2024

Uh oh!

shunting314 commented Nov 8, 2024

Uh oh!

shunting314 commented Nov 8, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

shunting314 commented Nov 7, 2024 •

edited by huydhn

Loading

pytorch-bot bot commented Nov 7, 2024 •

edited

Loading

leslie-fang-intel commented Nov 7, 2024 •

edited

Loading