
Conversation

zhuhaozhe
Collaborator

@zhuhaozhe zhuhaozhe commented May 28, 2024

In this PR:

(1) Fix the unary fusion for bf16 conv/linear.
Previously we registered the same fusion pattern for both bf16 and fp16, and we did not check the dtype while matching the pattern. As a result, the fp16 case matched the bf16 pattern, but during the later replacement we found a float16 tensor where it was not expected, so we did not fuse. We fix this by checking dtypes so that the fp16 case no longer matches the bf16 pattern.

```
  def _is_valid_computation_unary_fusion(computation_op, lowp_dtype=None):
      def fn(match):
          matched = _is_single_computation_op(computation_op, lowp_dtype)(match)  # previously we did not check lowp_dtype here
```

It was not exposed before because we only checked the match count, and the match count was correct anyway since the pattern did match. To address this, we add a check on the number of generated kernels (`generated_kernel`): if the post op is not fused, one additional kernel is generated to compute it (see the test sketch after item (3) below).

(2) Previously the UT

```
python test/inductor/test_mkldnn_pattern_matcher.py -k test_linear_binary
```

did not check the fusion status; we fix it in this PR.

(3) Extend `test_conv_binary` to test with low-precision (lp) dtypes.
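Roughly, the added kernel-count check looks like the sketch below. This is a minimal sketch rather than the exact test code from this PR: the helper name and the linear+relu example model are made up for illustration, and it relies on the inductor counter `torch._inductor.metrics.generated_kernel_count`.

```
import torch
from torch._inductor import metrics


def count_generated_kernels(mod, inputs):
    # Reset the inductor counter so we only measure this compilation.
    metrics.reset()
    compiled = torch.compile(mod)
    with torch.no_grad(), torch.autocast("cpu", dtype=torch.bfloat16):
        compiled(*inputs)
    # When the unary post op is fused into the conv/linear kernel, no extra
    # elementwise kernel is generated; an unfused post op shows up as one
    # additional generated kernel, so the test can assert on this number.
    return metrics.generated_kernel_count


# Hypothetical usage: linear + relu. The test would assert that the count
# matches the fused expectation (one lower than the unfused case).
mod = torch.nn.Sequential(torch.nn.Linear(16, 16), torch.nn.ReLU()).eval()
print(count_generated_kernels(mod, (torch.randn(2, 16),)))
```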

Stack from ghstack (oldest at bottom):

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang


pytorch-bot bot commented May 28, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/127296

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 5fbf5bd with merge base 4d4d2a9:

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

zhuhaozhe added a commit that referenced this pull request May 28, 2024
ghstack-source-id: d9049f3
Pull Request resolved: #127296
[ghstack-poisoned]
@zhuhaozhe zhuhaozhe added the ciflow/trunk Trigger trunk jobs on your pull request label May 28, 2024
@zhuhaozhe zhuhaozhe marked this pull request as draft May 28, 2024 14:11
Comment on lines +501 to 503
mod = M(binary_fn, input_shape[-1], out_feature, bias).eval()
v = torch.randn(input_shape)
other = torch.randn(input_shape[:-1] + [out_feature]).to(dtype)
Collaborator


Why do we not convert the dtype on mod and input v, but convert it on "other" here?

Collaborator Author


Hi, Jiong.
Here we chose not to convert the dtype on mod and v because we expect autocast to handle it. And for other, autocast will not cast it because add is a fall-through op.

y = linear(x) + z

And currently we do not fuse the `+ z` if z is float while linear(x) is low precision (lp).
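
A small standalone illustration of this (a hedged sketch, not code from this PR; the module and shapes are made up):

```
import torch

lin = torch.nn.Linear(16, 16)
x = torch.randn(2, 16)
z = torch.randn(2, 16)  # the extra add input; it stays fp32

with torch.autocast("cpu", dtype=torch.bfloat16):
    y_lin = lin(x)  # linear runs in bf16 under autocast
    y = y_lin + z   # add is not cast by autocast; type promotion gives fp32

print(y_lin.dtype, y.dtype)  # torch.bfloat16 torch.float32
```

So the binary pattern sees a bf16 linear output added to an fp32 tensor, and that mixed-dtype case is the one we currently do not fuse.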

@zhuhaozhe zhuhaozhe requested a review from jgong5 May 29, 2024 08:34
zhuhaozhe added a commit that referenced this pull request May 29, 2024
ghstack-source-id: befb35e
Pull Request resolved: #127296
zhuhaozhe added a commit that referenced this pull request May 29, 2024
ghstack-source-id: 1a8773f
Pull Request resolved: #127296
@zhuhaozhe zhuhaozhe requested a review from jansel May 30, 2024 00:41
@zhuhaozhe zhuhaozhe marked this pull request as ready for review May 30, 2024 12:26
@zhuhaozhe
Collaborator Author

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

@huydhn
Contributor

huydhn commented May 30, 2024

@pytorchbot revert -m 'Sorry for reverting you change but one of the tests is failing on trunk ROCm. Please help fix and reland the change https://github.com/pytorch/pytorch/actions/runs/9302535020/job/25606932572' -c nosignal

@huydhn huydhn added the ciflow/rocm Trigger "default" config CI on ROCm label May 30, 2024
@pytorchmergebot
Collaborator

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot added a commit that referenced this pull request May 30, 2024
This reverts commit cdeb242.

Reverted #127296 on behalf of https://github.com/huydhn due to Sorry for reverting you change but one of the tests is failing on trunk ROCm.  Please help fix and reland the change https://github.com/pytorch/pytorch/actions/runs/9302535020/job/25606932572 ([comment](#127296 (comment)))
@pytorchmergebot
Collaborator

@zhuhaozhe your PR has been successfully reverted.

zhuhaozhe added a commit that referenced this pull request May 31, 2024
ghstack-source-id: 9bb09ce
Pull Request resolved: #127296
[ghstack-poisoned]
zhuhaozhe added a commit that referenced this pull request May 31, 2024
ghstack-source-id: 9482c60
Pull Request resolved: #127296
@zhuhaozhe
Collaborator Author

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

@pytorchmergebot
Collaborator

The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
For more information see pytorch-bot wiki.

@zhuhaozhe
Collaborator Author

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

petrex pushed a commit to petrex/pytorch that referenced this pull request Jun 5, 2024
Pull Request resolved: pytorch#127296
Approved by: https://github.com/leslie-fang-intel, https://github.com/jgong5, https://github.com/jansel
@github-actions github-actions bot deleted the gh/zhuhaozhe/34/head branch July 2, 2024 01:54