[Quant][FX] Lower QConvAddReLU2d for onednn backend #91155
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/91155. Note: links to docs will display an error until the doc builds have completed. ✅ No failures as of commit d341248. This comment was automatically generated by Dr. CI and updates every 15 minutes.
ghstack-source-id: a250783bd6dd1a6827ddec0e7aaa19d224a7031c Pull Request resolved: #91155
```python
for add_op in [torch.add, operator.add]:
    conv_configs.append(
        BackendPatternConfig()
        ._set_pattern_complex_format((nn.ReLU, (add_op, nn.Conv2d, MatchAllNode)))
    )
```
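For reference, the diff excerpt above can be completed into a self-contained sketch. Only the loop body comes from the diff; the imports and the `conv_configs` accumulator are reconstructed here as assumptions about the surrounding code:

```python
import operator

import torch
import torch.nn as nn
from torch.ao.quantization.backend_config import BackendPatternConfig
from torch.ao.quantization.utils import MatchAllNode

conv_configs = []  # assumed accumulator for backend pattern configs

# Register the pattern ReLU(add(Conv2d(x), extra)) for both spellings
# of addition (torch.add and operator.add / `+`). MatchAllNode stands
# in for the extra input, which may be produced by any node in the graph.
for add_op in [torch.add, operator.add]:
    conv_configs.append(
        BackendPatternConfig()
        ._set_pattern_complex_format((nn.ReLU, (add_op, nn.Conv2d, MatchAllNode)))
    )
```

In the complex pattern format, the outermost op comes first, so the tuple reads "ReLU applied to the result of adding a Conv2d output and any other node" — which is why the reviewer below asks whether the extra input is always on the right.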
Always on the right?
No, I will add it later. It's still WIP.
Hi @jerryzh168, are there any other comments on this PR? Could you help take a look again?
**Summary**
Add quantization mappings for QConvAddReLU2d for int8 inference with the onednn backend. Fusion and lowering are supported only in FX mode.

**Test plan**
```
python -m pytest test_quantization.py -k test_fuse_conv_bn_add_relu_onednn
python -m pytest test_quantization.py -k test_fuse_conv_bn_add_relu_by_default
python -m pytest test_quantization.py -k test_fuse_conv_bn_add_relu_lowering
```

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10
Hi @jerryzh168, could you take another look at this PR?
LGTM!
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):
Summary
Add quantization mappings for QConvAddReLU2d for int8 inference with the onednn backend. Fusion and lowering are supported only in FX mode.
Test plan
```
python -m pytest test_quantization.py -k test_fuse_conv_bn_add_relu_onednn
python -m pytest test_quantization.py -k test_fuse_conv_bn_add_relu_by_default
python -m pytest test_quantization.py -k test_fuse_conv_bn_add_relu_lowering
```
cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10
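As a rough illustration of the FX graph mode flow this PR targets, the sketch below quantizes a Conv2d → BatchNorm2d → add → ReLU module with the onednn backend config. The module name, shapes, and calibration here are invented for the example, and actually executing the converted model requires a PyTorch build with the onednn quantized engine:

```python
import torch
import torch.fx
import torch.nn as nn
from torch.ao.quantization import get_default_qconfig_mapping
from torch.ao.quantization.backend_config import get_onednn_backend_config
from torch.ao.quantization.quantize_fx import convert_fx, prepare_fx

class ConvAddReLU(nn.Module):
    # conv -> bn -> add -> relu: the pattern this PR fuses and
    # lowers to QConvAddReLU2d on the onednn backend
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 3, 3, padding=1)
        self.bn = nn.BatchNorm2d(3)
        self.relu = nn.ReLU()

    def forward(self, x, extra):
        return self.relu(self.bn(self.conv(x)) + extra)

m = ConvAddReLU().eval()
example_inputs = (torch.randn(1, 3, 8, 8), torch.randn(1, 3, 8, 8))

# Insert observers according to the onednn qconfigs and backend config
prepared = prepare_fx(
    m,
    get_default_qconfig_mapping("onednn"),
    example_inputs=example_inputs,
    backend_config=get_onednn_backend_config(),
)
prepared(*example_inputs)  # calibration pass to populate observers

# Convert to a quantized GraphModule; with the onednn backend config,
# the conv+add+relu pattern is fused and lowered during this step
quantized = convert_fx(prepared, backend_config=get_onednn_backend_config())
```

Running `quantized` end to end additionally requires `torch.backends.quantized.engine = "onednn"` on a build where that engine is available.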