Cortex-M: Fuse relu activation into quantized_add #18462
rascani merged 5 commits into pytorch:main
Conversation
ResNet8 has skip connections with relu(add(conv(x), skip(x))). The ActivationFusionPass only fused relu into conv/linear, leaving 3 unfused relu ops that fell through to the portable aten::relu.out kernel, which incorrectly clamps int8 tensors at literal 0 instead of at the quantized zero_point, causing numerical mismatches on the FVP.

Add fused activation patterns (relu, hardtanh, clamp) for add/add_ to quantizer_support.py BINARY_OP_PATTERNS so the quantizer produces activation-aware quantization bounds. Add aten.add.Tensor to ActivationFusionPass FUSE_OPS. Update QuantizedOpFusionPass to read activation bounds from output_qparams and pass them to quantized_add. Update the quantized_add operator (schema, meta, impl, C++) to accept activation_min/activation_max parameters.

Co-authored-by: Claude <noreply@anthropic.com>
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18462
Note: Links to docs will display an error until the docs builds have been completed.
❌ 3 New Failures, 1 Cancelled Job, 3 Unrelated Failures as of commit e527a04 with merge base 7c79395:
NEW FAILURES - The following jobs have failed:
CANCELLED JOB - The following job was cancelled. Please retry:
BROKEN TRUNK - The following jobs failed but were present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@AdrianLundell - I still need to add tests for this, but I wanted to make sure this is the right approach for the Quantizer.
Add add_relu, add_relu_channels_last, add_hardtanh, and add_hardtanh_channels_last test cases to test_add.py verifying that relu/hardtanh activations are fused into quantized_add. Remove the conv_add_relu xfail from test_nn_modules.py since the fusion now works. Co-authored-by: Claude <noreply@anthropic.com>
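For orientation, a minimal sketch of the shape of such a test module (the class name and details are illustrative, not the actual test_add.py code):

```python
import torch

class AddRelu(torch.nn.Module):
    # relu(x + y) should lower to a single cortex_m quantized_add with fused
    # activation bounds instead of a standalone relu op.
    def forward(self, x, y):
        return torch.relu(x + y)

# A channels_last variant would feed the same module 4D inputs converted with
# .to(memory_format=torch.channels_last).
```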
Remove add_.Tensor + activation fused patterns from BINARY_OP_PATTERNS. Functionalization converts inplace ops to out-of-place before the quantizer runs, so these patterns are never matched. Co-authored-by: Claude <noreply@anthropic.com>
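As a standalone illustration of that point (not code from this PR): after torch.export, functionalization rewrites in-place ops into their out-of-place equivalents, so the quantizer only ever sees aten.add.Tensor.

```python
import torch

class InplaceAdd(torch.nn.Module):
    def forward(self, x, y):
        z = x.clone()
        z.add_(y)          # aten.add_.Tensor in eager mode
        return torch.relu(z)

ep = torch.export.export(InplaceAdd(), (torch.randn(4), torch.randn(4)))
# The exported graph contains the functionalized aten.add.Tensor rather than
# aten.add_.Tensor, so an add_-based quantizer pattern would never match.
print(ep.graph_module.code)
```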
Failures unrelated.
Summary
ResNet8 has skip connections with relu(add(conv(x), skip(x))). The ActivationFusionPass only fused relu into conv/linear, leaving 3 unfused relu ops that fell through to the portable aten::relu.out kernel, which incorrectly clamps int8 tensors at literal 0 instead of at the quantized zero_point, causing numerical mismatches on the FVP.
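To see why clamping at literal 0 diverges from clamping at the zero_point, here is a minimal standalone sketch (the scale/zero_point values are made up for illustration, and this is not the portable kernel's code):

```python
import torch

scale, zero_point = 0.05, -10  # illustrative asymmetric int8 qparams, zero_point != 0

x_fp = torch.tensor([-0.8, -0.2, 0.0, 0.3, 1.2])
x_q = torch.clamp(torch.round(x_fp / scale) + zero_point, -128, 127).to(torch.int8)

relu_correct = torch.clamp(x_q, min=zero_point)  # clamp at the int8 encoding of 0.0
relu_wrong = torch.clamp(x_q, min=0)             # clamp at literal 0 (the bug above)

dequant = lambda q: (q.to(torch.int32) - zero_point).float() * scale
print(dequant(relu_correct))  # [0.0, 0.0, 0.0, 0.3, 1.2], matches torch.relu(x_fp)
print(dequant(relu_wrong))    # [0.5, 0.5, 0.5, 0.5, 1.2], values below 0.5 are clamped up
```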
- Add fused activation patterns (relu, hardtanh, clamp) for add/add_ to BINARY_OP_PATTERNS in quantizer_support.py so the quantizer produces activation-aware quantization bounds.
- Add aten.add.Tensor to the ActivationFusionPass FUSE_OPS.
- Update QuantizedOpFusionPass to read activation bounds from output_qparams and pass them to quantized_add (see the sketch after this list).
- Update the quantized_add operator (schema, meta, impl, C++) to accept activation_min/activation_max parameters.
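For the activation-bound plumbing, a rough sketch of the idea (the function name and hardtanh handling here are assumptions for illustration, not the actual QuantizedOpFusionPass code):

```python
def fused_activation_bounds(activation, scale, zero_point, qmin=-128, qmax=127):
    """Map a fused activation onto (activation_min, activation_max) in the int8 domain."""
    if activation == "relu":
        # relu clamps real values at 0.0, which quantizes to the zero_point.
        return max(qmin, zero_point), qmax
    if activation == "hardtanh":
        # hardtanh with default bounds (-1.0, 1.0); the real bounds would come
        # from the hardtanh node's min_val/max_val arguments.
        lo = round(-1.0 / scale) + zero_point
        hi = round(1.0 / scale) + zero_point
        return max(qmin, lo), min(qmax, hi)
    # No fused activation: use the full quantized range.
    return qmin, qmax
```

The quantized_add kernel can then clamp its requantized int8 output to [activation_min, activation_max], which makes the separate relu node unnecessary.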