Cortex_m backend: Add quantizer + avoid linear decomp #15459

AdrianLundell · 2025-10-30T11:13:03Z

Changes to_edge_and_transform to to_edge which supports the preserver_ops arg of the EdgeCompileConfig to avoid decomposing of the linear op. This significantly simplifies lowering the linear operator as it does not have to be re-fused.

Adds a cortex_m quantizer, with the intention to be general enough to be used for a general MCU. It is implemented as a ComposableQuantizer using multiple instances of a new OperatorConfigQuantizer class. This gives a number of abstraction levels for configuration

McuQuantizer
ComposableQuantizer
OperatorConfig
QuantizerConfig
QuantizationSpec

The new quantizer also adds a transform_for_annotation pass pipeline which allows to fix scalar + tensor operations.

cc @freddan80 @per @zingo @oscarandersson8218 @digantdesai

Changes to_edge_and_transform to to_edge which supports the preserver_ops arg of the EdgeCompileConfig to avoid decomposing of the linear op. This significantly simplifies lowering the linear operator as it does not have to be re-fused. Adds a cortex_m quantizer, with the intention to be general enough to be used for a general MCU. It is implemented as a ComposableQuantizer using multiple instances of a new OperatorConfigQuantizer class. This gives a number of abstraction levels for configuration - McuQuantizer - ComposableQuantizer - OperatorConfig - QuantizerConfig - QuantizationSpec The new quantizer also adds a transform_for_annotation pass pipeline which allows to fix scalar + tensor operations. Old test_quantize_op_fusion_pass test is removed since it is not relevant anymore after the add implementation has been redone. Change-Id: Ic1d5b48623a14d220cb1ba472948db6a1406e0b7 Signed-off-by: Adrian Lundell <adrian.lundell@arm.com>

pytorch-bot · 2025-10-30T11:13:07Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15459

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

[ROCm][CI] Machines under the label linux.rocm.gpu.2, label linux.rocm.gpu.4, linux.rocm.gpu.gfx1100 are undergoing maintenance.

✅ No Failures

As of commit 364159b with merge base a11d555 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

psiddh · 2025-10-31T18:15:08Z

backends/cortex_m/passes/cortex_m_pass_manager.py

 # LICENSE file in the root directory of this source tree.


+from executorch.backends.arm._passes import ScalarsToAttributePass


Do you plan to add "EdgeCompileConfig(preserve_ops...." step in CortexMPassManager to avoid decomposition ?

Or "Do you plan to implement a CortexMPartitioner with ops_to_not_decompose() method to work with to_edge_transform_and_lower(), similar to how the Cadence backend handles operation preservation?"

executorch/backends/cadence/aot/compiler.py

Lines 275 to 277 in 007ccc6

_core_aten_ops_exception_list=TO_EDGE_OP_EXCEPTION_LIST

+ (core_aten_exceptions or []),

preserve_ops=TO_EDGE_PRESERVE_OPS,

I am rather neutral towards how the list of ops to not preserve should be implemented, do you have a preference? From what I noticed preserve_ops in EdgeCompileConfig does not do anything in to_edge_transfrom_and_lower however, only in to_edge.

…nge-1137032

Changes to_edge_and_transform to to_edge which supports the preserver_ops arg of the EdgeCompileConfig to avoid decomposing of the linear op. This significantly simplifies lowering the linear operator as it does not have to be re-fused. Adds a cortex_m quantizer, with the intention to be general enough to be used for a general MCU. It is implemented as a ComposableQuantizer using multiple instances of a new OperatorConfigQuantizer class. This gives a number of abstraction levels for configuration - McuQuantizer - ComposableQuantizer - OperatorConfig - QuantizerConfig - QuantizationSpec The new quantizer also adds a transform_for_annotation pass pipeline which allows to fix scalar + tensor operations. Signed-off-by: Adrian Lundell <adrian.lundell@arm.com>

AdrianLundell added partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm release notes: none Do not include this in the release notes labels Oct 30, 2025

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 30, 2025

AdrianLundell requested a review from psiddh October 30, 2025 11:13

psiddh reviewed Oct 31, 2025

View reviewed changes

Merge branch 'main' of https://github.com/pytorch/executorch into cha…

364159b

…nge-1137032

psiddh approved these changes Nov 3, 2025

View reviewed changes

AdrianLundell merged commit 6ab8723 into pytorch:main Nov 3, 2025
145 checks passed

AdrianLundell deleted the change-1137032 branch November 7, 2025 12:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Cortex_m backend: Add quantizer + avoid linear decomp #15459

Cortex_m backend: Add quantizer + avoid linear decomp #15459

Uh oh!

AdrianLundell commented Oct 30, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Oct 30, 2025 •

edited

Loading

Uh oh!

psiddh Oct 31, 2025

Uh oh!

psiddh Oct 31, 2025

Uh oh!

AdrianLundell Nov 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		# LICENSE file in the root directory of this source tree.


		from executorch.backends.arm._passes import ScalarsToAttributePass

	_core_aten_ops_exception_list=TO_EDGE_OP_EXCEPTION_LIST
	+ (core_aten_exceptions or []),
	preserve_ops=TO_EDGE_PRESERVE_OPS,

Cortex_m backend: Add quantizer + avoid linear decomp #15459

Cortex_m backend: Add quantizer + avoid linear decomp #15459

Uh oh!

Conversation

AdrianLundell commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15459

❗ 1 Active SEVs

✅ No Failures

Uh oh!

psiddh Oct 31, 2025

Choose a reason for hiding this comment

Uh oh!

psiddh Oct 31, 2025

Choose a reason for hiding this comment

Uh oh!

AdrianLundell Nov 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

AdrianLundell commented Oct 30, 2025 •

edited

Loading

pytorch-bot bot commented Oct 30, 2025 •

edited

Loading

AdrianLundell Nov 3, 2025 •

edited

Loading