Arm backend: Rework reporting of qspecs and qconfigs by martinlsm · Pull Request #19016 · pytorch/executorch

martinlsm · 2026-04-21T08:36:28Z

This PR contains four commits:

1. **Move quantizer_reporter out from cortex_m.quantizer**
   Importing `backends.cortex_m.quantizer.quantizer_reporter` from `backends.arm.quantizer.arm_quantizer_utils` forms a cyclic dependency chain. The problem is that importing quantizer_reporter triggers `backends/cortex_m/quantizer/__init__.py`, which in turn has imports leading back to the Arm backend. To fix this, move quantizer_reporter to backends/cortex_m so it can be imported without forming a cycle.
2. **Arm backend: Remove _QuantizerReporterUserMixin**
   `_QuantizerReporterUserMixin` duplicated the existing `QuantizerReporterUser` class. Remove the former and use the latter instead.
3. **Arm backend: Add label attribute to QuantizationConfig**
   The quantizer reporter logs the quantization config in a human-readable format. Prior to this patch, this was done with the help of a dict called `SUPPORTED_QCONFIGS`, which was defined in quantizer_reporter.py and populated by the user. This patch reworks this concept by adding a label attribute to `QuantizationConfig` that the reporter can use to print the config in a human-readable format.
4. **Arm backend: Rework reporting of qspecs**
   The quantization reporter prints quantization specs in human-readable format. Prior to this patch, quantizer_reporter.py defined a dict `SUPPORTED_QSPECS`, populated by the user, that mapped qspec objects to string representations. This patch removes this dict and instead modifies the helper function `_qspec_repr` to return a compact string representation based on the attributes of the qspec.

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell

When importing backends.cortex_m.quantizer.quantizer_reporter from backends.arm.quantizer.arm_quantizer_utils, a cyclic dependency chain is formed. The problem is that quantizer_reporter triggers backends/cortex_m/quantizer/__init__.py when imported, which in turn has imports leading back to the Arm backend. To fix this problem, move quantizer_reporter to backends/cortex_m so it can be imported without forming any cycle. Signed-off-by: Martin Lindström <Martin.Lindstroem@arm.com> Change-Id: I757aa090d35f47bdce4d523b064217c3069adf41
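The mechanism behind this cycle is that Python executes a package's `__init__.py` before it loads any module inside that package. The following is a minimal sketch of that behaviour, using a throwaway package named `demo_pkg` (a hypothetical name, not part of ExecuTorch) built in a temporary directory. It only shows which modules get loaded by each import path and does not reproduce the real Arm/Cortex-M import chain.

```python
import importlib
import sys
from pathlib import Path


def build_demo_package(root: Path) -> None:
    """Create a hypothetical layout with a nested and a relocated reporter module."""
    pkg = root / "demo_pkg"
    sub = pkg / "quantizer"
    sub.mkdir(parents=True)
    (pkg / "__init__.py").write_text("")
    # Stand-in for an __init__.py whose imports lead back to another backend.
    (sub / "__init__.py").write_text("# heavy imports would live here\n")
    (sub / "reporter.py").write_text("NAME = 'nested reporter'\n")
    # The relocated module sits one level up, outside the sub-package.
    (pkg / "reporter.py").write_text("NAME = 'relocated reporter'\n")


def imported_modules_after(module_name: str, root: Path) -> set[str]:
    """Import module_name from a clean state and return the demo_pkg modules loaded."""
    for name in [n for n in sys.modules if n.startswith("demo_pkg")]:
        del sys.modules[name]
    sys.path.insert(0, str(root))
    try:
        importlib.invalidate_caches()
        importlib.import_module(module_name)
        return {n for n in sys.modules if n.startswith("demo_pkg")}
    finally:
        sys.path.remove(str(root))
```

Importing `demo_pkg.quantizer.reporter` loads `demo_pkg.quantizer` as a side effect, whereas importing `demo_pkg.reporter` does not. That is why moving the module up one directory level can break a cycle.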

_QuantizerReporterUserMixin duplicated the existing QuantizerReporterUser class. Remove the former and use the latter instead. Signed-off-by: Martin Lindström <Martin.Lindstroem@arm.com> Change-Id: Iafe28304ab1813a8f88f58e75fc9a277158e7773

The quantizer reporter logs the quantization config in a human-readable format. Prior to this patch, this was done with the help of a dict called `SUPPORTED_QCONFIGS`, which was defined in quantizer_reporter.py and populated by the user. This patch reworks this concept by instead adding a label attribute to `QuantizationConfig` that the reporter can use to print the config in a human-readable format. Signed-off-by: Martin Lindström <Martin.Lindstroem@arm.com> Change-Id: I38e80c9c3d57fb9d858119fe4281b713bf472475
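As an illustration of the idea, the sketch below attaches an optional label to a config object so that a reporter can print it without consulting an external registry. `DemoQuantizationConfig`, its fields, and the fallback string are hypothetical stand-ins chosen for this example. They are not the actual `QuantizationConfig` definition from the Arm backend.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DemoQuantizationConfig:
    """Hypothetical stand-in for a quantization config that carries its own label."""

    input_activation: Optional[str]
    weight: Optional[str]
    # Human-readable name used only for reporting; it does not affect quantization.
    label: Optional[str] = None


def describe(config: DemoQuantizationConfig) -> str:
    """Return the text a reporter might print for this config."""
    # The fallback text is an assumption made for this sketch.
    return config.label if config.label is not None else "<unlabeled qconfig>"


labeled = DemoQuantizationConfig("int8", "int8", label="INT8 per-tensor")
unlabeled = DemoQuantizationConfig("int16", "int8")
print(describe(labeled))
print(describe(unlabeled))
```

With this design the label travels with the config instance, so code that creates a config no longer has to register it in a module-level dict before the reporter can name it.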

The quantization reporter prints quantization specs in human-readable format. Prior to this patch, this was implemented such that quantizer_reporter.py defined a dict `SUPPORTED_QSPECS` which was populated by the user. This dict would map qspec objects to string representations. This patch removes this dict and instead modifies the helper function `_qspec_repr` to return a compact string representation based on the attributes of the qspec. Signed-off-by: Martin Lindström <Martin.Lindstroem@arm.com> Change-Id: I9ccd9127b8c332e7c30662be6986ccad4a38881f
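A rough sketch of what an attribute-based representation could look like is shown below. `DemoQSpec`, `demo_qspec_repr`, and the output format are assumptions made for illustration. The actual helper in quantizer_reporter.py may use different attributes and a different format.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DemoQSpec:
    """Hypothetical stand-in for a quantization spec."""

    dtype: str
    quant_min: Optional[int] = None
    quant_max: Optional[int] = None


def demo_qspec_repr(qspec: Optional[DemoQSpec]) -> str:
    """Build a compact string from the spec's own attributes (format is assumed)."""
    if qspec is None:
        return "NO_QSPEC"
    parts = [qspec.dtype.upper()]
    # Only mention the range when both bounds are set explicitly.
    if qspec.quant_min is not None and qspec.quant_max is not None:
        parts.append(f"range=[{qspec.quant_min}, {qspec.quant_max}]")
    return " ".join(parts)


print(demo_qspec_repr(DemoQSpec("int8", -127, 127)))
print(demo_qspec_repr(DemoQSpec("int16")))
print(demo_qspec_repr(None))
```

Because the string is derived from the attributes, any spec can be printed, including ones that were never registered anywhere.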

pytorch-bot · 2026-04-21T08:36:33Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19016

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

Rolling out OSDC (ARC) runners on pull & trunk workflows in PyTorch main

❌ 3 New Failures, 3 Unrelated Failures

As of commit 1076083 with merge base 6be4fb5:

NEW FAILURES - The following jobs have failed:

- Cadence Build & Test / cpu-test / test-aot / test-aot (gh)
  backends/cadence/aot/tests/test_replace_ops_passes.py::TestReplaceOpsPasses::test_replace_transposed_conv_with_linear_0
- pull / test-multimodal-linux (gemma3-4b) / linux-job (gh)
  RuntimeError: Command docker exec -t e800aff921fdfda329dca66bfd3a664b1b0b392cec9d09dc1bb17dab25f359b4 /exec failed with exit code 139
- trunk / test-models-macos-cpu (resnet50, xnnpack-quantization-delegation) / macos-job (gh)
  RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 1

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

- pull / unittest / windows / windows-job (gh) (trunk failure)
  ##[error]The operation was canceled.
- pull / unittest-editable / windows / windows-job (gh) (trunk failure)
  ##[error]The operation was canceled.
- trunk / unittest-release / windows / windows-job (gh) (trunk failure)
  ##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

martinlsm · 2026-04-21T08:36:45Z

@pytorchbot label ciflow/trunk

martinlsm · 2026-04-21T08:36:53Z

@pytorchbot label "partner: arm"

martinlsm · 2026-04-21T08:37:02Z

@pytorchbot label "release notes: none"

Copilot

Pull request overview

This PR refactors quantization reporting for Arm/Cortex-M backends to avoid import cycles and to make qconfig/qspec reporting self-describing (via QuantizationConfig.label and attribute-based qspec formatting) rather than relying on external registration dicts.

Changes:

Move quantizer_reporter to backends/cortex_m/ and update imports/build targets to eliminate a Cortex-M ↔ Arm cyclic dependency.
Replace qconfig/qspec “registry dict” reporting with QuantizationConfig.label and a new qspec_repr() helper for compact, attribute-based qspec formatting.
Update Arm/Cortex-M quantizers and tests to use the new reporter API (QuantizerInfo.qconfig_label) and new string representations.

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 1 comment.

Show a summary per file

| File | Description |
| --- | --- |
| backends/cortex_m/test/misc/test_quantizer_reporter.py | Updates tests to use the relocated reporter module and asserts new qspec string formatting via `qspec_repr()`. |
| backends/cortex_m/quantizer_reporter.py | Relocates and refactors reporter: removes qspec/qconfig registries, adds `qspec_repr()`, and changes `QuantizerInfo` to use `qconfig_label`. |
| backends/cortex_m/quantizer/quantizer.py | Updates reporter import path to the new module location. |
| backends/cortex_m/quantizer/quantization_configs.py | Stops registering qspec/qconfig dicts; instead sets human-readable labels directly on config instances. |
| backends/cortex_m/quantizer/TARGETS | Removes per-subdir reporter target; depends on the new top-level `//executorch/backends/cortex_m:quantizer_reporter`. |
| backends/cortex_m/TARGETS | Adds a new Buck target exporting `quantizer_reporter.py` from the Cortex-M root. |
| backends/arm/quantizer/quantization_config.py | Adds optional `label` field to `QuantizationConfig` for debug/reporting. |
| backends/arm/quantizer/arm_quantizer_utils.py | Removes duplicated reporter mixin and switches to `QuantizerReporterUser`; updates qconfig labeling logic. |
| backends/arm/quantizer/arm_quantizer.py | Constructs and propagates meaningful `label` strings when creating quantization configs (instead of external registration). |

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Copilot

Pull request overview

This PR restructures Cortex-M/Arm quantization reporting to eliminate import cycles and to make qconfig/qspec reporting more self-describing (via labels and attribute-based qspec formatting).

Changes:

Move quantizer_reporter to backends/cortex_m/quantizer_reporter.py and adjust Buck targets/imports accordingly.
Remove the Arm-side _QuantizerReporterUserMixin in favor of QuantizerReporterUser, and update QuantizerInfo to use a qconfig_label.
Add label to QuantizationConfig and rework qspec reporting to use qspec_repr() instead of user-populated registries.

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 1 comment.

Show a summary per file

| File | Description |
| --- | --- |
| backends/cortex_m/test/misc/test_quantizer_reporter.py | Updates tests for new reporter module path and new `qspec_repr()` output (adds unit tests for representations). |
| backends/cortex_m/quantizer_reporter.py | Introduces `qspec_repr()`, renames `QuantizerInfo` field, and changes `QuantizerReport` to store `QuantizerInfo` directly. |
| backends/cortex_m/quantizer/quantizer.py | Updates import to the relocated `QuantizerReporter`. |
| backends/cortex_m/quantizer/quantization_configs.py | Removes reporter registry updates and sets `label` on predefined Cortex-M qconfigs. |
| backends/cortex_m/quantizer/TARGETS | Removes local `quantizer_reporter` library and depends on the new top-level cortex_m reporter target. |
| backends/cortex_m/TARGETS | Adds new Buck `python_library` target for `quantizer_reporter.py`. |
| backends/arm/quantizer/quantization_config.py | Adds optional `label` field to `QuantizationConfig` for debugging/visualization/reporting. |
| backends/arm/quantizer/arm_quantizer_utils.py | Removes duplicated reporter-user mixin; uses `QuantizerReporterUser` and reports `qconfig_label` from `QuantizationConfig.label`. |
| backends/arm/quantizer/arm_quantizer.py | Builds human-readable labels for produced quantization configs; removes old reporter registration logic. |

Comments suppressed due to low confidence (1)

backends/cortex_m/quantizer_reporter.py:53

qspec_repr()'s QuantizationSpec formatting currently only includes dtype and (optional) quant_min/quant_max. This collapses distinct qspecs (e.g., per-tensor vs per-channel, different qscheme/ch_axis/is_dynamic) to the same string, which can make the reporter output misleading and less useful for debugging. Consider including qscheme (and ch_axis when per-channel) and possibly is_dynamic in the representation so different specs remain distinguishable.
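The concern raised in this comment can be made concrete with a small sketch. The class and both formatting functions below are hypothetical and only model the attributes named in the comment (`dtype`, `quant_min`, `quant_max`, `qscheme`, `ch_axis`, `is_dynamic`). They are not the code under review.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DemoQSpec:
    """Hypothetical spec carrying the attributes named in the review comment."""

    dtype: str
    quant_min: Optional[int] = None
    quant_max: Optional[int] = None
    qscheme: str = "per_tensor_affine"
    ch_axis: Optional[int] = None
    is_dynamic: bool = False


def compact_repr(q: DemoQSpec) -> str:
    """Format only dtype and the optional range, as the comment describes."""
    text = q.dtype.upper()
    if q.quant_min is not None and q.quant_max is not None:
        text += f"[{q.quant_min}, {q.quant_max}]"
    return text


def detailed_repr(q: DemoQSpec) -> str:
    """Also include qscheme, ch_axis (when set), and a dynamic marker."""
    parts = [compact_repr(q), q.qscheme]
    if q.ch_axis is not None:
        parts.append(f"ch_axis={q.ch_axis}")
    if q.is_dynamic:
        parts.append("dynamic")
    return ", ".join(parts)


per_tensor = DemoQSpec("int8", -127, 127, qscheme="per_tensor_symmetric")
per_channel = DemoQSpec("int8", -127, 127, qscheme="per_channel_symmetric", ch_axis=0)

# Both specs share one compact string but stay distinct in the detailed form.
print(compact_repr(per_tensor) == compact_repr(per_channel))
print(detailed_repr(per_tensor))
print(detailed_repr(per_channel))
```

The trade-off is between a shorter report line and the ability to tell per-tensor and per-channel specs apart at a glance.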

@@ -357,59 +361,32 @@ def get_symmetric_a16w8_quantization_config(
        is_qat=is_qat,
        is_dynamic=is_dynamic,


Martin Lindström added 4 commits April 21, 2026 10:30

martinlsm requested a review from rascani as a code owner April 21, 2026 08:36

Copilot AI review requested due to automatic review settings April 21, 2026 08:36

martinlsm requested a review from digantdesai as a code owner April 21, 2026 08:36

meta-cla Bot added the "CLA Signed" label (This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.) Apr 21, 2026

pytorch-bot Bot added the "ciflow/trunk" label Apr 21, 2026

pytorch-bot Bot added the "partner: arm" label (For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm) Apr 21, 2026

Copilot started reviewing on behalf of martinlsm April 21, 2026 08:37

pytorch-bot Bot added the "release notes: none" label (Do not include this in the release notes) Apr 21, 2026

Copilot AI reviewed Apr 21, 2026

Comment thread backends/arm/quantizer/arm_quantizer_utils.py Outdated

rascani requested a review from AdrianLundell April 21, 2026 15:34

Potential fix for pull request finding

abafacd

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings April 22, 2026 06:03

github-actions Bot added the "module: arm" label (Issues related to arm backend) Apr 22, 2026

Copilot started reviewing on behalf of martinlsm April 22, 2026 06:03

Merge branch 'main' into qspecs

1076083

Copilot AI reviewed Apr 22, 2026

Comment thread backends/arm/quantizer/arm_quantizer.py

@@ -357,59 +361,32 @@ def get_symmetric_a16w8_quantization_config(

is_qat=is_qat,

is_dynamic=is_dynamic,

AdrianLundell approved these changes Apr 22, 2026

martinlsm merged commit 48ec3fc into pytorch:main Apr 22, 2026
431 of 439 checks passed

martinlsm deleted the qspecs branch April 22, 2026 11:42

martinlsm merged 6 commits into pytorch:main from martinlsm:qspecs

3 participants
