Arm backend: Fix quantized constant-folding for aten.cat lists (#18971) by perheld · Pull Request #19064 · pytorch/executorch

perheld · 2026-04-23T11:28:03Z

FuseConstantArgsPass resolved input_qparams by flattened input-node index, while FoldAndAnnotateQParamsPass stores them by top-level argument index. For aten.cat with a list-valued tensor argument, this caused only the first tensor to be dequantized before folding, which corrupted the fused constant.

Resolve qparams by top-level argument index and propagate that qparam through nested list and tuple arguments. Add a regression test for quantized aten.cat constant folding with list-valued tensor inputs.
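To make the index mismatch concrete, here is a minimal pure-Python sketch. It is not the ExecuTorch implementation: `QParams`, `fold_cat_by_flat_index`, and `fold_cat_by_arg_index` are hypothetical names, plain lists stand in for tensors, and the dictionary only mimics the idea of qparams keyed by top-level argument index.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QParams:
    """Hypothetical per-tensor affine quantization parameters."""

    scale: float
    zero_point: int

    def dequantize(self, values):
        return [(v - self.zero_point) * self.scale for v in values]


# A cat-like call: args = ([t0, t1], dim). The tensor list is top-level argument 0.
t0 = [10, 20]
t1 = [30, 40]
args = ([t0, t1], 0)

# Qparams keyed by top-level argument index: one entry covers the whole list.
input_qparams = {0: QParams(scale=0.5, zero_point=0)}


def fold_cat_by_flat_index(args, input_qparams):
    """Buggy lookup (illustrative): the key is the flattened tensor position."""
    tensors, _dim = args
    out = []
    for flat_idx, tensor in enumerate(tensors):
        qp = input_qparams.get(flat_idx)  # only flat_idx == 0 finds an entry
        out.extend(qp.dequantize(tensor) if qp else tensor)
    return out


def fold_cat_by_arg_index(args, input_qparams):
    """Fixed lookup (illustrative): the key is the top-level argument position."""
    tensors, _dim = args
    qp = input_qparams.get(0)  # the list itself is argument 0
    out = []
    for tensor in tensors:
        out.extend(qp.dequantize(tensor) if qp else tensor)
    return out


buggy = fold_cat_by_flat_index(args, input_qparams)
fixed = fold_cat_by_arg_index(args, input_qparams)
print(buggy)  # [5.0, 10.0, 30, 40]  (second tensor left in the quantized domain)
print(fixed)  # [5.0, 10.0, 15.0, 20.0]
```

In the buggy variant the second tensor is concatenated while still quantized, which corresponds to the corrupted fused constant described above.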

Change-Id: I6e1a012d82a5dbeecb403c440a2944953dd5cba7

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell

…ch#18971)

FuseConstantArgsPass resolved input_qparams by flattened input-node index, while FoldAndAnnotateQParamsPass stores them by top-level argument index. For aten.cat with a list-valued tensor argument, this caused only the first tensor to be dequantized before folding, which corrupted the fused constant.

Resolve qparams by top-level argument index and propagate that qparam through nested list and tuple arguments. Add a regression test for quantized aten.cat constant folding with list-valued tensor inputs.

Signed-off-by: Per Held <per.held@arm.com>
Change-Id: I6e1a012d82a5dbeecb403c440a2944953dd5cba7

pytorch-bot · 2026-04-23T11:28:08Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19064

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

Rolling out OSDC (ARC) runners on pull & trunk workflows in PyTorch main

❌ 1 New Failure, 2 Cancelled Jobs, 3 Unrelated Failures

As of commit ca01b3e with merge base c48ea12:

NEW FAILURE - The following job has failed:

pull / test-multimodal-linux (gemma3-4b) / linux-job (gh)
RuntimeError: Command docker exec -t 5ee66772297764cadc18f8974251c696b120fa6e6afcd9bb3e4666919ff4faef /exec failed with exit code 139

CANCELLED JOBS - The following jobs were cancelled. Please retry:

pull / unittest / macos / macos-job (gh)
trunk / test-models-macos-mps / macos-job (gh)
##[error]The operation was canceled.

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

pull / unittest / windows / windows-job (gh) (matched win rule in flaky-rules.json)
##[error]The operation was canceled.
trunk / unittest-release / windows / windows-job (gh) (matched win rule in flaky-rules.json)
##[error]The operation was canceled.

BROKEN TRUNK - The following job failed but was also failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest-editable / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Copilot

Pull request overview

Fixes Arm backend constant-folding for quantized ops by aligning how input_qparams are resolved with how they’re produced, specifically for ops like aten.cat where tensor inputs can be nested inside list/tuple arguments.

Changes:

Update FuseConstantArgsPass to resolve input_qparams by top-level positional argument index (and propagate that qparam through nested list/tuple args).
Add a regression test that constant-folds a quantized aten.cat whose tensor inputs are passed via a list/tuple.
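The "propagate through nested list/tuple args" part of the change can be sketched as a small recursive helper. This is an illustrative sketch under stated assumptions, not the code in `fuse_constant_ops_pass.py`: `dequantize_arg` and `dequantize_args` are hypothetical names, scalars stand in for tensors, and qparams are modeled as `(scale, zero_point)` tuples keyed by top-level argument index.

```python
def dequantize_arg(arg, scale, zero_point):
    """Apply one qparam to every leaf of a (possibly nested) list/tuple argument."""
    if isinstance(arg, (list, tuple)):
        # Preserve the container type while recursing into its elements.
        return type(arg)(dequantize_arg(a, scale, zero_point) for a in arg)
    return (arg - zero_point) * scale


def dequantize_args(args, qparams_by_arg_index):
    """Resolve qparams per top-level argument; arguments without qparams pass through."""
    resolved = []
    for arg_index, arg in enumerate(args):
        qp = qparams_by_arg_index.get(arg_index)
        resolved.append(arg if qp is None else dequantize_arg(arg, *qp))
    return tuple(resolved)


# Argument 0 is a nested container of "tensors"; argument 1 is a plain dim value.
example = dequantize_args(([4, (6, 8)], 1), {0: (0.25, 2)})
print(example)  # ([0.5, (1.0, 1.5)], 1)
```

Keying the lookup on the top-level position means every element of the list inherits the same qparam, while non-quantized arguments such as `dim` are left untouched.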

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| `backends/arm/_passes/fuse_constant_ops_pass.py` | Fix qparam lookup to use top-level arg index and apply it to nested list/tuple tensor args during constant folding. |
| `backends/arm/test/passes/test_fuse_constant_ops_pass.py` | Add a regression test ensuring quantized constant-folding for `aten.cat` with list/tuple tensor inputs produces the correct fused constant. |

+    cat_node = next(
+        node
+        for node in exported_program.graph_module.graph.nodes
+        if node.op == "call_function"


+        exported_program.graph_module
+    )
+
+    assert list(exported_program.state_dict) == ["aten_cat_default_fused_const"]


Copilot AI review requested due to automatic review settings April 23, 2026 11:28

perheld requested a review from digantdesai as a code owner April 23, 2026 11:28

perheld added the partner: arm, ciflow/trunk, and release notes: arm labels Apr 23, 2026

meta-cla Bot added the CLA Signed label Apr 23, 2026

github-actions Bot added the module: arm label Apr 23, 2026

Copilot started reviewing on behalf of perheld April 23, 2026 11:28

oscarandersson8218 approved these changes Apr 23, 2026

Copilot AI reviewed Apr 23, 2026

perheld linked an issue Apr 23, 2026 that may be closed by this pull request

Arm backend: Quantized aten.cat constant-fold bug with list-valued tensor inputs #18971

Closed

perheld merged commit 2d995bc into pytorch:main Apr 23, 2026
436 of 454 checks passed

perheld deleted the change-1242954 branch April 23, 2026 19:46

chanil222 mentioned this pull request May 7, 2026

[ARM/Ethos-U] Partition boundary buffer dtype mismatch causes silent accuracy loss in attention models #19364

Open

perheld merged 1 commit into pytorch:main from perheld:change-1242954
