Arm backend: Fix torch.matmul() failures for 2D tensor inputs #14624

YufengShi-dudu · 2025-09-26T08:40:42Z

ConvertMmToBmmPass converts an MM node into a BMM node, changes the input and output tensors from rank 2 to rank 3 via unsqueeze/squeeze, and inserts Q-DQ pairs before and after the BMM node when necessary.
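
As a rough illustration of why the rank change is safe, the sketch below uses plain nested lists rather than tensors. It is not the ExecuTorch pass: the helper names (`mm`, `bmm`, `unsqueeze0`, `squeeze0`) are hypothetical, and adding the batch dimension at position 0 is an assumption made for the example.

```python
# Toy model with nested lists; not the actual ConvertMmToBmmPass code.

def mm(a, b):
    """Rank-2 matrix multiply: (n, k) x (k, m) -> (n, m)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def bmm(a, b):
    """Rank-3 batched multiply: (B, n, k) x (B, k, m) -> (B, n, m)."""
    return [mm(x, y) for x, y in zip(a, b)]

def unsqueeze0(t):
    """Add a leading batch dimension of size 1 (assumed position)."""
    return [t]

def squeeze0(t):
    """Drop a leading batch dimension of size 1."""
    assert len(t) == 1
    return t[0]

x = [[1, 2], [3, 4]]
y = [[5, 6], [7, 8]]

# unsqueeze -> bmm -> squeeze gives the same result as a plain rank-2 mm.
via_bmm = squeeze0(bmm(unsqueeze0(x), unsqueeze0(y)))
assert via_bmm == mm(x, y)
```
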
After ConvertMmToBmmPass:

  x -> q   -> dq   -> unsqueeze -> q_2 -> dq_2 ->
                                                 \
                                                bmm -> q_4 -> dq_4
                                                 /
  y -> q_1 -> dq_1 -> unsqueeze -> q_3 -> dq_3 ->

Therefore, if the original matmul was 2D, the BMM already has DQ nodes on its inputs and a Q node on its output. If AnnotateDecomposedMatmulPass (Arm backend: Add support for single input matmul #10654) is still applied in this case, it produces illegal sequences such as: x -> q -> unsqueeze -> q_2 (invalid)
Fix this by checking whether the BMM is already surrounded by DQ nodes on its inputs and a Q node on its output.
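
The check described above can be sketched on a toy graph. This is a minimal model, not the code in this PR: the `Node` class, the op tags, `connect`, `build`, and `already_quantized` are all hypothetical names, and the rank-N case (`dq -> expand -> bmm -> view`) is an assumed shape used only for contrast.

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    name: str
    op: str  # hypothetical op tags: "placeholder", "q", "dq", "unsqueeze", "bmm", ...
    inputs: list = field(default_factory=list)
    users: list = field(default_factory=list)

def connect(*nodes):
    """Link nodes in a chain: nodes[0] -> nodes[1] -> ..."""
    for src, dst in zip(nodes, nodes[1:]):
        dst.inputs.append(src)
        src.users.append(dst)

def already_quantized(bmm):
    """True if every bmm input is a DQ node and every bmm user is a Q node."""
    return (
        bool(bmm.inputs)
        and bool(bmm.users)
        and all(n.op == "dq" for n in bmm.inputs)
        and all(n.op == "q" for n in bmm.users)
    )

def build(input_ops, bmm_user_op):
    """Build two identical input chains feeding one bmm with a single user."""
    bmm = Node("bmm", "bmm")
    for side in ("x", "y"):
        chain = [Node(f"{side}_{op}_{i}", op) for i, op in enumerate(input_ops)]
        connect(*chain, bmm)
    connect(bmm, Node("out", bmm_user_op))
    return bmm

# 2D case after ConvertMmToBmmPass: x -> q -> dq -> unsqueeze -> q_2 -> dq_2 -> bmm -> q_4
bmm_2d = build(["placeholder", "q", "dq", "unsqueeze", "q", "dq"], "q")
# Assumed rank-N case: bmm is not directly wrapped by DQ/Q nodes.
bmm_nd = build(["placeholder", "q", "dq", "expand"], "view")

assert already_quantized(bmm_2d)      # already wrapped, so leave the Q/DQ nodes alone
assert not already_quantized(bmm_nd)  # still needs the annotation pass
```

In this model, a pass would presumably skip any BMM for which `already_quantized` returns true, which avoids producing the invalid `q -> unsqueeze -> q_2` sequence.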

Change-Id: I9949d59b0b4a96fa34a88b0734014567ea6f24cc

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218

- ConvertMmToBmmPass converts an MM node to BMM nodes, turns input and output tensors from rank-2 to rank-3 via unsqueeze/squeeze, and inserts q-dq before and after BMM node when necessary.
- After ConvertMmToBmmPass:

    x -> q   -> dq   -> unsqueeze -> q_2 -> dq_2 ->
                                                   \
                                                    bmm -> q_4 -> dq_4
                                                   /
    y -> q_1 -> dq_1 -> unsqueeze -> q_3 -> dq_3 ->

- Therefore, if the original matmul was 2D, the bmm already has DQ nodes on its inputs and Q node on its output. If AnnotateDecomposedMatmulPass (pytorch#10654) is still applied in this case, it produces illegal sequences such as: x -> q -> unsqueeze -> q_2 (invalid)
- Fix by checking whether the BMM is already surrounded by DQ nodes on its inputs and Q nodes on its output.

Change-Id: I9949d59b0b4a96fa34a88b0734014567ea6f24cc
Signed-off-by: Yufeng Shi <yufeng.shi@arm.com>
Co-authored-by: Oscar Andersson <oscar.andersson@arm.com>

pytorch-bot · 2025-09-26T08:40:46Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14624

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 2 Unrelated Failures

As of commit bb2fbb9 with merge base dcc3978:

NEW FAILURE - The following job has failed:

pull / test-samsung-models-linux / linux-job (gh)
RuntimeError: Command docker exec -t 2b6b52fe1104f8f3a54e8a5995b00c6e3e8837b26f0bcb258ccd12a659740215 /exec failed with exit code 1

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / test-binary-size-linux-gcc / linux-job (gh) (trunk failure)
pull / test-setup-linux-gcc / linux-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

YufengShi-dudu · 2025-10-02T13:05:04Z

If possible, I suggest we get this fix into the 1.0 release branch

digantdesai

Thanks @YufengShi-dudu for the bug fix. Please mark the PR to be cherry-picked in 1.0. Thanks again.

YufengShi-dudu · 2025-10-07T07:29:37Z

@pytorchbot cherry-pick --onto release/1.0 -c regression

@digantdesai

- ConvertMmToBmmPass converts an MM node to BMM nodes, turns input and output tensors from rank-2 to rank-3 via unsqueeze/squeeze, and inserts q-dq before and after BMM node when necessary.
- After ConvertMmToBmmPass:

```
x -> q   -> dq   -> unsqueeze -> q_2 -> dq_2 ->
                                               \
                                                bmm -> q_4 -> dq_4
                                               /
y -> q_1 -> dq_1 -> unsqueeze -> q_3 -> dq_3 ->
```

- Therefore, if the original matmul was 2D, the bmm already has DQ nodes on its inputs and Q node on its output. If AnnotateDecomposedMatmulPass (#10654) is still applied in this case, it produces illegal sequences such as: x -> q -> unsqueeze -> q_2 (invalid)
- Fix by checking whether the BMM is already surrounded by DQ nodes on its inputs and Q nodes on its output.

Change-Id: I9949d59b0b4a96fa34a88b0734014567ea6f24cc

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218

Signed-off-by: Yufeng Shi <yufeng.shi@arm.com>
Co-authored-by: Oscar Andersson <oscar.andersson@arm.com>
(cherry picked from commit 9a7fb42)

pytorchbot · 2025-10-07T07:32:01Z

Cherry picking #14624

The cherry pick PR is at #14845 and it is recommended to link a regression cherry pick PR with an issue. The following tracker issues are updated:

[v1.0.0] Release Tracker #14288 (comment)

Details for Dev Infra team

Raised by workflow job

YufengShi-dudu requested review from zingo and oscarandersson8218 September 26, 2025 08:40

YufengShi-dudu requested a review from digantdesai as a code owner September 26, 2025 08:40

YufengShi-dudu added the `partner: arm`, `ciflow/trunk`, and `release notes: none` labels Sep 26, 2025

meta-cla bot added the `CLA Signed` label Sep 26, 2025

YufengShi-dudu added 3 commits September 26, 2025 09:40

Merge branch 'main' into fix-matmul-failures-for-2d-inputs

ffe14cc

Merge branch 'main' into fix-matmul-failures-for-2d-inputs

3865429

Merge branch 'main' into fix-matmul-failures-for-2d-inputs

bb2fbb9

digantdesai approved these changes Oct 6, 2025


zingo merged commit 9a7fb42 into pytorch:main Oct 6, 2025
276 of 279 checks passed

pytorchbot mentioned this pull request Oct 7, 2025

[v1.0.0] Release Tracker #14288

Open
