Fix int32 torch.mm runtime by lowering to matmul #2673

TobyRoseman merged 2 commits into apple:main
Conversation
Pull request overview
This PR fixes a Core ML runtime failure for int32 torch.mm by ensuring torch.mm/torch.bmm lower to MIL matmul instead of the constant-weight linear lowering path.
Changes:
- Update the Torch frontend `matmul` lowering to always emit `mb.matmul` (after dtype promotion) rather than conditionally using `mb.linear` for a constant RHS.
- Add a regression test verifying that an `int32` constant-weight `torch.mm` converts to a graph containing `matmul` and not `linear`, and (when runnable) matches the runtime output; a hedged sketch of such a test follows this list.
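A regression test along these lines might look roughly like the sketch below. The module name, the constant `WEIGHT`, and the use of the internal `_mil_program` attribute to inspect op types are illustrative assumptions, not the exact test added by this PR.

```python
import numpy as np
import torch
import coremltools as ct

# Constant int32 RHS, mirroring the constant-weight torch.mm case from the issue.
WEIGHT = torch.arange(6, dtype=torch.int32).reshape(3, 2)

class MMModel(torch.nn.Module):
    def forward(self, x):
        return torch.mm(x, WEIGHT)

x = torch.arange(12, dtype=torch.int32).reshape(4, 3)
traced = torch.jit.trace(MMModel().eval(), x)

mlmodel = ct.convert(
    traced,
    inputs=[ct.TensorType(name="x", shape=x.shape, dtype=np.int32)],
    convert_to="mlprogram",
)

# Inspect the converted MIL program (internal attribute; the exact inspection
# API may differ between coremltools versions).
op_types = [op.op_type for op in mlmodel._mil_program.functions["main"].operations]
assert "matmul" in op_types
assert "linear" not in op_types
```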
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.
| File | Description |
|---|---|
| coremltools/converters/mil/frontend/torch/ops.py | Changes lowering for mm/bmm/matmul to always use mb.matmul to avoid buggy linear lowering with int32 constant weights. |
| coremltools/converters/mil/frontend/torch/test/test_torch_ops.py | Adds regression coverage to ensure converted graphs use matmul (not linear) for int32 torch.mm with a constant weight. |
 @register_torch_op(torch_alias=["bmm", "mm"])
 def matmul(context, node):
     x, y = _get_inputs(context, node, expected=2)
-    res = _construct_matmul(x, y, node.name)
+    x, y = promote_input_dtypes([x, y])
+    # Keep mm/bmm on the matmul path even when the RHS is constant. Lowering
+    # constant int32 weights to linear produces incorrect/runtime behavior.
+    res = mb.matmul(x=x, y=y, name=node.name)
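For contrast, a minimal sketch of the kind of conditional constant-weight lowering this change moves away from. The helper name and exact conditions are assumptions for illustration, not the repository's pre-change code; note that MIL's `linear` op computes `x @ weight.T`, so a constant RHS would have to be transposed.

```python
import numpy as np
from coremltools.converters.mil import Builder as mb

def _lower_matmul_conditionally(x, y, name):
    # Illustrative only: route a constant 2-D RHS through mb.linear,
    # otherwise fall back to mb.matmul.
    if y.val is not None and y.val.ndim == 2:
        # linear computes x @ weight.T, so the constant RHS is transposed here.
        return mb.linear(x=x, weight=np.transpose(y.val), name=name)
    return mb.matmul(x=x, y=y, name=name)
```

Keeping mm/bmm on the matmul path avoids this special case entirely, which is what the comment in the diff above refers to.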
@holly-agyei - there are CI failures, please take a look:

@TobyRoseman

Hi @TobyRoseman. Edit: I can see that it has been retried. Thanks.

I restarted those jobs and the CI passed. Thanks for the pull request @holly-agyei.
Fixes #2575
Summary
Testing