Refactor quantizer: Only replace with per-tensor variants #14974

DrJessop · 2025-10-09T23:59:26Z

Summary:
In our previous flow, we would replace ops with default variants, have a special fusion pass which constructs singleton tensors for a variety of fused quantized ops, and then we would call a replace ops to turn them into per-tensor-variants.

I confirmed this was for legacy reasons, so a cleanup was much due.

This diff also fixes any ref implementations during the refactor.

Reviewed By: zonglinpeng

Differential Revision: D83873738

pytorch-bot · 2025-10-09T23:59:29Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14974

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2025-10-09T23:59:34Z

@DrJessop has exported this pull request. If you are a Meta employee, you can view the originating Diff in D83873738.

github-actions · 2025-10-10T00:00:13Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

) Summary: In our previous flow, we would replace ops with default variants, have a special fusion pass which constructs singleton tensors for a variety of fused quantized ops, and then we would call a replace ops to turn them into per-tensor-variants. I confirmed this was for legacy reasons, so a cleanup was much due. This diff also fixes any ref implementations during the refactor. Reviewed By: zonglinpeng Differential Revision: D83873738

) Summary: In our previous flow, we would replace ops with default variants, have a special fusion pass which constructs singleton tensors for a variety of fused quantized ops, and then we would call a replace ops to turn them into per-tensor-variants. I confirmed this was for legacy reasons, so a cleanup was much due. This diff directly replaces ops with the per-tensor variants and removes the pass which replaces singleton tensors with scalars. Reviewed By: zonglinpeng Differential Revision: D83873738

Summary: Matmul was relying on linear infra which didn't support batched second argument. This adds support. Differential Revision: D84279595

) Summary: In our previous flow, we would replace ops with default variants, have a special fusion pass which constructs singleton tensors for a variety of fused quantized ops, and then we would call a replace ops to turn them into per-tensor-variants. I confirmed this was for legacy reasons, so a cleanup was much due. This diff directly replaces ops with the per-tensor variants and removes the pass which replaces singleton tensors with scalars. Reviewed By: zonglinpeng Differential Revision: D83873738

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 9, 2025

meta-codesync bot added fb-exported meta-exported labels Oct 9, 2025

DrJessop force-pushed the export-D83873738 branch from c434d1e to 3e65bea Compare October 10, 2025 00:08

DrJessop force-pushed the export-D83873738 branch from 3e65bea to 33a47e1 Compare October 10, 2025 16:40

DrJessop force-pushed the export-D83873738 branch from 33a47e1 to 43e31be Compare October 10, 2025 16:44

DrJessop force-pushed the export-D83873738 branch 2 times, most recently from f84f831 to 271538a Compare October 10, 2025 20:52

Andrew Grebenisan added 2 commits October 10, 2025 15:09

Support for batched matmul (pytorch#14956)

2f5c517

Summary: Matmul was relying on linear infra which didn't support batched second argument. This adds support. Differential Revision: D84279595

DrJessop force-pushed the export-D83873738 branch from 271538a to c805660 Compare October 10, 2025 22:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor quantizer: Only replace with per-tensor variants #14974

Refactor quantizer: Only replace with per-tensor variants #14974

DrJessop commented Oct 9, 2025

Uh oh!

pytorch-bot bot commented Oct 9, 2025 •

edited

Loading

Uh oh!

meta-codesync bot commented Oct 9, 2025

Uh oh!

github-actions bot commented Oct 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Refactor quantizer: Only replace with per-tensor variants #14974

Are you sure you want to change the base?

Refactor quantizer: Only replace with per-tensor variants #14974

Conversation

DrJessop commented Oct 9, 2025

Uh oh!

pytorch-bot bot commented Oct 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14974

Uh oh!

meta-codesync bot commented Oct 9, 2025

Uh oh!

github-actions bot commented Oct 10, 2025

This PR needs a release notes: label

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

pytorch-bot bot commented Oct 9, 2025 •

edited

Loading

This PR needs a `release notes:` label