Add shared fusion infrastructure and QuantFusionPass (#19724) by ethansfng · Pull Request #19724 · pytorch/executorch

ethansfng · 2026-05-21T20:16:24Z

Summary:

Add infrastructure for per-pattern `fuse()` methods on Cadence `QuantizationPattern`:

- Add `anchor_ops()` (default: `tuple(partition_types())`) and `fuse()` (default: `None`) to the `QuantizationPattern` base class
- Add shared fusion helpers: `_get_dequant`, `_find_quant_user`, `_insert_fused_op`, `_maybe_route_depthwise_conv1d`, `_fuse_conv`, `_fuse_linear`, `_fuse_matmul`
- Add `QuantFusionPass` to `compiler_funcs.py`: a shared executor that iterates patterns, matches `anchor_ops()`, and calls `fuse()`, with debug logging and dead code elimination
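
To make the division of labor concrete, here is a minimal, self-contained sketch of the flow described above. It does not use the real Cadence or `torch.fx` APIs: `Node`, `Pattern`, `LinearPattern`, and `run_fusion` are hypothetical stand-ins, the `fuse(nodes, anchor)` signature is assumed, and "default: `None`" is read here as a base-class `fuse()` that returns `None`.

```python
import logging
from dataclasses import dataclass, field

logger = logging.getLogger(__name__)


@dataclass(eq=False)
class Node:
    """Toy graph node (hypothetical, not torch.fx.Node)."""
    op: str
    inputs: list = field(default_factory=list)


class Pattern:
    """Toy stand-in for QuantizationPattern."""

    def partition_types(self):
        return []

    def anchor_ops(self):
        # Default described in the PR: tuple(partition_types()).
        return tuple(self.partition_types())

    def fuse(self, nodes, anchor):
        # Assumed reading of "default: None": no fusion is performed.
        return None


class LinearPattern(Pattern):
    def partition_types(self):
        return ["linear"]

    def fuse(self, nodes, anchor):
        # Bypass dequant nodes feeding the anchor, then rewrite it as a fused op.
        anchor.inputs = [i.inputs[0] if i.op == "dequant" else i for i in anchor.inputs]
        anchor.op = "quantized_linear"
        return anchor


def run_fusion(nodes, outputs, patterns):
    """Iterate patterns, match anchor ops, call fuse(), then drop dead nodes."""
    for pattern in patterns:
        anchors = pattern.anchor_ops()
        for node in list(nodes):
            if node.op in anchors:
                matched_op = node.op
                fused = pattern.fuse(nodes, node)
                logger.debug(
                    "pattern=%s anchor=%s fused=%s",
                    type(pattern).__name__, matched_op, fused is not None,
                )
    # Dead code elimination: keep only nodes reachable from the outputs.
    live, stack = set(), list(outputs)
    while stack:
        n = stack.pop()
        if id(n) not in live:
            live.add(id(n))
            stack.extend(n.inputs)
    return [n for n in nodes if id(n) in live]
```

In this toy version, the dequant nodes become unreachable once the anchor is rewired, so the final reachability sweep removes them. Whether the real pass sweeps once at the end or after each fusion is not stated in the summary.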

Differential Revision: D105728137

pytorch-bot · 2026-05-21T20:16:28Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19724

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 2 Unrelated Failures

As of commit f91d610 with merge base ec76470:

NEW FAILURE - The following job has failed:

pull / test-parakeet-xnnpack-linux / linux-job (gh)
RuntimeError: Command docker exec -t c2b680200c75a0bedd6a3aa95f794cb7324330648cab93d99095a7aa759c47de /exec failed with exit code 1

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest / macos / macos-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest-editable / macos / macos-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

linux-foundation-easycla · 2026-05-21T20:16:31Z

The committers listed above are authorized under a signed CLA.

✅ login: ethansfng / name: Ethan Ng (639d11e)

meta-codesync · 2026-05-21T20:16:32Z

@ethansfng has exported this pull request. If you are a Meta employee, you can view the originating Diff in D105728137.

github-actions · 2026-05-21T20:17:27Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with `release notes:`. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.


…19743)

Summary: torchao's `convert_pt2e` adds `out_dtype` kwargs to dequant nodes for bf16 models. `cadence::dequantize_per_tensor` doesn't support this kwarg (it hardcodes float32 output), so `ReplacePT2DequantWithCadenceDequantPass` crashes when it forwards kwargs blindly to the cadence op.

Strip `out_dtype` from kwargs before creating the cadence dequant node, and insert an `aten.to.dtype` cast after it to preserve the original output dtype semantics.

Differential Revision: D105630451
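
The strip-then-cast idea from this commit message can be sketched in plain Python. This is not the code from the pass: `rewrite_dequant` is a hypothetical helper, ops are modeled as `(name, args, kwargs)` tuples rather than graph nodes, dtypes are plain strings, and casting only when `out_dtype` was present is an assumption.

```python
def rewrite_dequant(args, kwargs):
    """Plan the replacement ops for one dequant node (toy model, no torch)."""
    kwargs = dict(kwargs)  # copy so the caller's dict is not mutated
    # The cadence op does not accept out_dtype, so remove it before forwarding.
    out_dtype = kwargs.pop("out_dtype", None)
    ops = [("cadence::dequantize_per_tensor", args, kwargs)]
    if out_dtype is not None:
        # Restore the requested output dtype with an explicit cast afterwards.
        ops.append(("aten.to.dtype", (out_dtype,), {}))
    return ops
```

For example, `rewrite_dequant(("x", 0.1, 0), {"out_dtype": "bfloat16"})` yields the cadence dequant with empty kwargs followed by a cast to `"bfloat16"`, while a call without `out_dtype` yields only the dequant.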

Summary: Add infrastructure for per-pattern `fuse()` methods on Cadence `QuantizationPattern`:

- Add `anchor_ops()` (default: `tuple(partition_types())`) and `fuse()` (default: `None`) to `QuantizationPattern` base class
- Add shared fusion helpers: `_get_dequant`, `_find_quant_user`, `_insert_fused_op`, `_maybe_route_depthwise_conv1d`, `_fuse_conv`, `_fuse_linear`, `_fuse_matmul`
- Add `QuantFusionPass` to `compiler_funcs.py`: a shared executor that iterates patterns, matches `anchor_ops()`, and calls `fuse()`, with debug logging and dead code elimination

Differential Revision: D105728137

meta-cla Bot added the CLA Signed label May 21, 2026

meta-codesync Bot added the fb-exported and meta-exported labels May 21, 2026

meta-codesync Bot changed the title ~~Add shared fusion infrastructure and QuantFusionPass~~ Add shared fusion infrastructure and QuantFusionPass (#19724) May 21, 2026

ethansfng force-pushed the export-D105728137 branch from 639d11e to 7bfa849 Compare May 21, 2026 20:27

ethansfng force-pushed the export-D105728137 branch from 7bfa849 to 4f270a7 Compare May 21, 2026 21:09

ethansfng force-pushed the export-D105728137 branch from 4f270a7 to c257454 Compare May 22, 2026 18:48

ethansfng added 2 commits May 22, 2026 17:33

ethansfng force-pushed the export-D105728137 branch from c257454 to f91d610 Compare May 23, 2026 00:34

ethansfng wants to merge 2 commits into pytorch:main from ethansfng:export-D105728137
