
Move optimization passes from opt_level=0 to opt_level=1 (#18206)

Merged
meta-codesync[bot] merged 4 commits into main from export-D96766073 on Mar 19, 2026

Conversation

@mcremon-meta (Contributor) commented Mar 16, 2026

Summary:

Many passes in the cadence backend were incorrectly placed at opt_level=0,
with outdated comments claiming that ops like `mm`, `repeat`, `scalar_tensor`, and
`full_like` were 'not supported'. These ops have portable kernel fallbacks, so the
passes are optimizations, not correctness requirements (see the opt_level gating
sketch at the end of this summary).

This diff:

1. Moves 18 passes from opt_level=0 to opt_level=1:
   - replace_ops.py: ReplaceLogicalNotBooleanWhereWithWherePass,
     ReplaceSafeSoftmaxWithSoftmax, ReplaceSqueezeAndUnsqueezeWithViewPass,
     ReplaceFunctionallyEquivalentOpTargets, ReplaceMMWithAddMMPass,
     ReplaceConvolutionOptionalArgsWithConcreteArgsPass, ReplaceRepeatWithCatPass,
     ReplaceScalarTensorWithFullPass, ReplaceFullLikeWithFullPass,
     ReplaceInfArgInFullWithValuePass, ReplaceMatmulWithTransposedMatmulPass
   - remove_ops.py: RemoveCloneOpsTransformImported, RemoveDetachCopyPass,
     RemoveZeroSizedCatArgsPass, RemoveNopExpandOpPass, RemoveToOpsPass,
     RemoveAliasCopyOpPass
   - decompose_ops.py: DecomposeAtenApproxGeluPass
   - simplify_ops.py: SimplifySliceOpPass
2. Updates docstrings to remove incorrect 'not supported' claims and
   clarify these are optimizations with portable fallbacks available.

Reviewed By: ethansfng

Differential Revision: D96766073
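
To make the split concrete, here is a minimal sketch of opt_level-gated pass
selection. The pass names come from the list above, but `PASS_REGISTRY` and
`get_passes` are hypothetical illustrations, not the actual cadence backend API:

```python
# Hypothetical sketch of opt_level-gated pass selection; PASS_REGISTRY and
# get_passes are illustrative, not the real ExecuTorch cadence backend API.

PASS_REGISTRY = [
    # Optimization passes: the ops they rewrite have portable kernel
    # fallbacks, so they only need to run when optimizing (opt_level >= 1).
    ("ReplaceMMWithAddMMPass", 1),
    ("ReplaceRepeatWithCatPass", 1),
    ("ReplaceScalarTensorWithFullPass", 1),
    ("RemoveDetachCopyPass", 1),
    ("DecomposeAtenApproxGeluPass", 1),
    ("SimplifySliceOpPass", 1),
]

def get_passes(opt_level: int) -> list[str]:
    """Return the names of all passes enabled at the requested level."""
    return [name for name, level in PASS_REGISTRY if level <= opt_level]

# At opt_level=0 none of the moved passes run; portable kernels for mm,
# repeat, scalar_tensor, full_like, etc. handle those ops instead.
assert get_passes(0) == []
assert "ReplaceMMWithAddMMPass" in get_passes(1)
```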

@pytorch-bot commented Mar 16, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18206

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit 0626c21 with merge base 569cf41:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-cla bot added the CLA Signed label on Mar 16, 2026.
meta-codesync bot (Contributor) commented Mar 16, 2026

@mcremon-meta has exported this pull request. If you are a Meta employee, you can view the originating Diff in D96766073.

@github-actions commented:

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

@digantdesai (Contributor) left a comment:

Any OSS visible impact from this? Stamping to unblock you.

Summary:
Pull Request resolved: #18239

As titled. Should perform better, and should also allow removing some permutes when convolutions are also moved to channel last (see the sketch after this commit message).

Differential Revision: D96869747

Reviewed By: hsharma35
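
A quick sketch of why those permutes become removable: once the op between two
channels-last convolutions is itself channels-last, the surrounding layout
permutes sit back to back and compose to the identity. The `compose` helper
below is illustrative, not backend code:

```python
# Illustrative dimension-order arithmetic; compose() is a hypothetical
# helper, not part of the cadence backend.

NCHW_TO_NHWC = [0, 2, 3, 1]  # permute emitted after a channels-last conv
NHWC_TO_NCHW = [0, 3, 1, 2]  # permute emitted before a channels-first op

def compose(p: list[int], q: list[int]) -> list[int]:
    """Permutation equivalent to applying p first, then q."""
    return [p[i] for i in q]

# Once the middle op is also channels-last, the two permutes around it
# become adjacent and cancel, so a cleanup pass can delete both.
assert compose(NCHW_TO_NHWC, NHWC_TO_NCHW) == [0, 1, 2, 3]
```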
Summary:
Pull Request resolved: #18240

As titled. Calls into nnlib directly.

Differential Revision: D96874522

Reviewed By: hsharma35

…better (#18256)

Summary:
Pull Request resolved: #18256

As titled. It currently does not clean up as much as it should, and the pass can only handle single-input cases.

Result: permute count drops from 9 to 1 (the minimum by construction) on Wake Gesture; a toy sketch of the merging idea follows this commit message.

Differential Revision: D96940254

Reviewed By: abeakkas
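
A toy sketch of the merging idea, under the assumption that the pass walks
chains of consecutive permutes; the real pass operates on the export graph,
and per the summary it currently handles only single-input cases:

```python
# Toy model of merging a chain of consecutive permutes into at most one;
# the list-of-permutations representation is hypothetical.

def merge_permutes(chain: list[list[int]]) -> list[list[int]]:
    """Collapse consecutive permutes, dropping the chain if it is a no-op."""
    merged = None
    for p in chain:
        merged = p if merged is None else [merged[i] for i in p]
    if merged is None or merged == list(range(len(merged))):
        return []        # composed to the identity: every permute is removable
    return [merged]      # one residual permute, the minimum by construction

# Nine alternating layout permutes collapse to a single one, mirroring
# the 9 -> 1 result quoted above.
chain = [[0, 2, 3, 1], [0, 3, 1, 2]] * 4 + [[0, 2, 3, 1]]
assert merge_permutes(chain) == [[0, 2, 3, 1]]
```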
meta-codesync bot changed the title from 'Move optimization passes from opt_level=0 to opt_level=1' to 'Move optimization passes from opt_level=0 to opt_level=1 (#18206)' on Mar 19, 2026.
meta-codesync bot pushed a commit that referenced this pull request on Mar 19, 2026 (commit message identical to the summary above).
meta-codesync bot pushed a commit that referenced this pull request on Mar 19, 2026 (commit message identical to the summary above).

The merged commit message repeats the summary above, adding 'Pull Request resolved: #18206'.
meta-codesync bot merged commit bf2243a into main on Mar 19, 2026.
142 of 145 checks passed.
meta-codesync bot deleted the export-D96766073 branch on March 19, 2026 at 11:08.
mcremon-meta added a commit that referenced this pull request Mar 19, 2026

Labels

CLA Signed, fb-exported, meta-exported