NXP Backend: Add pass to remove unnecessary Quantize/Dequantize nodes. #15148

MartinPavella · 2025-10-15T12:05:28Z

Summary

This PR adds an edge dialect pre-processing pass that removes some Q/DQ nodes. This enables some non-delegated nodes (which run on the CPU) to run directly in int8 and avoid the QDQ compute overhead. This improves inference speed by eliminating the need to artificially quantize and dequantize input and output values.
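
To illustrate the idea, here is a minimal sketch on a toy graph. It is not the code from this PR and does not use the ExecuTorch or `torch.fx` APIs. The `Node` class, the `INT8_SAFE_OPS` set, and the rule that a cluster is removed only when the dequantize and quantize parameters match are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Node:
    name: str
    op: str  # "input", "output", "quantize", "dequantize", or a compute op
    inputs: list = field(default_factory=list)  # names of producer nodes
    qparams: Optional[tuple] = None  # (scale, zero_point), only for Q/DQ nodes


# Hypothetical set of ops that only move data, so they behave the same
# on int8 values as on the dequantized float values.
INT8_SAFE_OPS = {"view", "permute"}


def remove_qdq_clusters(nodes):
    """Drop `dequantize -> op -> quantize` wrappers around int8-safe ops."""
    by_name = {n.name: n for n in nodes}
    users = {n.name: [] for n in nodes}
    for n in nodes:
        for producer in n.inputs:
            users[producer].append(n)

    removed = set()
    for op in nodes:
        if op.op not in INT8_SAFE_OPS or len(op.inputs) != 1:
            continue
        dq = by_name[op.inputs[0]]
        consumers = users[op.name]
        # The op must be fed by a dequantize that has no other user ...
        if dq.op != "dequantize" or len(users[dq.name]) != 1:
            continue
        # ... and must feed exactly one quantize.
        if len(consumers) != 1 or consumers[0].op != "quantize":
            continue
        q = consumers[0]
        # Assumption: differing parameters mean a real requantization, so keep it.
        if dq.qparams != q.qparams:
            continue
        # Rewire: the op reads the int8 producer, and q's users read the op.
        op.inputs = list(dq.inputs)
        for user in users[q.name]:
            user.inputs = [op.name if i == q.name else i for i in user.inputs]
        removed.update({dq.name, q.name})

    return [n for n in nodes if n.name not in removed]


graph = [
    Node("x", "input"),
    Node("dq", "dequantize", ["x"], (0.5, 0)),
    Node("view", "view", ["dq"]),
    Node("q", "quantize", ["view"], (0.5, 0)),
    Node("out", "output", ["q"]),
]
optimized = remove_qdq_clusters(graph)
```

In this toy graph, `view` ends up reading `x` directly and `out` reads `view`, so the quantize and dequantize steps around the CPU op are no longer executed.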

Test plan

Unit tests provided.

cc @robert-kalmar

pytorch-bot · 2025-10-15T12:05:32Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15148

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit b2831c3 with merge base 3b1aeda:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / unittest-arm-backend-with-no-fvp (test_pytest_models) / linux-job (gh) (matched linux rule in flaky-rules.json)
The runner has received a shutdown signal. This can happen when the runner service is stopped, or a manually started runner is canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

MartinPavella · 2025-11-18T13:50:05Z

@pytorchbot label "module: nxp" "release notes: nxp"

robert-kalmar · 2025-11-27T10:45:16Z

Update the Summary; the pass has a different intention:

This PR adds an edge dialect pre-processing pass that removes some Q/DQ nodes. This enables some non-delegated nodes (which run on the CPU) to run directly in int8 and avoid the QDQ compute overhead. This improves inference speed by eliminating the need to artificially quantize and dequantize input and output values.

meta-cla bot added the CLA Signed label Oct 15, 2025

digantdesai added the module: nxp label Oct 27, 2025

roman-janik-nxp changed the title ~~NXP Backend: Add padd to remove unnecessary Quantize/Dequantize nodes.~~ NXP Backend: Add pass to remove unnecessary Quantize/Dequantize nodes. Oct 30, 2025

MartinPavella force-pushed the upstream/main-nxp/EIEX-519-upstream-removeadditionalqdqclusters-pass branch from aa651f1 to 972ad89 on November 18, 2025 13:49

pytorch-bot bot added the release notes: nxp label Nov 18, 2025

MartinPavella force-pushed the upstream/main-nxp/EIEX-519-upstream-removeadditionalqdqclusters-pass branch from 972ad89 to 66e43e8 on November 20, 2025 08:29

MartinPavella requested a review from roman-janik-nxp November 20, 2025 08:30

MartinPavella marked this pull request as ready for review November 20, 2025 08:30

MartinPavella requested a review from robert-kalmar as a code owner November 20, 2025 08:30

MartinPavella force-pushed the upstream/main-nxp/EIEX-519-upstream-removeadditionalqdqclusters-pass branch from 66e43e8 to b093129 on November 20, 2025 09:05

roman-janik-nxp reviewed Nov 20, 2025

backends/nxp/tests/test_turning_batch_first_gru_to_time_major.py (outdated, resolved)

MartinPavella requested a review from roman-janik-nxp November 24, 2025 08:44

MartinPavella force-pushed the upstream/main-nxp/EIEX-519-upstream-removeadditionalqdqclusters-pass branch 4 times, most recently from d5ba591 to e529f83 on November 25, 2025 09:11

roman-janik-nxp approved these changes Nov 25, 2025

MartinPavella force-pushed the upstream/main-nxp/EIEX-519-upstream-removeadditionalqdqclusters-pass branch from e529f83 to b276902 on November 26, 2025 07:25

robert-kalmar approved these changes Nov 27, 2025

NXP backend: Add RemoveAdditionalQDQClustersPass.

b2831c3

robert-kalmar force-pushed the upstream/main-nxp/EIEX-519-upstream-removeadditionalqdqclusters-pass branch from b276902 to b2831c3 on December 2, 2025 08:19

MartinPavella merged commit c00d726 into pytorch:main Dec 3, 2025
140 of 141 checks passed
