NXP backend: Add preprocessing pass to split multilayer GRU #13757
Conversation
@pytorchbot label "module: nxp" "release notes: nxp"
passes: list[PassType] = passes or [
    FuseBatchNormWithConvPass(),
    FuseBatchNormWithLinearPass(),
    SplitGRUBasedOnNumLayers(),
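For context, a minimal skeleton of what a pass registered in this list can look like, built on torch.fx's pass infrastructure. This is an illustrative sketch, not the actual `SplitGRUBasedOnNumLayers` implementation; the class name and the rewrite placeholder are assumptions.

```python
# Illustrative sketch only -- not the real SplitGRUBasedOnNumLayers.
# It shows the shape of an aten-dialect pass: find aten.gru.input nodes
# whose num_layers argument (index 4 in the op schema) is > 1, then
# rewrite them in place.
import torch
from torch.fx.passes.infra.pass_base import PassBase, PassResult


class SplitGRUSketch(PassBase):  # hypothetical name
    def call(self, graph_module: torch.fx.GraphModule) -> PassResult:
        modified = False
        for node in list(graph_module.graph.nodes):
            if (
                node.op == "call_function"
                and node.target == torch.ops.aten.gru.input
                and isinstance(node.args[4], int)
                and node.args[4] > 1
            ):
                # ... rewrite `node` into a chain of single-layer
                # aten.gru.input nodes here ...
                modified = True
        if modified:
            graph_module.graph.lint()
            graph_module.recompile()
        return PassResult(graph_module, modified)
```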
Do we do it here in aten because we want to support quantization?
We do it in the aten dialect because it allows the pass to focus only on the transformation, without worrying about quantization. This keeps the implementation simpler, and if the quantization requirements of GRU (or the other ops) ever change, we only need to update the quantizer (otherwise we would also have to update this pass).
We do it here for two reasons:
- The Neutron NPU supports GRU as a primitive operation in Neutron IR, so for the Neutron NPU we do not need or want ExecuTorch to decompose it into primitive ops.
- The Neutron NPU supports GRU, but only single-layer. We therefore transform the multilayer GRU into a sequence of single-layer GRUs, perform the quantization, and obtain a graph that can be represented in Neutron IR, including the proper quantization parameters for the inputs and outputs of the individual GRU layers (a numerical sketch of this equivalence follows below).

Note: we will preserve the GRU op in the to_edge transformation (coming change).
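A quick way to see the equivalence the second point relies on is to rebuild a 2-layer `nn.GRU` as two chained single-layer GRUs and compare the results numerically. This is a standalone sketch with illustrative sizes, not code from the PR.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
multi = nn.GRU(input_size=8, hidden_size=16, num_layers=2)

# Two single-layer GRUs; layer 1 consumes layer 0's hidden-size output.
layer0 = nn.GRU(input_size=8, hidden_size=16, num_layers=1)
layer1 = nn.GRU(input_size=16, hidden_size=16, num_layers=1)

# Copy the per-layer weights of the multilayer GRU into the chain.
with torch.no_grad():
    for name in ("weight_ih", "weight_hh", "bias_ih", "bias_hh"):
        getattr(layer0, f"{name}_l0").copy_(getattr(multi, f"{name}_l0"))
        getattr(layer1, f"{name}_l0").copy_(getattr(multi, f"{name}_l1"))

x = torch.randn(5, 3, 8)    # (seq_len, batch, input_size)
h0 = torch.zeros(2, 3, 16)  # (num_layers, batch, hidden_size)

out_multi, h_multi = multi(x, h0)
out0, h_out0 = layer0(x, h0[0:1])    # each layer gets its slice of h0
out1, h_out1 = layer1(out0, h0[1:2])

assert torch.allclose(out_multi, out1, atol=1e-6)
assert torch.allclose(h_multi, torch.cat([h_out0, h_out1]), atol=1e-6)
```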
…ltiple operators. The `aten.gru.input` op has a `num_layers` parameter. For values > 1, it represents multiple `aten.gru.input` operators chained together. The introduced pass can split the original GRU into a chain of simpler GRU nodes.
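A small sketch of the bookkeeping such a split implies. For a unidirectional GRU with biases, the flat `params` list of `aten.gru.input` holds four tensors per layer in `nn.GRU`'s flat-weights order, and the initial hidden state stacks one slice per layer, so each single-layer node takes a slice of both. The helper name below is hypothetical; bidirectional and bias-free layouts differ.

```python
import torch

# Hypothetical helper: slice out the per-layer arguments for layer k.
# params layout per layer (biased, unidirectional):
#   [weight_ih, weight_hh, bias_ih, bias_hh]
def single_layer_args(hx: torch.Tensor, params: list, layer: int):
    hx_k = hx[layer : layer + 1]  # (1, batch, hidden_size)
    params_k = params[4 * layer : 4 * (layer + 1)]
    return hx_k, params_k
```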
Summary

This PR introduces a pre-processing pass on the aten dialect level, which splits `gru` nodes with `num_layers > 1` into an equivalent sequence of single-layer `gru` nodes.

Test plan

Unit tests are provided in `backends/nxp/tests/test_gru_splitting.py`. A hedged sketch of such a test follows below.
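As an illustration of what such a test can check, assuming torch.export keeps `aten.gru.input` at the aten level, as this PR's pass expects. This is not the PR's actual test code.

```python
# Sketch of a detection-level test: export a 2-layer GRU and confirm the
# aten graph contains one aten.gru.input node with num_layers == 2 --
# the node the splitting pass targets.
import torch


class TwoLayerGRU(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.gru = torch.nn.GRU(input_size=8, hidden_size=16, num_layers=2)

    def forward(self, x, h0):
        return self.gru(x, h0)


ep = torch.export.export(
    TwoLayerGRU(), (torch.randn(5, 3, 8), torch.zeros(2, 3, 16))
)
gru_nodes = [
    n
    for n in ep.graph.nodes
    if n.op == "call_function" and n.target == torch.ops.aten.gru.input
]
assert len(gru_nodes) == 1
assert gru_nodes[0].args[4] == 2  # num_layers, per the gru.input schema
```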
cc @robert-kalmar @roman-janik-nxp @StrycekSimon @jirioc