Skip to content

Conversation

mgiordy
Copy link
Contributor

@mgiordy mgiordy commented Oct 10, 2025

Summary

This diff includes a general and HiFi4 optimized GRU operator.
Specifically, it adds both a standard GRU implementation and a version optimized for HiFi4 DSPs, ensuring better performance on supported hardware.


#hthtemplate

Reviewed By: mcremon-meta

Differential Revision: D81703253

Copy link

pytorch-bot bot commented Oct 10, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15011

Note: Links to docs will display an error until the docs builds have been completed.

❗ 2 Active SEVs

There are 2 currently active SEVs. If your PR is affected, please view them below:

❌ 4 New Failures, 3 Unrelated Failures

As of commit b44d355 with merge base 09eac16 (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Copy link

meta-codesync bot commented Oct 10, 2025

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating Diff in D81703253.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 10, 2025
Copy link

This PR needs a release notes: label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Oct 10, 2025
Summary:

# Context

With the goal of porting mHML on Executorch, a few operators are missing.
The main focus is on improving performance for the operators used by the model.

# Summary

This diff includes a general and HiFi4 optimized GRU operator.
Specifically, it adds both a standard GRU implementation and a version optimized for HiFi4 DSPs, ensuring better performance on supported hardware.


---
#hthtemplate

Reviewed By: skrtskrtfb, mcremon-meta

Differential Revision: D81703253
mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Oct 10, 2025
Summary:

# Context

With the goal of porting mHML on Executorch, a few operators are missing.
The main focus is on improving performance for the operators used by the model.

# Summary

This diff includes a general and HiFi4 optimized GRU operator.
Specifically, it adds both a standard GRU implementation and a version optimized for HiFi4 DSPs, ensuring better performance on supported hardware.


---
#hthtemplate

Reviewed By: skrtskrtfb, mcremon-meta

Differential Revision: D81703253
mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Oct 11, 2025
Summary:

# Context

With the goal of porting mHML on Executorch, a few operators are missing.
The main focus is on improving performance for the operators used by the model.

# Summary

This diff includes a general and HiFi4 optimized GRU operator.
Specifically, it adds both a standard GRU implementation and a version optimized for HiFi4 DSPs, ensuring better performance on supported hardware.


---
#hthtemplate

Reviewed By: skrtskrtfb, mcremon-meta

Differential Revision: D81703253
Summary:

# Context

With the goal of porting mHML on Executorch, a few operators are missing.
The main focus is on improving performance for the operators used by the model.

# Summary

This diff includes a general and HiFi4 optimized GRU operator.
Specifically, it adds both a standard GRU implementation and a version optimized for HiFi4 DSPs, ensuring better performance on supported hardware.


---
#hthtemplate

Reviewed By: skrtskrtfb, mcremon-meta

Differential Revision: D81703253
@meta-codesync meta-codesync bot merged commit 703d25a into pytorch:main Oct 12, 2025
105 of 114 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported meta-exported

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants