Static attention: support local-global attention #13043
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13043
Note: Links to docs will display an error until the docs builds have been completed.
❌ 2 New Failures, 1 Unrelated Failure as of commit 1595b73 with merge base 3e70463.
NEW FAILURES: the following jobs have failed.
BROKEN TRUNK: the following job failed but was also present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This pull request was exported from Phabricator. Differential Revision: D79267644
Summary:
Pull Request resolved: pytorch#13043
Runtime: support different cache lengths for different layers.
Python: add the sliding-window cache update that already existed in the runtime.
Reviewed By: billmguo
Differential Revision: D79267644
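
For context, here is a minimal sketch of what a per-layer sliding-window KV cache update could look like. The names (`SlidingWindowKVCache`, `cache_len`, `update`) and shapes are illustrative assumptions, not the actual ExecuTorch API: new tokens are written into a fixed-size ring buffer, so local-attention layers can keep short caches while global layers keep full-length ones.

```python
# Hypothetical sketch of a per-layer sliding-window KV cache.
# Names and shapes are illustrative, not the ExecuTorch implementation.
import torch


class SlidingWindowKVCache:
    """Fixed-size cache that keeps only the most recent `cache_len` tokens.

    Writes go to a ring-buffer position, so a layer with a small local
    attention window can use a short cache while a global layer uses a
    long one.
    """

    def __init__(self, cache_len: int, n_heads: int, head_dim: int):
        self.cache_len = cache_len
        self.pos = 0  # next write position in the ring buffer
        self.k = torch.zeros(1, n_heads, cache_len, head_dim)
        self.v = torch.zeros(1, n_heads, cache_len, head_dim)

    def update(self, new_k: torch.Tensor, new_v: torch.Tensor):
        """Append new_k/new_v (shape [1, n_heads, seq, head_dim]),
        overwriting the oldest entries once the window is full."""
        seq_len = new_k.size(2)
        for i in range(seq_len):  # per-token write, kept simple for clarity
            slot = (self.pos + i) % self.cache_len
            self.k[:, :, slot] = new_k[:, :, i]
            self.v[:, :, slot] = new_v[:, :, i]
        self.pos = (self.pos + seq_len) % self.cache_len
        return self.k, self.v


# Different cache lengths for different layers: short windows for local
# layers, a full-context cache for global layers (values made up here).
cache_lens = [256 if layer % 4 else 2048 for layer in range(8)]
caches = [SlidingWindowKVCache(n, n_heads=8, head_dim=64) for n in cache_lens]
```

A ring-buffer write avoids shifting the whole cache on every step, but the attention mask must then account for the rotated token order, which is why the Python export path needs update logic matching what the runtime already does.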