Allow Partitioner to Force Dynamic Linear Computation #5338
Summary:
Motivation
A current drawback of XNNPACK is that weights are duplicated across delegate instances when they do not solely belong to one partition. Ops like LSTM reuse the same few weights and biases across multiple linear nodes, so the LSTM weight/bias tensors get duplicated for every instance of linear, which can blow up the serialized model size.
XNNPACK supports dynamic linear, in which weights are supplied at runtime rather than packed ahead of time (AoT). This lets us force the partitioner to leave the weights out of the partition, so the XNNPACK delegate does not own them and therefore does not duplicate them. At the moment this is only supported for FP32 weights, but it gives us a knob to trade some performance for smaller file sizes.
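A minimal sketch of how this could look from the export side, assuming the partitioner exposes an option such as `force_fp32_dynamic_linear`; the flag name and the surrounding export flow below are illustrative, not taken verbatim from this PR:

```python
# Hypothetical usage sketch. `force_fp32_dynamic_linear` is an assumed
# partitioner option name; the rest of the export flow follows the usual
# ExecuTorch to_edge / to_backend path.
import torch
from executorch.backends.xnnpack.partition.xnnpack_partitioner import XnnpackPartitioner
from executorch.exir import to_edge


class TinyLinear(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(32, 32)

    def forward(self, x):
        return self.linear(x)


model = TinyLinear().eval()
example_inputs = (torch.randn(1, 32),)

# Export, lower to edge dialect, and delegate to XNNPACK. With the assumed
# flag set, FP32 linear weights stay outside the delegate partition and are
# handed to XNNPACK at runtime (dynamic linear) instead of being packed AoT,
# so shared weights are not duplicated per delegate instance.
exported = torch.export.export(model, example_inputs)
edge = to_edge(exported)
edge = edge.to_backend(XnnpackPartitioner(force_fp32_dynamic_linear=True))
executorch_program = edge.to_executorch()
```

The trade-off is the one described above: dynamic linear skips AoT weight packing, so execution may be somewhat slower, in exchange for not storing a separate copy of the weights inside every delegate payload that references them.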
Differential Revision: D62621998