Default max_seq_length to 128 for ExecuTorch export #1192

swolchok · 2024-09-24T17:59:27Z

With the current default behavior, performance for e.g. stories110Mwithout custom SDPA is bad because the QKV tensors are long (8192 in the last dim). Limiting the max sequence length remedies this.

[ghstack-poisoned]

swolchok · 2024-09-24T17:59:28Z

Stack from ghstack (oldest at bottom):

pytorch-bot · 2024-09-24T17:59:30Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1192

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit d702862 with merge base 04ea309 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]

swolchok added 6 commits September 24, 2024 10:59

Update

58a0519

[ghstack-poisoned]

Update

b592ed5

[ghstack-poisoned]

Update

c4ce634

[ghstack-poisoned]

Update

c1f3d29

[ghstack-poisoned]

Update

3d7723d

[ghstack-poisoned]

Update

04b8c09

[ghstack-poisoned]

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Sep 24, 2024

Jack-Khuu approved these changes Sep 24, 2024

View reviewed changes

swolchok added 6 commits September 24, 2024 11:03

Update

61f3cca

[ghstack-poisoned]

Update

7808b9a

[ghstack-poisoned]

Update

0d54555

[ghstack-poisoned]

Update

eebb8b7

[ghstack-poisoned]

Update

4dda7f3

[ghstack-poisoned]

Update

d702862

[ghstack-poisoned]

Base automatically changed from gh/swolchok/12/head to main September 24, 2024 19:19

swolchok merged commit c40c6bb into main Sep 24, 2024
100 checks passed

swolchok deleted the gh/swolchok/13/head branch September 24, 2024 19:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Default max_seq_length to 128 for ExecuTorch export #1192

Default max_seq_length to 128 for ExecuTorch export #1192

Uh oh!

swolchok commented Sep 24, 2024

Uh oh!

swolchok commented Sep 24, 2024 •

edited

Loading

Uh oh!

pytorch-bot bot commented Sep 24, 2024 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Default max_seq_length to 128 for ExecuTorch export #1192

Default max_seq_length to 128 for ExecuTorch export #1192

Uh oh!

Conversation

swolchok commented Sep 24, 2024

Uh oh!

swolchok commented Sep 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Sep 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1192

✅ No Failures

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

swolchok commented Sep 24, 2024 •

edited

Loading

pytorch-bot bot commented Sep 24, 2024 •

edited

Loading