[AOTI] Add a --max-seq-length option for export #1018

desertfire · 2024-08-07T13:35:15Z

Summary: This improves best tokens/sec from 73 to 85.

pytorch-bot · 2024-08-07T13:35:19Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1018

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 1be8432 with merge base ce41944 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Summary: This improves best tokens/sec from 73 to 85.

byjlw · 2024-08-07T22:25:14Z

cli.py

+        "--max-seq-length",
+        type=int,
+        default=None,
+        help="Set maximum length sequence when before calling torch.export",


set the default to 300 and update the help string so that it's clear what the default is (300)

That was my initial implementation. Then there was an issue when running eval.

When running eval, we use --dynamic-shapes which uses a larger max_seq_length, i.e. model.config.max_seq_length. But in theory, we should not stop user from calling eval with both options, something like --dynamic-shapes --max-seq-length 1000. When that happens, if args.max_seq_length has a default value, we will have no way to distinguish if args.max_seq_length is from a default value or from an intentional user overwriting.

byjlw · 2024-08-07T22:26:10Z

export.py

+            and not builder_args.dynamic_shapes
+        ):
+            print("Setting max_seq_length to 300 for DSO export.")
+            builder_args.max_seq_length = 300


you shouldn't need this if you set the default in the other file

I can add more details to the printout if that helps.

300 is used in specific cases, makes sense to not set it in argparse

Summary: This improves best tokens/sec from 73 to 85. Co-authored-by: Jack-Khuu <jack.khuu.7@gmail.com>

desertfire requested a review from Jack-Khuu August 7, 2024 13:35

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Aug 7, 2024

desertfire removed the request for review from Jack-Khuu August 7, 2024 15:09

[AOTI] Add a --max-seq-length option for export

76718e7

Summary: This improves best tokens/sec from 73 to 85.

desertfire requested review from Jack-Khuu and byjlw August 7, 2024 17:45

byjlw previously requested changes Aug 7, 2024

View reviewed changes

desertfire requested a review from byjlw August 8, 2024 20:02

Jack-Khuu approved these changes Aug 9, 2024

View reviewed changes

Merge branch 'main' into aoti_3

1be8432

Jack-Khuu merged commit 5aed7ae into pytorch:main Aug 9, 2024

vmpuri pushed a commit that referenced this pull request Aug 12, 2024

[AOTI] Add a --max-seq-length option for export (#1018)

9eba058

Summary: This improves best tokens/sec from 73 to 85. Co-authored-by: Jack-Khuu <jack.khuu.7@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AOTI] Add a --max-seq-length option for export #1018

[AOTI] Add a --max-seq-length option for export #1018

Uh oh!

desertfire commented Aug 7, 2024

Uh oh!

pytorch-bot bot commented Aug 7, 2024 •

edited

Loading

Uh oh!

byjlw Aug 7, 2024

Uh oh!

desertfire Aug 8, 2024

Uh oh!

byjlw Aug 7, 2024

Uh oh!

desertfire Aug 8, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[AOTI] Add a --max-seq-length option for export #1018

[AOTI] Add a --max-seq-length option for export #1018

Uh oh!

Conversation

desertfire commented Aug 7, 2024

Uh oh!

pytorch-bot bot commented Aug 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1018

✅ No Failures

Uh oh!

byjlw Aug 7, 2024

Choose a reason for hiding this comment

Uh oh!

desertfire Aug 8, 2024

Choose a reason for hiding this comment

Uh oh!

byjlw Aug 7, 2024

Choose a reason for hiding this comment

Uh oh!

desertfire Aug 8, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pytorch-bot bot commented Aug 7, 2024 •

edited

Loading