This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Conversation

@desertfire (Contributor) commented Aug 5, 2024

Summary: The --dynamic-shapes option defaults to False. When the actual inputs have static shapes, exporting with static shapes lets more Inductor optimizations take effect down the line. This change by itself improves average tokens/sec from 29.60 to 33.43 on A100. Follow-up PRs will provide further perf gains.

Repro command:

python3 torchchat.py export llama3 --quantize '{"precision": {"dtype":"bfloat16"}, "executor":{"accelerator":"cuda"}}' --output-dso-path /tmp/model16.so && python3 torchchat.py generate llama3 --dso-path /tmp/model16.so --prompt "Once upon a time," --max-new-tokens 256 --device cuda --num-samples 3
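
For context on what the flag controls, here is a minimal, hypothetical sketch (a toy model, not torchchat's code) of how a --dynamic-shapes style option typically gates the dynamic_shapes argument of torch.export:

# Hypothetical sketch: gating torch.export's dynamic_shapes argument
# behind a flag (toy model, not torchchat's actual export path).
import torch
from torch.export import Dim, export

class TinyModel(torch.nn.Module):
    def forward(self, tokens, input_pos):
        return tokens + input_pos

# Example inputs with sequence length 2 so that dim can be marked dynamic.
example_inputs = (
    torch.tensor([[1, 2]], dtype=torch.long),
    torch.tensor([0, 1], dtype=torch.long),
)

use_dynamic_shapes = False  # mirrors --dynamic-shapes defaulting to False
if use_dynamic_shapes:
    seq = Dim("seq", min=2, max=2048)
    # Dim 1 of `tokens` and dim 0 of `input_pos` vary together.
    dynamic_shapes = ({1: seq}, {0: seq})
else:
    # Static shapes: Inductor can specialize kernels on the exact sizes.
    dynamic_shapes = None

ep = export(TinyModel(), example_inputs, dynamic_shapes=dynamic_shapes)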

@desertfire requested a review from Jack-Khuu August 5, 2024 15:08
@pytorch-bot bot commented Aug 5, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1011

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 1 Pending

As of commit 334654e with merge base 912917f:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) Aug 5, 2024
@Jack-Khuu (Contributor)

Great to see the perf gains. Is anything lost by switching over to static? Does model support change?

@Jack-Khuu requested a review from malfet August 5, 2024 19:24
@desertfire (Contributor, Author)

> Great to see the perf gains. Is anything lost by switching over to static? Does model support change?

ET export is doing the same, so it should be fine:

# Example inputs used by ET export: a single prompt token and its
# position, both with static shapes.
input = (
    torch.tensor([[1]], dtype=torch.long, device=device),
    torch.tensor([0], dtype=torch.long, device=device),
)
state_dict = model.state_dict()
# Infer the checkpoint dtype from the first entry in the state dict.
state_dict_dtype = state_dict[next(iter(state_dict))].dtype
target_precision = get_precision()
# No dynamic shapes: export specializes on the example input shapes.
dynamic_shapes = None
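
For reference, a rough, hypothetical sketch of how static example inputs like these would feed an AOTInductor compile of that era (toy model; torch._export.aot_compile was a private API and has since been superseded):

# Hypothetical sketch of the era's AOTInductor export path, not
# torchchat's actual code (torch._export.aot_compile is private).
import torch

class TinyModel(torch.nn.Module):
    def forward(self, tokens, input_pos):
        return tokens * 2 + input_pos

device = "cuda" if torch.cuda.is_available() else "cpu"
example_inputs = (
    torch.tensor([[1]], dtype=torch.long, device=device),
    torch.tensor([0], dtype=torch.long, device=device),
)

# dynamic_shapes=None treats the example shapes as exact, so Inductor
# can specialize its generated kernels on them.
so_path = torch._export.aot_compile(
    TinyModel(),
    example_inputs,
    dynamic_shapes=None,
    options={"aot_inductor.output_path": "/tmp/tiny_model.so"},
)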

@desertfire changed the title from "[AOTI] Change export to use static shapes" to "[AOTI] Add a --dynamic-shapes option to export" Aug 5, 2024
@Jack-Khuu (Contributor) commented Aug 6, 2024

Amazing, thanks for fixing this.

One last ask: can you put the repro commands for the numbers in your description?

@desertfire (Contributor, Author)

> Amazing, thanks for fixing this.
>
> One last ask: can you put the repro commands for the numbers in your description?

Done. We should add a benchmarking script so that everyone can run the same experiment.
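
(A hypothetical sketch of such a benchmarking helper; generate_fn is a placeholder callable returning the number of generated tokens, not a torchchat API.)

# Hypothetical benchmarking helper for average tokens/sec;
# generate_fn is a placeholder, not a torchchat API.
import time

def average_tokens_per_sec(generate_fn, prompt, max_new_tokens=256, num_samples=3):
    rates = []
    for _ in range(num_samples):
        start = time.perf_counter()
        num_tokens = generate_fn(prompt, max_new_tokens)
        rates.append(num_tokens / (time.perf_counter() - start))
    # Average tokens/sec over the sampled runs.
    return sum(rates) / len(rates)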

@desertfire merged commit 46e3ab7 into pytorch:main Aug 6, 2024
@desertfire deleted the aoti_1 branch August 6, 2024 02:40