Forward rank and world size info to Torchbench models when using dynamo runner #108438

xmfan · 2023-09-01T18:46:22Z

Adds support for passing rank and world_size to a torchbench model via its extra_args parameter: https://github.com/pytorch/benchmark/blob/main/torchbenchmark/util/model.py#L83C80-L83C90

This is used by models that are distributed across multiple GPUs, e.g. simple_gpt (pytorch/benchmark#1867).
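A minimal sketch of the forwarding described above, assuming the runner encodes the two values as CLI-style tokens and the model parses them back. The flag spellings `--rank` and `--world_size`, and the helper names `build_extra_args` and `parse_distributed_args`, are assumptions for illustration and not taken from the PR diff.

```python
import argparse
from types import SimpleNamespace


def build_extra_args(args):
    """Runner side: turn rank/world_size into CLI-style tokens, if both are set."""
    extra_args = []
    if hasattr(args, "rank") and hasattr(args, "world_size"):
        # Assumed flag spelling; the model must parse the same names.
        extra_args += ["--rank", str(args.rank), "--world_size", str(args.world_size)]
    return extra_args


def parse_distributed_args(extra_args):
    """Model side (hypothetical helper): recover rank/world_size, ignoring other flags."""
    parser = argparse.ArgumentParser(add_help=False)
    parser.add_argument("--rank", type=int, default=0)
    parser.add_argument("--world_size", type=int, default=1)
    # parse_known_args tolerates tokens meant for other consumers of extra_args.
    known, _unknown = parser.parse_known_args(extra_args)
    return known.rank, known.world_size


# Stand-in for the runner's parsed arguments in one worker process.
runner_args = SimpleNamespace(rank=1, world_size=4)
tokens = build_extra_args(runner_args)
rank, world_size = parse_distributed_args(tokens)
```

When neither attribute is present, `build_extra_args` returns an empty list, so single-process runs would see the defaults of rank 0 and world size 1 in this sketch.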

Also adds an option to skip multiprocess-only GPU models.
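One way such a skip check could look, as a sketch only. The set name `MULTIPROCESS_ONLY_GPU_MODELS` and the function `should_skip` are hypothetical; the conditions follow the commit title in this PR, "skip simple_gpt for cpu, train and single gpu modes".

```python
# Hypothetical set; this PR only names simple_gpt as a multiprocess-only GPU model.
MULTIPROCESS_ONLY_GPU_MODELS = {"simple_gpt"}


def should_skip(model_name, device, training, multiprocess):
    """Return True when a multi-GPU-only model is requested in an unsupported mode."""
    if model_name not in MULTIPROCESS_ONLY_GPU_MODELS:
        return False
    # Unsupported: CPU runs, training runs, and single-GPU (non-multiprocess) runs.
    return device != "cuda" or training or not multiprocess
```

With this shape, `should_skip("simple_gpt", "cuda", False, True)` is the only combination that lets simple_gpt run, while other models are never skipped by this check.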

Tested via `python benchmarks/dynamo/torchbench.py -d cuda --output=benchmark_logs/performance.csv --inference --performance --timing --print-memory --multiprocess --only simple_gpt`

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @chenyang78 @aakhundov @kadeng @anijain2305

pytorch-bot · 2023-09-01T18:46:24Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/108438

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 20aad90 with merge base 8851603:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

xmfan · 2023-09-05T20:43:42Z

@pytorchbot label "topic: not user facing"

xmfan · 2023-09-05T20:50:43Z

Failures about unhandled extra_args should be resolved once the changes to torchbenchmark/util/extra_args.py (https://github.com/pytorch/benchmark/pull/1867/files#diff-6ccf656b90a64ee9ee4e55aec794320710e717b65271baeae74e69940524bb6a) land.

H-Huang · 2023-09-05T21:56:27Z

benchmarks/dynamo/common.py

                try:
                    with tqdm(desc="loading model"):
+                        extra_args = []
+                        if hasattr(args, "rank") and hasattr(args, "world_size"):


I think we also need to update the hash with your torchbench changes once they land, in order for the CI to pick them up:

https://github.com/pytorch/pytorch/blob/main/.github/ci_commit_pins/torchbench.txt

Summary: Adds simple_gpt + DTensor implemented in meta-pytorch/simple_gpt#7 to torchbench

Tested via `python benchmarks/dynamo/torchbench.py -d cuda --output-directory=benchmark_logs --output=performance.csv --inference --performance --timing --print-memory --multiprocess --nothing --only simple_gpt`.

Note: --nothing is used here to disable compile, since DTensor + compile isn't yet supported in main

```
dev,name,batch_size,speedup,abs_latency,compilation_latency,compression_ratio,eager_peak_mem,dynamo_peak_mem,calls_captured,unique_graphs,graph_breaks,unique_graph_breaks
cuda,simple_gpt,1,0.966153,196.819773,-0.059319,1.000000,4.576880,4.576880,0,0,0,0
cuda,simple_gpt,1,0.967389,196.608152,-0.058833,1.000000,4.577404,4.577404,0,0,0,0
cuda,simple_gpt,1,0.973152,196.093583,-0.059316,1.000000,4.593133,4.593133,0,0,0,0
cuda,simple_gpt,1,0.973087,196.124046,-0.075580,1.000000,4.611483,4.611483,0,0,0,0
cuda,simple_gpt,1,0.967908,193.998484,-0.040192,1.000000,4.593133,4.593133,0,0,0,0
cuda,simple_gpt,1,0.968949,193.798088,-0.028878,1.000000,4.593133,4.593133,0,0,0,0
```

2 changes were required to the model:
- decorate torch.no_grad() on the caches. Previously this was done outside the model: the entire eval call was wrapped in a torch.no_grad() context. After using torchbench, I noticed that even in inference-only mode, we don't disable gradient calculations
- rank/world size: added support from the torchbench side in pytorch/pytorch#108438 and updated the model to fetch them from the provided extra_args

Pull Request resolved: #1867
Reviewed By: msaroufim
Differential Revision: D49065244
Pulled By: xmfan
fbshipit-source-id: d4709fa3997c6a25c75e87eff7c13492b370b1af
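For reading the reported rows at a glance, here is a small sketch that aggregates them with the standard library. The CSV text is copied from the summary above; the variable names are illustrative, and the units of `abs_latency` are not stated in the source, so none are assumed here.

```python
import csv
import io
from statistics import mean

# Rows copied verbatim from the benchmark output quoted in the summary.
RESULTS_CSV = """\
dev,name,batch_size,speedup,abs_latency,compilation_latency,compression_ratio,eager_peak_mem,dynamo_peak_mem,calls_captured,unique_graphs,graph_breaks,unique_graph_breaks
cuda,simple_gpt,1,0.966153,196.819773,-0.059319,1.000000,4.576880,4.576880,0,0,0,0
cuda,simple_gpt,1,0.967389,196.608152,-0.058833,1.000000,4.577404,4.577404,0,0,0,0
cuda,simple_gpt,1,0.973152,196.093583,-0.059316,1.000000,4.593133,4.593133,0,0,0,0
cuda,simple_gpt,1,0.973087,196.124046,-0.075580,1.000000,4.611483,4.611483,0,0,0,0
cuda,simple_gpt,1,0.967908,193.998484,-0.040192,1.000000,4.593133,4.593133,0,0,0,0
cuda,simple_gpt,1,0.968949,193.798088,-0.028878,1.000000,4.593133,4.593133,0,0,0,0
"""

rows = list(csv.DictReader(io.StringIO(RESULTS_CSV)))
mean_speedup = mean(float(r["speedup"]) for r in rows)
mean_latency = mean(float(r["abs_latency"]) for r in rows)
max_eager_mem = max(float(r["eager_peak_mem"]) for r in rows)

print(f"{len(rows)} rows, mean speedup {mean_speedup:.4f}, "
      f"mean abs_latency {mean_latency:.2f}, max eager_peak_mem {max_eager_mem:.3f}")
```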

xmfan · 2023-09-14T16:54:59Z

@pytorchbot merge

pytorchmergebot · 2023-09-14T16:57:14Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

github-actions bot added module: dynamo ciflow/inductor labels Sep 1, 2023

xmfan force-pushed the xmfan/distributed_torchbench branch 2 times, most recently from 39b6fcc to e5ba94d Compare September 1, 2023 21:33

xmfan requested review from Chillee and H-Huang September 1, 2023 21:37

xmfan changed the title ~~wip torchbench~~ Forward rank and world size info to Torchbench models when using dynamo runner Sep 1, 2023

xmfan marked this pull request as ready for review September 1, 2023 21:38

Chillee approved these changes Sep 5, 2023

xmfan mentioned this pull request Sep 5, 2023

Add DTensor LLaMA inference model: simple_gpt pytorch/benchmark#1867

Closed

pytorch-bot bot added the topic: not user facing topic category label Sep 5, 2023

H-Huang reviewed Sep 5, 2023

xmfan requested a review from a team as a code owner September 8, 2023 21:41

xmfan added 5 commits September 13, 2023 11:23

Forward rank and world size to torchbench model

272185a

address lintrunner suggestions

f71587c

add unused extra_args to huggingface and timms

3aa6e1c

update torchbench commit pin

301596d

skip simple_gpt for cpu, train and single gpu modes

20aad90

xmfan force-pushed the xmfan/distributed_torchbench branch from 266257b to 20aad90 Compare September 13, 2023 21:00

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Sep 14, 2023

pytorchmergebot added the merging label Sep 14, 2023

pytorchmergebot added Merged and removed merging labels Sep 14, 2023

pytorchmergebot closed this in 54c5f47 Sep 14, 2023

xmfan mentioned this pull request Sep 20, 2023

Fix torchbench --multiprocess #109657

Closed

github-actions bot deleted the xmfan/distributed_torchbench branch March 23, 2025 02:14

4 participants
