Improvements for DDP Optimizer #87549

wconstab · 2022-10-22T14:50:44Z

Stack from ghstack (oldest at bottom):

adds support for 'first_bucket_cap' arg, to align bucketing more precisely
with DDP, which may start a smaller first bucket
refactors the bucket splitting logic to be cleaner
adds pretty-print for bucket info, and a way to access bucket info
from the DDPOptimizer class from a test case or benchmark
dumps debug logs to stdout

cc @jansel @lezcano @fdrocha @mlazos @soumith @voznesenskym @yanboliang

- adds support for 'first_bucket_cap' arg, to align bucketing more precisely with DDP, which may start a smaller first bucket - refactors the bucket splitting logic to be cleaner - adds pretty-print for bucket info, and a way to access bucket info from the DDPOptimizer class from a test case or benchmark - dumps debug logs to stdout [ghstack-poisoned]

pytorch-bot · 2022-10-22T14:50:47Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/87549

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit fa66c39:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

- adds support for 'first_bucket_cap' arg, to align bucketing more precisely with DDP, which may start a smaller first bucket - refactors the bucket splitting logic to be cleaner - adds pretty-print for bucket info, and a way to access bucket info from the DDPOptimizer class from a test case or benchmark - dumps debug logs to stdout ghstack-source-id: 35e76e7acc34229a874830a06a58a1316d37e0b3 Pull Request resolved: #87549

github-actions · 2022-10-22T14:53:05Z

This PR needs a label

If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

For more information, see https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

wconstab · 2022-10-24T03:39:11Z

@pytorchbot merge

pytorchmergebot · 2022-10-24T03:40:40Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

@jansel

- adds support for 'first_bucket_cap' arg, to align bucketing more precisely with DDP, which may start a smaller first bucket - refactors the bucket splitting logic to be cleaner - adds pretty-print for bucket info, and a way to access bucket info from the DDPOptimizer class from a test case or benchmark - dumps debug logs to stdout cc @jansel @lezcano @fdrocha @mlazos @soumith @voznesenskym @yanboliang Pull Request resolved: pytorch#87549 Approved by: https://github.com/soumith

@jansel

- adds support for 'first_bucket_cap' arg, to align bucketing more precisely with DDP, which may start a smaller first bucket - refactors the bucket splitting logic to be cleaner - adds pretty-print for bucket info, and a way to access bucket info from the DDPOptimizer class from a test case or benchmark - dumps debug logs to stdout cc @jansel @lezcano @fdrocha @mlazos @soumith @voznesenskym @yanboliang Pull Request resolved: pytorch#87549 Approved by: https://github.com/soumith

@jansel

- adds support for 'first_bucket_cap' arg, to align bucketing more precisely with DDP, which may start a smaller first bucket - refactors the bucket splitting logic to be cleaner - adds pretty-print for bucket info, and a way to access bucket info from the DDPOptimizer class from a test case or benchmark - dumps debug logs to stdout cc @jansel @lezcano @fdrocha @mlazos @soumith @voznesenskym @yanboliang Pull Request resolved: pytorch#87549 Approved by: https://github.com/soumith

wconstab requested review from mrshenli, pritamdamania87, zhaojuanmao, rohan-varma, H-Huang, awgu and kwen2501 as code owners October 22, 2022 14:50

github-actions bot added ciflow/inductor module: dynamo labels Oct 22, 2022

github-actions bot requested review from albanD, anjali411, antoniojkim, bdhirsh, Chillee, ezyang, Krovatkin and miladm October 22, 2022 14:50

wconstab added topic: not user facing topic category ciflow/trunk Trigger trunk jobs on your pull request labels Oct 22, 2022

This was referenced Oct 22, 2022

Update skips to xfails for dynamo distributed tests #87558

Closed

Enable inductor ddp baseline test #87560

Closed

Add distributed dynamo benchmarking utils #87419

Closed

albanD removed their request for review October 23, 2022 19:46

soumith approved these changes Oct 24, 2022

View reviewed changes

pytorchmergebot added the Merged label Oct 24, 2022

pytorchmergebot closed this in 233305a Oct 24, 2022

facebook-github-bot deleted the gh/wconstab/17/head branch June 8, 2023 19:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improvements for DDP Optimizer #87549

Improvements for DDP Optimizer #87549

wconstab commented Oct 22, 2022 •

edited

pytorch-bot bot commented Oct 22, 2022 •

edited

github-actions bot commented Oct 22, 2022

wconstab commented Oct 24, 2022

pytorchmergebot commented Oct 24, 2022

Improvements for DDP Optimizer #87549

Improvements for DDP Optimizer #87549

Conversation

wconstab commented Oct 22, 2022 • edited

pytorch-bot bot commented Oct 22, 2022 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/87549

✅ No Failures

github-actions bot commented Oct 22, 2022

This PR needs a label

wconstab commented Oct 24, 2022

pytorchmergebot commented Oct 24, 2022

Merge started

wconstab commented Oct 22, 2022 •

edited

pytorch-bot bot commented Oct 22, 2022 •

edited