Add flag for dynamo+ddp optimizations #1221
Conversation
Add a flag that can be used to turn dynamo+ddp optimizations on. This will be used to compare how dynamo+ddp performs with and without the additional graph break strategy for improving dynamo+ddp compute/communication overlap. [ghstack-poisoned]
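For context, here is a minimal sketch of how such an opt-in flag might be exposed to the benchmark harness. The flag name follows the description above; the parser function and its placement are assumptions for illustration, not this PR's actual diff.

```python
# Hypothetical sketch: expose an opt-in dynamo+ddp flag via argparse.
# `parse_dynamo_args` is an illustrative name, not a function from this PR.
import argparse

def parse_dynamo_args(extra_args):
    parser = argparse.ArgumentParser()
    # Off by default, so existing benchmark runs are unaffected.
    parser.add_argument(
        "--optimize_dynamo_ddp",
        action="store_true",
        help="enable dynamo's extra graph breaks for DDP compute/communication overlap",
    )
    return parser.parse_args(extra_args)

args = parse_dynamo_args(["--optimize_dynamo_ddp"])
assert args.optimize_dynamo_ddp  # flag parses as True when passed
```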
@davidberard98 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
See nits in comments. Approved to unblock development.
torchbenchmark/util/extra_args.py (Outdated)
```python
precision = 'fp32' if model.dargs.precision == "fp32" else 'fp16'
model.set_module(enable_torchtrt(precision=precision, model=module, example_inputs=example_inputs))
# ... (intervening lines collapsed in the review snippet) ...
if args.optimize_dynamo_ddp:
```
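The body of the guard is collapsed in the snippet above. As a hedged illustration only, the branch could toggle dynamo's DDP handling through a config knob; `torchdynamo.config.optimize_ddp` is an assumption here, not code shown in this snippet.

```python
# Illustrative guess at the collapsed branch body; the config attribute
# `optimize_ddp` is an assumption, not confirmed by this PR's diff.
if args.optimize_dynamo_ddp:
    import torchdynamo
    torchdynamo.config.optimize_ddp = True  # hypothetical knob enabling the graph-break strategy
```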
Just curious, what kind of optimizations does torchdynamo do for DDP?
Also, to prevent code bloat in this file, how about we move this part into torchdynamo.py?
pytorch/torchdynamo#628 adds extra graph breaks in dynamo. The idea is that instead of DDP having to wait until the entire backward pass has completed, the extra graph breaks let autograd hooks fire earlier, so you get better overlap of communication (syncing gradients as soon as they are ready) with computation (the rest of the backward pass).
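To make the overlap concrete, here is a minimal single-process sketch (gloo backend, world_size=1, CPU; purely illustrative, not code from this PR) of where DDP's per-bucket autograd hooks fire during backward:

```python
# Single-process DDP toy example (gloo, world_size=1). DDP registers autograd
# hooks per parameter bucket; each hook can launch an allreduce as soon as that
# bucket's gradients are ready, overlapping communication with the remaining
# backward computation. One monolithic compiled backward graph would delay all
# hooks to the end, which is what the extra graph breaks are meant to avoid.
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
os.environ.setdefault("MASTER_PORT", "29500")
dist.init_process_group("gloo", rank=0, world_size=1)

model = DDP(torch.nn.Sequential(
    torch.nn.Linear(8, 8), torch.nn.ReLU(), torch.nn.Linear(8, 1)))

loss = model(torch.randn(4, 8)).sum()
loss.backward()  # gradient hooks fire bucket-by-bucket during this call
dist.destroy_process_group()
```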
Add a flag that can be used to turn dynamo+ddp optimizations on. This will be used to compare how dynamo+ddp performs with and without the additional graph break strategy for improving dynamo+ddp compute/communication overlap. Differential Revision: [D39976005](https://our.internmc.facebook.com/intern/diff/D39976005) [ghstack-poisoned]

@davidberard98 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
Stack from ghstack:
Add a flag that can be used to turn dynamo+ddp optimizations on. This will be used to compare how dynamo+ddp performs with and without the additional graph break strategy for improving dynamo+ddp compute/communication overlap.
Differential Revision: D39976005