This repository was archived by the owner on Aug 1, 2025. It is now read-only.

Conversation

@alanwaketan

Summary:
This is a PoC that breaks the graph in order to support gradient hooks in the
AOT backend. The end-to-end use cases are composing DDP/FSDP with dynamo.

Test Plan:
WIP.
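
For context on why graph breaks are needed at all: DDP-style gradient hooks are per-parameter backward hooks that autograd fires in eager mode, so a single monolithic AOT-compiled backward would never hit them. A minimal sketch of such a hook (the toy model and the print are placeholders, not the PoC's code):

import torch

model = torch.nn.Linear(10, 10)

# Hypothetical stand-in for what DDP does internally: a hook on each parameter
# that fires as soon as its gradient is produced during backward. In real DDP
# the hook would enqueue an async all_reduce on a gradient bucket.
for name, param in model.named_parameters():
    param.register_hook(lambda grad, name=name: print(f"gradient hook fired: {name}") or grad)

loss = model(torch.randn(2, 10)).sum()
loss.backward()  # the hooks fire here, one per parameter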


@jansel
Contributor

jansel commented Jul 2, 2022

Overall this seems reasonable, it proves you can insert graph breaks inside the backend.

I'm wondering if putting communication primitives in the graph would also be possible.

@alanwaketan
Author

I'm wondering if putting communication primitives in the graph would also be possible.

Do you mean tracing through the gradient hooks or manually inserting communication ops?

@jansel
Contributor

jansel commented Jul 8, 2022

Do you mean tracing through the gradient hooks or manually inserting communication ops?

We could do whichever is easier, but the result would be a graph that contains communication ops.
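
To make the second option concrete, here is a minimal sketch of manually splicing a communication op into an FX graph. The pass and the node selection are hypothetical, chosen only to illustrate the graph-surgery API; a real pass would also need to account for all_reduce mutating its input in place:

import torch.distributed as dist
import torch.fx as fx

def insert_allreduce(gm: fx.GraphModule, node_names):
    # After each named node, splice in a dist.all_reduce call so the
    # communication becomes an ordinary node in the traced graph.
    for node in list(gm.graph.nodes):
        if node.name in node_names:
            with gm.graph.inserting_after(node):
                # all_reduce reduces in place across the default process group;
                # a real pass would re-point downstream users at the reduced value.
                gm.graph.call_function(dist.all_reduce, args=(node,))
    gm.graph.lint()
    gm.recompile()
    return gm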

@alanwaketan
Author

This is the output from dynamo_example2.py, an improved PoC in which the gradient hooks actually fire in between the compiled submodules.

(pytorch39) jwtan@ip-10-200-66-59:/fsx/users/jwtan/work/torchdynamo$ gpurun python dynamo_example2.py 
STAGE:2022-07-14 06:12:26 8511:8511 ActivityProfilerController.cpp:294] Completed Stage: Warm Up
graph_break_compiler() called with FX graph:
opcode         name              target                             args                                        kwargs
-------------  ----------------  ---------------------------------  ------------------------------------------  ------------------
placeholder    x                 x                                  ()                                          {}
placeholder    self_net1_weight  self_net1_weight                   ()                                          {}
placeholder    self_net1_bias    self_net1_bias                     ()                                          {}
placeholder    self_net2_weight  self_net2_weight                   ()                                          {}
placeholder    self_net2_bias    self_net2_bias                     ()                                          {}
placeholder    self_net3_weight  self_net3_weight                   ()                                          {}
placeholder    self_net3_bias    self_net3_bias                     ()                                          {}
placeholder    self_net4_weight  self_net4_weight                   ()                                          {}
placeholder    self_net4_bias    self_net4_bias                     ()                                          {}
call_function  linear            <built-in function linear>         (x, self_net1_weight, self_net1_bias)       {}
call_function  relu              <function relu at 0x7f353b8f8160>  (linear,)                                   {'inplace': False}
call_function  linear_1          <built-in function linear>         (relu, self_net2_weight, self_net2_bias)    {}
call_function  relu_1            <function relu at 0x7f353b8f8160>  (linear_1,)                                 {'inplace': False}
call_function  linear_2          <built-in function linear>         (relu_1, self_net3_weight, self_net3_bias)  {}
call_function  relu_2            <function relu at 0x7f353b8f8160>  (linear_2,)                                 {'inplace': False}
call_function  linear_3          <built-in function linear>         (relu_2, self_net4_weight, self_net4_bias)  {}
output         output            output                             ((linear_3,),)                              {}

graph_break_compiler() called with splitted graphs:
opcode         name              target                      args                                   kwargs
-------------  ----------------  --------------------------  -------------------------------------  --------
placeholder    self_net1_bias    self_net1_bias              ()                                     {}
placeholder    self_net1_weight  self_net1_weight            ()                                     {}
placeholder    x                 x                           ()                                     {}
call_function  linear            <built-in function linear>  (x, self_net1_weight, self_net1_bias)  {}
output         output            output                      ((linear,),)                           {}

opcode         name              target                             args                                      kwargs
-------------  ----------------  ---------------------------------  ----------------------------------------  ------------------
placeholder    linear            linear                             ()                                        {}
placeholder    self_net2_weight  self_net2_weight                   ()                                        {}
placeholder    self_net2_bias    self_net2_bias                     ()                                        {}
call_function  relu              <function relu at 0x7f353b8f8160>  (linear,)                                 {'inplace': False}
call_function  linear_1          <built-in function linear>         (relu, self_net2_weight, self_net2_bias)  {}
call_function  relu_1            <function relu at 0x7f353b8f8160>  (linear_1,)                               {'inplace': False}
output         output            output                             ((relu_1,),)                              {}

opcode         name              target                             args                                        kwargs
-------------  ----------------  ---------------------------------  ------------------------------------------  ------------------
placeholder    relu_1            relu_1                             ()                                          {}
placeholder    self_net3_weight  self_net3_weight                   ()                                          {}
placeholder    self_net3_bias    self_net3_bias                     ()                                          {}
placeholder    self_net4_weight  self_net4_weight                   ()                                          {}
placeholder    self_net4_bias    self_net4_bias                     ()                                          {}
call_function  linear_2          <built-in function linear>         (relu_1, self_net3_weight, self_net3_bias)  {}
call_function  relu_2            <function relu at 0x7f353b8f8160>  (linear_2,)                                 {'inplace': False}
call_function  linear_3          <built-in function linear>         (relu_2, self_net4_weight, self_net4_bias)  {}
output         output            output                             ((linear_3,),)                              {}

AOT compiled all 3 modules

graph_break_compiler() called with stitched graph:
opcode         name              target                                                                        args                                                                          kwargs
-------------  ----------------  ----------------------------------------------------------------------------  ----------------------------------------------------------------------------  --------
placeholder    x                 x                                                                             ()                                                                            {}
placeholder    self_net1_weight  self_net1_weight                                                              ()                                                                            {}
placeholder    self_net1_bias    self_net1_bias                                                                ()                                                                            {}
placeholder    self_net2_weight  self_net2_weight                                                              ()                                                                            {}
placeholder    self_net2_bias    self_net2_bias                                                                ()                                                                            {}
placeholder    self_net3_weight  self_net3_weight                                                              ()                                                                            {}
placeholder    self_net3_bias    self_net3_bias                                                                ()                                                                            {}
placeholder    self_net4_weight  self_net4_weight                                                              ()                                                                            {}
placeholder    self_net4_bias    self_net4_bias                                                                ()                                                                            {}
call_function  forward           <bound method aot_module_simplified.<locals>.AOTModule.forward of AOTModule(  (self_net1_bias, self_net1_weight, x)                                         {}
                                   (orig_module): GraphModule()
                                 )>
call_method    linear            __getitem__                                                                   (forward, 0)                                                                  {}
call_function  forward_1         <bound method aot_module_simplified.<locals>.AOTModule.forward of AOTModule(  (linear, self_net2_weight, self_net2_bias)                                    {}
                                   (orig_module): GraphModule()
                                 )>
call_method    relu_1            __getitem__                                                                   (forward_1, 0)                                                                {}
call_function  forward_2         <bound method aot_module_simplified.<locals>.AOTModule.forward of AOTModule(  (relu_1, self_net3_weight, self_net3_bias, self_net4_weight, self_net4_bias)  {}
                                   (orig_module): GraphModule()
                                 )>
call_method    linear_3          __getitem__                                                                   (forward_2, 0)                                                                {}
STAGE:2022-07-14 06:12:48 8511:8511 ActivityProfilerController.cpp:300] Completed Stage: Collection
STAGE:2022-07-14 06:12:48 8511:8511 output_json.cpp:417] Completed Stage: Post Processing
output         output            output                                                                        ((linear_3,),)                                                                {}

gradient hook fired
gradient hook fired
gradient hook fired
gradient hook fired
gradient hook fired
gradient hook fired
gradient hook fired
gradient hook fired
8511: iteration 0, loss 0.8432953953742981
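
The splitting code itself isn't shown in the PR, but for illustration, one way to get submodules like the three above is torch.fx.passes.split_module. The helper and the break points below are assumptions, picked to reproduce the partitioning in this log:

import torch.fx as fx
from torch.fx.passes.split_module import split_module

def split_at(gm: fx.GraphModule, break_after):
    # Assign every node a partition id, bumping the id after each node we want
    # to break on; split_module then turns each partition into its own
    # GraphModule (submod_0, submod_1, ...) that can be AOT-compiled separately.
    part, node_to_part = 0, {}
    for node in gm.graph.nodes:
        node_to_part[node] = part
        if node.name in break_after:
            part += 1
    return split_module(gm, gm, lambda n: node_to_part[n])

# e.g. split_at(gm, {"linear", "relu_1"}) yields the three split graphs above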

@alanwaketan alanwaketan requested a review from wconstab July 14, 2022 18:27
@alanwaketan
Author

alanwaketan commented Jul 15, 2022

Here is the profiler output, which shows that the gradient hooks fire in between each compiled AOT submodule.
[Screenshot: profiler trace (Screen Shot 2022-07-14 at 10.48.38 AM)]

@facebook-github-bot
Contributor

Hi @alanwaketan!

Thank you for your pull request.

We require contributors to sign our Contributor License Agreement, and yours needs attention.

You currently have a record in our system, but the CLA is no longer valid, and will need to be resubmitted.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (e.g. your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at cla@fb.com. Thanks!

@jansel jansel removed their request for review August 2, 2022 21:02
@jansel
Contributor

jansel commented Oct 15, 2022

We have migrated torchdynamo to torch._dynamo and will use the pytorch/pytorch repo for future development. Please resubmit this PR to https://github.com/pytorch/pytorch/

More details and instructions to port this PR over can be found in #1588

@jansel jansel closed this Oct 15, 2022