
[Primitive][shard] Use autograd function for all sync ops#33

Merged
comaniac merged 2 commits into awslabs:main from comaniac:dont_use_bwd_hook
Feb 1, 2023

Conversation


@comaniac comaniac commented Feb 1, 2023

Description

We found that the behavior of PyTorch backward hooks can be inconsistent, so for safety this PR replaces every use of a backward hook with a forward pre-hook plus a custom autograd function. Specifically:

Before

register_backward_hook(dist.all_reduce)

Now

class _ReduceBackwardGradient(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        # no-op: return the input unchanged
        return x

    @staticmethod
    def backward(ctx, grad):
        # all-reduce the gradient
        dist.all_reduce(grad)
        return grad

register_forward_pre_hook(allreduce_backward_gradient)

Accordingly, _ReduceBackwardGradient is added. In addition, this PR also adds unit tests for all supported sync ops.
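For context, the pattern above can be sketched end-to-end in a single process. This is an illustrative sketch only, not the PR's actual code: the names `_AllReduceBackward` and `_allreduce_backward_gradient` are hypothetical, and the all-reduce is guarded so the example runs without an initialized process group.

```python
import torch


class _AllReduceBackward(torch.autograd.Function):
    """Identity in forward; all-reduces the gradient in backward."""

    @staticmethod
    def forward(ctx, x):
        # No-op: pass the activation through unchanged.
        return x

    @staticmethod
    def backward(ctx, grad):
        # In a real distributed run this all-reduces the gradient
        # across ranks; here it is skipped when no group is initialized.
        if torch.distributed.is_available() and torch.distributed.is_initialized():
            torch.distributed.all_reduce(grad)
        return grad


def _allreduce_backward_gradient(module, inputs):
    # Forward pre-hook: wrap each input so the autograd function's
    # backward runs when gradients flow back into this module.
    return tuple(_AllReduceBackward.apply(x) for x in inputs)


linear = torch.nn.Linear(4, 4)
linear.register_forward_pre_hook(_allreduce_backward_gradient)

x = torch.randn(2, 4, requires_grad=True)
linear(x).sum().backward()
print(x.grad.shape)
```

Because the wrapper is an identity in forward, the module's outputs are unchanged; the gradient synchronization happens only in backward, at a well-defined point in the autograd graph rather than via a backward hook.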

Checklist

  • PR's title starts with a category (e.g. [Bugfix], [Model], [Tutorial], etc)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage
  • Code is well-documented

cc @szhengac @chhzh123

@comaniac comaniac changed the title [Primitive][shard] Use auto-grad fn for all sync ops [Primitive][shard] Use autograd function for all sync ops Feb 1, 2023
@comaniac comaniac merged commit 1e15ee2 into awslabs:main Feb 1, 2023

comaniac commented Feb 1, 2023

Thanks @szhengac

@comaniac comaniac deleted the dont_use_bwd_hook branch February 1, 2023 01:08