@pritamdamania87 pritamdamania87 commented Dec 18, 2019

Stack from ghstack:

Multiple "trainers" could each create a different instance of DistributedOptimizer, which means we can still have a race condition unless we use a truly global, per-worker lock.

Differential Revision: D19154248
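
To illustrate the race described above, here is a minimal sketch in plain Python. It is not the code in this PR: `LocalOptimizerSketch`, `run_trainers`, and the list-of-floats "parameters" are hypothetical stand-ins. It assumes each trainer builds its own optimizer instance over the same worker-local parameters, so a lock stored per instance would not serialize their updates, while a lock stored on the class is shared by every instance in the worker process.

```python
import threading


class LocalOptimizerSketch:
    """Hypothetical stand-in for a worker-local optimizer wrapper."""

    # Class attribute: a single lock per process (i.e. per worker), shared by
    # all instances. A lock created in __init__ would instead be per instance
    # and would not protect against other trainers' optimizers.
    global_lock = threading.Lock()

    def __init__(self, params, lr=1.0):
        self.params = params  # shared, mutable list of floats (assumption)
        self.lr = lr

    def step(self, grads):
        # Serialize the read-modify-write across ALL instances on this worker.
        with LocalOptimizerSketch.global_lock:
            for i, grad in enumerate(grads):
                current = self.params[i]
                self.params[i] = current - self.lr * grad


def run_trainers(num_trainers=4, steps_per_trainer=1000):
    """Simulate several trainers updating the same worker-local parameter."""
    shared_params = [0.0]

    def trainer():
        # Each trainer creates its own optimizer instance over the same params.
        optimizer = LocalOptimizerSketch(shared_params)
        for _ in range(steps_per_trainer):
            optimizer.step([-1.0])  # each step adds lr * 1.0 to the parameter

    threads = [threading.Thread(target=trainer) for _ in range(num_trainers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return shared_params[0]


if __name__ == "__main__":
    print(run_trainers())
```

Because every update happens under the one class-level lock, no update is lost regardless of how many instances exist. The trade-off is that unrelated optimizers on the same worker also serialize against each other.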

pritamdamania87 pushed a commit that referenced this pull request Dec 18, 2019
Multiple "trainers" could each create a different instance of DistributedOptimizer, which means we can still have a race condition unless we use a truly global, per-worker lock.

Differential Revision: [D19154248](https://our.internmc.facebook.com/intern/diff/D19154248/)

ghstack-source-id: 95874624
Pull Request resolved: #31404
@pritamdamania87
Contributor Author

This is a resubmission of #30117.

@mrshenli mrshenli left a comment

Reviewed in #30117

cc @aazzolini

@facebook-github-bot
Contributor

This pull request has been merged in 359c39b.

@facebook-github-bot facebook-github-bot deleted the gh/pritamdamania87/46/head branch December 22, 2019 15:17
wuhuikx pushed a commit to wuhuikx/pytorch that referenced this pull request Jan 30, 2020
Summary:
Pull Request resolved: pytorch#31404

Multiple "trainers" could each create a different instance of DistributedOptimizer, which means we can still have a race condition unless we use a truly global, per-worker lock.
ghstack-source-id: 95874624

Test Plan: Run unit tests. Unfortunately, due to the non-deterministic behavior, it's not clear how to unit test this properly.

Differential Revision: D19154248

fbshipit-source-id: fab6286c17212f534f1bd1cbdf9f0de002d48c74

6 participants