Skip to content

Specify num_replicas and rank when creating sampler#216

Merged
ShadenSmith merged 3 commits intodeepspeedai:masterfrom
vidushv:vidushv/modelParallelism
Jun 4, 2020
Merged

Specify num_replicas and rank when creating sampler#216
ShadenSmith merged 3 commits intodeepspeedai:masterfrom
vidushv:vidushv/modelParallelism

Conversation

@vidushv
Copy link
Contributor

@vidushv vidushv commented May 15, 2020

Addressing issue #99. If mpu is provided to deepspeed.initialize(), then pass on the world size and rank information to DistributedSampler to ensure multiple processes will load the appropriate subset of the original dataset.

@vidushv vidushv marked this pull request as ready for review May 15, 2020 21:26
@ShadenSmith ShadenSmith linked an issue May 21, 2020 that may be closed by this pull request
Copy link
Contributor

@ShadenSmith ShadenSmith left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good! Just a minor comment.

@ShadenSmith
Copy link
Contributor

Thanks @vidushv! This looks good. I'll merge after the tests pass.

@ShadenSmith ShadenSmith merged commit 0f72988 into deepspeedai:master Jun 4, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

DeepSpeed using DistributedSampler with model parallelism

2 participants