Skip to content
This repository has been archived by the owner on Nov 16, 2023. It is now read-only.

cifar10-single-mul-DDP-nccl-gloo mp.spawn lack gpu arg for train #126

Closed
SwordFaith opened this issue Nov 27, 2020 · 1 comment · Fixed by microsoft/pai#5228
Closed

cifar10-single-mul-DDP-nccl-gloo mp.spawn lack gpu arg for train #126

SwordFaith opened this issue Nov 27, 2020 · 1 comment · Fixed by microsoft/pai#5228
Assignees
Labels
template job bug Bug when running template jobs

Comments

@SwordFaith
Copy link
Contributor

image

https://github.com/microsoft/pai/blob/master/examples/Distributed-example/cifar10-single-mul-DDP-nccl-gloo.py

line 63: train function signature: def train(gpu, args, trainset)
line 42: mp.spawn(train, nprocs=args.gpus, args=(args, trainset)), where args= should be a tuple with size of 3.

@SwordFaith
Copy link
Contributor Author

image

same error in cifar10-single-mul-DDP-gloo-Apex-mixed

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
template job bug Bug when running template jobs
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants