
module 'torch.distributed' has no attribute 'ReduceOp' #53

Closed · sctrueew opened this issue Jun 17, 2019 · 4 comments
sctrueew commented Jun 17, 2019

Hi everyone,

I installed all the requirements, and when I run python eval.py I get this error:
module 'torch.distributed' has no attribute 'ReduceOp'
My PyTorch version is 1.1.0.
Thanks.
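A quick way to check whether the installed PyTorch build was compiled with distributed support (which is where ReduceOp lives) is a diagnostic along these lines; this is a minimal sketch, not part of the repository:

import torch
import torch.distributed as dist

# torch.distributed is only functional in builds compiled with
# distributed support (Linux binaries by default in PyTorch 1.1.0).
print(dist.is_available())        # False means the build lacks distributed support
print(hasattr(dist, "ReduceOp"))  # False reproduces the AttributeError above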

ycszen (Owner) commented Jul 4, 2019

Could you give more details? The eval.py script doesn't call torch.distributed or ReduceOp directly, so I need more information to figure out why you are getting this error.

jay1009 commented Jul 24, 2019

I ran into this problem too, but it occurred when I ran python train.py.
Can you tell me what might be causing it?
Thanks.

ycszen (Owner) commented Jul 25, 2019

@jay1009 You need to launch train.py with the distributed launcher, as follows:

export NGPUS=8
python -m torch.distributed.launch --nproc_per_node=$NGPUS train.py
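For context, torch.distributed.launch spawns one process per GPU and passes each process a --local_rank argument, along with the environment variables (MASTER_ADDR, MASTER_PORT, RANK, WORLD_SIZE) that init_process_group consumes. A training script typically wires this up roughly as follows; this is a minimal sketch, not the repository's actual train.py:

import argparse
import torch
import torch.distributed as dist

parser = argparse.ArgumentParser()
parser.add_argument("--local_rank", type=int, default=0)  # injected by the launcher
args = parser.parse_args()

torch.cuda.set_device(args.local_rank)
# "env://" reads MASTER_ADDR, MASTER_PORT, RANK, and WORLD_SIZE
# from the environment variables set up by torch.distributed.launch
dist.init_process_group(backend="nccl", init_method="env://")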

ycszen closed this as completed Aug 1, 2019
jay1009 commented Aug 6, 2019

@ycszen Thanks for your reply. But I only have one GPU, and after running

export NGPUS=1
python -m torch.distributed.launch --nproc_per_node=$NGPUS train.py

I still get the same error. Is it necessary to use 8 GPUs to run this program?
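One GPU is fine for the launcher. To check that distributed initialization itself works on the machine, independent of this repository's train.py, a standalone single-process sketch like the following can help; it exercises the same ReduceOp attribute the error complains about:

import torch
import torch.distributed as dist

# Single-process "group of one": no launcher or environment variables needed.
dist.init_process_group(backend="nccl",
                        init_method="tcp://127.0.0.1:29500",
                        rank=0, world_size=1)
t = torch.ones(1).cuda()
dist.all_reduce(t, op=dist.ReduceOp.SUM)  # effectively a no-op with world_size=1
print(t)  # tensor([1.], device='cuda:0')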
