
How to train on multi GPU #148

Closed
KaleidoZhouYN opened this issue Nov 10, 2017 · 3 comments
@KaleidoZhouYN

Hi,
I trained with your code on a single GPU and got a pretty nice result. Excellent job!

However, I ran into some trouble when trying to speed up training with multiple GPUs:

[screenshot: multi-GPU error message]

The multi-GPU handling in the source code also confuses me:

options/base_options.py: if len(self.opt.gpu_ids) > 0:
options/base_options.py:     torch.cuda.set_device(self.opt.gpu_ids[0])

Any advice? Thank you very much.

@junyanz
Owner

junyanz commented Nov 10, 2017

You can set gpu_ids to 0,1,2. `len(self.opt.gpu_ids) > 0` checks whether any GPU ids were specified at all. `torch.cuda.set_device(self.opt.gpu_ids[0])` sets the first listed GPU as the main device.
See here for more discussion. The multi-GPU code should work with instance norm; we haven't implemented synchronized batchnorm, as mentioned here.
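For what it's worth, the gpu_ids handling can be sketched roughly like this. This is a minimal sketch, not the repo's exact code: `parse_gpu_ids` is a hypothetical helper standing in for the parsing in options/base_options.py, and the `DataParallel` wrapping is one common way to split batches across the listed devices.

```python
# Hypothetical helper mirroring the --gpu_ids parsing in
# options/base_options.py (not the repo's exact code).
def parse_gpu_ids(arg):
    """Turn a --gpu_ids string like '0,1,2' into [0, 1, 2].
    Negative ids (e.g. '-1' for CPU mode) are dropped."""
    return [i for i in (int(s) for s in arg.split(',')) if i >= 0]

gpu_ids = parse_gpu_ids('0,1,2')
print(gpu_ids)              # → [0, 1, 2]
print(parse_gpu_ids('-1'))  # → []  (CPU mode)

try:  # torch is only needed for the device setup below
    import torch
except ImportError:
    torch = None

if torch is not None and gpu_ids and torch.cuda.is_available():
    # The first id becomes the main device, where gradients are gathered.
    torch.cuda.set_device(gpu_ids[0])
    net = torch.nn.Linear(4, 4).cuda(gpu_ids[0])
    # DataParallel replicates the module and splits each batch
    # across all listed GPUs.
    net = torch.nn.DataParallel(net, device_ids=gpu_ids)
```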

@KaleidoZhouYN
Author

@junyanz Yes, I appended --gpu_ids 0,1,2 and did not change the "norm" option.
Could the trouble be caused by the PyTorch version or something else?

@junyanz
Owner

junyanz commented Nov 10, 2017

It should be fine. You can check which norm you are using in the output log.
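For reference, the --norm option typically maps to a norm-layer constructor along these lines. This is a sketch modeled on a get_norm_layer-style helper (as in models/networks.py); treat the exact keyword arguments as assumptions.

```python
import functools

try:  # torch is optional here; the selection logic is plain Python
    import torch.nn as nn
except ImportError:
    nn = None

def get_norm_layer(norm_type='instance'):
    """Return a constructor for the requested normalization layer.

    Batch norm keeps running statistics computed per device, which is
    why naive multi-GPU training would want synchronized batchnorm.
    Instance norm normalizes each sample independently, so it is
    unaffected by how a batch is split across GPUs.
    """
    if nn is None:
        raise RuntimeError('PyTorch is required to build norm layers')
    if norm_type == 'batch':
        return functools.partial(nn.BatchNorm2d, affine=True)
    if norm_type == 'instance':
        return functools.partial(nn.InstanceNorm2d, affine=False)
    raise NotImplementedError('norm layer [%s] not found' % norm_type)
```

With instance norm each replicated model normalizes its own samples, so DataParallel works without any cross-GPU statistics, which matches the comment above that multi-GPU training should be fine with instancenorm.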

@junyanz junyanz closed this as completed Nov 10, 2017