MultiGPU #29

FengLoveBella · 2018-01-30T14:26:57Z

How to run your code on multi-GPU? Thank you very much.

zhengyang-wang · 2018-01-30T18:24:35Z

I haven't explored Tensorflow on multi-GPU currently.

myhooo · 2018-01-31T07:00:06Z

I add one GPU at the line "os.environ['CUDA_VISIBLE_DEVICES'] = '1,3'" in the main.py and the code can run on these two GPUs. @zhoufengbuaa

John1231983 · 2018-01-31T07:11:46Z

I do not think so. For multiple GPU, you have to compute average gradient and batch normalization. It is very difficult. For easy, just compute average gradient and it will work. See the example of mnist dataset

FengLoveBella · 2018-01-31T10:32:06Z

@myhooo os.environ['CUDA_VISIBLE_DEVICES'] = '1,3' it is absolutely not ok, the gpu1 and gpu3 are allocated, but only the gpu1 is used for network.
@John1231983 I try a lot to use multi-gpu, I really compute average grads and average loss, but there is still some problem. reuse_variables and some else drive me crazy.

FengLoveBella · 2018-01-31T10:34:12Z

@zhengyang-wang It is very important to use large batch when semantic segmentation. Multi-gpu is absolutely a good chiose.

myhooo · 2018-02-02T10:14:30Z

@zhoufengbuaa Thank you for telling me that I am wrong~ ^_^

zhengyang-wang · 2018-02-02T19:37:33Z

@zhoufengbuaa I'm aware of that. However, there is an easy way as suggested by @John1231983, which is to use accumulated gradients. A similar way is used in the implementation of msc training. You can read my code to figure out how to do it. This approach allows you to use a large batch of larger patches, but it takes longer time to train.

John1231983 · 2018-02-02T22:20:52Z

I thinl gradient is one one problem of multiple gpu. The another is syn. batch norm statistic that is not support in tensorflow now

zhengyang-wang closed this as completed Jan 30, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MultiGPU #29

MultiGPU #29

FengLoveBella commented Jan 30, 2018

zhengyang-wang commented Jan 30, 2018

myhooo commented Jan 31, 2018

John1231983 commented Jan 31, 2018

FengLoveBella commented Jan 31, 2018

FengLoveBella commented Jan 31, 2018

myhooo commented Feb 2, 2018

zhengyang-wang commented Feb 2, 2018

John1231983 commented Feb 2, 2018

MultiGPU #29

MultiGPU #29

Comments

FengLoveBella commented Jan 30, 2018

zhengyang-wang commented Jan 30, 2018

myhooo commented Jan 31, 2018

John1231983 commented Jan 31, 2018

FengLoveBella commented Jan 31, 2018

FengLoveBella commented Jan 31, 2018

myhooo commented Feb 2, 2018

zhengyang-wang commented Feb 2, 2018

John1231983 commented Feb 2, 2018