
About the args.batch_size_val #9

Closed
sunpeng1996 opened this issue Jul 23, 2019 · 3 comments

Comments

@sunpeng1996

Hi, Hengshuang.
In your code:

if args.distributed:
    torch.cuda.set_device(gpu)
    args.batch_size = int(args.batch_size / ngpus_per_node)
    args.batch_size_val = int(args.batch_size_val / ngpus_per_node)
    args.workers = int(args.workers / ngpus_per_node)

I think the default batch_size_val should be at least as large as ngpus_per_node; otherwise you get an error:
ValueError: batch_size should be a positive integeral value, but got batch_size=0
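For illustration, a minimal sketch of the failure mode; the max(1, ...) guard is only a suggestion, not code from this repo:

ngpus_per_node = 4
batch_size_val = 1                                        # smaller than the number of GPUs
per_gpu_val = int(batch_size_val / ngpus_per_node)        # -> 0, which the DataLoader rejects
per_gpu_val = max(1, batch_size_val // ngpus_per_node)    # possible guard: keep at least 1 sample per GPU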

@sunpeng1996
Author

Hi, Hengshuang:
Why must the input size in PSPNet be 8*n+1?

@sunpeng1996
Author

And why use Apex? Is it better than the original PyTorch distributed training?

@hszhao
Owner

hszhao commented Sep 9, 2019

  1. Thanks for the issue. I updated the default value of batch_size_val in the config files.
  2. We follow the earlier Caffe DeepLab, where the crop size needs to be 8*n+1 (this comes from the implementation of the 'interp' layer, which aligns corners during interpolation). 8*n is also fine now with PyTorch's 'interpolate' function (see the interpolation sketch below).
  3. At the time this repo was developed, sync BN was not included in the official PyTorch. You can use PyTorch 1.1 or newer, which has sync BN incorporated (see the sync BN sketch below).
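For illustration, a minimal sketch of the 8*n+1 arithmetic, assuming the commonly used crop size of 473 = 8*59+1: the stride-8 feature map then has size 60, and with align_corners=True the sampling step between the two grids is exactly (473 - 1) / (60 - 1) = 8.

import torch
import torch.nn.functional as F

crop = 473                                   # 8 * 59 + 1
feat_size = (crop - 1) // 8 + 1              # 60: spatial size after a stride-8 backbone
x = torch.randn(1, 1, crop, crop)
feat = F.interpolate(x, size=(feat_size, feat_size), mode='bilinear', align_corners=True)
# With align_corners=True the corner pixels of the two grids coincide, so the
# sampling step is exactly (473 - 1) / (60 - 1) = 8 and the corners map back exactly.
out = F.interpolate(feat, size=(crop, crop), mode='bilinear', align_corners=True)
print(out.shape)   # torch.Size([1, 1, 473, 473])

And a minimal sketch of using the built-in sync BN (PyTorch >= 1.1) instead of Apex; model, gpu, and an already-initialized process group are assumed to exist:

import torch

# Assumes torch.distributed.init_process_group(...) has already been called
model = torch.nn.SyncBatchNorm.convert_sync_batchnorm(model)   # replace BatchNorm layers with sync BN
model = torch.nn.parallel.DistributedDataParallel(model.cuda(gpu), device_ids=[gpu])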

@hszhao hszhao closed this as completed Sep 9, 2019