
Why is it slow when using iter_size? #3808

Closed
cheer37 opened this issue Mar 12, 2016 · 1 comment
cheer37 commented Mar 12, 2016

I used to use a small batch size because of limited GPU memory.
With the help of iter_size, I could increase the number of images used to compute the gradient.
But it is slower than the original setup without iter_size.
What causes this?
Thanks.

@seanbell

I assume you mean that, with the same total number of images being computed, it's slower? A larger iter_size (and correspondingly smaller batch size) has more overhead from synchronizing CUDA threads and launching CUDA kernels. Rather than processing all the images in a single batch, the solver makes several smaller passes through the data per iteration, and each pass pays that overhead.
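To see why the result is the same but the execution differs, here is a minimal NumPy sketch of the accumulation that iter_size performs (a toy linear model with made-up shapes, not Caffe's actual code): averaging the gradients of several small batches gives the same update as one pass over the full batch, but it takes several forward/backward passes, each with its own launch/synchronization cost on a GPU.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((64, 10))  # 64 "images", 10 features each
y = rng.standard_normal(64)
w = rng.standard_normal(10)

def grad(Xb, yb, w):
    """MSE gradient of a linear model on one batch."""
    return 2.0 * Xb.T @ (Xb @ w - yb) / len(yb)

# One pass over the full batch (batch_size=64, iter_size=1).
g_full = grad(X, y, w)

# Four passes over quarter batches (batch_size=16, iter_size=4):
# accumulate each small-batch gradient, then average. Numerically
# identical, but on a GPU each pass incurs its own kernel-launch
# and synchronization overhead.
iter_size = 4
g_acc = np.zeros_like(w)
for Xb, yb in zip(np.split(X, iter_size), np.split(y, iter_size)):
    g_acc += grad(Xb, yb, w)
g_acc /= iter_size

assert np.allclose(g_full, g_acc)  # same update, more passes
```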

Also, from https://github.com/BVLC/caffe/blob/master/CONTRIBUTING.md:

Please do not post usage, installation, or modeling questions, or other requests for help to Issues.
Use the caffe-users list instead. This helps developers maintain a clear, uncluttered, and efficient view of the state of Caffe.
