[Feature request] Adding support for "iter_size"-like hyperparameters (Caffe) #54
Hi, thanks a lot for sharing this awesome project.

I wonder whether the code currently supports a hyperparameter like Caffe's "iter_size", i.e. accumulating gradients over "iter_size" batches and then applying them as a single update. With this hyperparameter one can emulate training with a larger batch size without distributed training: if batch_size is set to, say, 64 and iter_size to ITER_SIZE, the effective batch size becomes 64*ITER_SIZE, since the gradients of ITER_SIZE batches are accumulated.

Is this doable with the current code? Is there any plan to support this feature?

Thank you.
Comments

That is a cool trick. I am not aware of a plan right now, but as we start working on converging FP16 this might be something that is done to test larger batch sizes. I will leave this open so everyone on the perf team can see it. If we do something, I will try to update the ticket, and if not, close it in 30-60 days.

This can be done by writing a new …

@ppwwyyxx TensorPack has everything. :-) I like looking through your code and examples.

@tfboyd thanks for your reply. If supporting this case is easy with what @ppwwyyxx has now, I hope this feature will be supported soon.

Merge internal changes into public repository (change 184035298)
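For reference, the accumulation scheme described in the original post can be expressed with plain TensorFlow 1.x ops, independent of this repository. Below is a minimal sketch: the toy linear-regression model, the synthetic data, and the values of ITER_SIZE, BATCH_SIZE, and the learning rate are placeholders for illustration only, and this is not tf_cnn_benchmarks' or TensorPack's implementation.

```python
import numpy as np
import tensorflow as tf

ITER_SIZE = 4     # number of batches whose gradients are accumulated (hypothetical value)
BATCH_SIZE = 64   # per-pass batch size; effective batch size is 64 * ITER_SIZE

# Tiny stand-in model (linear regression) so the sketch is self-contained.
x = tf.placeholder(tf.float32, [None, 10])
y = tf.placeholder(tf.float32, [None, 1])
w = tf.Variable(tf.zeros([10, 1]))
loss = tf.reduce_mean(tf.square(tf.matmul(x, w) - y))

optimizer = tf.train.GradientDescentOptimizer(0.1)
tvars = tf.trainable_variables()
grads = tf.gradients(loss, tvars)

# One non-trainable buffer per variable holding the running gradient sum.
accum_vars = [tf.Variable(tf.zeros_like(v.initialized_value()), trainable=False)
              for v in tvars]
zero_ops = [a.assign(tf.zeros_like(a)) for a in accum_vars]
accum_ops = [a.assign_add(g) for a, g in zip(accum_vars, grads)]

# Apply the averaged accumulated gradients as a single weight update.
apply_op = optimizer.apply_gradients(
    [(a / ITER_SIZE, v) for a, v in zip(accum_vars, tvars)])

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for step in range(100):
        sess.run(zero_ops)                    # reset the accumulators
        for _ in range(ITER_SIZE):            # ITER_SIZE forward/backward passes
            xb = np.random.rand(BATCH_SIZE, 10).astype(np.float32)
            yb = xb.sum(axis=1, keepdims=True)
            sess.run(accum_ops, feed_dict={x: xb, y: yb})
        sess.run(apply_op)                    # one update with the summed gradients
```

Since the per-batch loss is already a mean, summing ITER_SIZE per-batch gradients and dividing by ITER_SIZE reproduces the gradient of the mean loss over 64*ITER_SIZE examples, i.e. the effective batch size discussed above, while peak memory still corresponds to batch_size=64. One caveat: anything computed per batch, such as batch-normalization statistics, still sees only 64 examples, so this is not exactly identical to training with a truly larger batch.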