New lr policies, MultiStep and StepEarly #190
Conversation
Nice policies Sergio. Thanks for the examples. Could you also include tests? Learning rate policies and termination criteria (#76) are both scheduled parts of the solver, and the conversation kind of stalled on the best way to add these. The options were observer/notify classes, coding right into the solver, or making learning rate and termination factories like the layer factory. I think refactoring to a LearningRateFactory could be nice and orderly, and then the solver would call the LearningRate for any updates. What do you think? Re: naming, StepPlateau or StepFlat might be more descriptive than StepEarly. Or, as you suggested elsewhere, EarlyStep has a nice relationship to early stopping.
@shelhamer, I agree with you, since this PR increases the number of learning rates past @Yangqing's refactoring threshold. I will use AdaptiveLearningRateFactory and AdaptiveLearningRate when I get the time to solve #30. AdaptiveLearningRate cannot be mixed with LearningRate because of their different APIs:

```cpp
template <typename Dtype>
class LearningRate {
 public:
  // Returns the global learning rate for the given iteration.
  Dtype schedule(const int iteration);
};

template <typename Dtype>
class AdaptiveLearningRate {
 public:
  // Returns a parameter-wise learning rate.
  shared_ptr<Blob<Dtype> > schedule(const int iteration,
                                    const shared_ptr<Blob<Dtype> > gradient);
};
```
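For context, a minimal sketch of how a solver loop might consume the non-adaptive API (the function and variable names here are illustrative, not part of the PR):

```cpp
// Hypothetical use of the proposed API: the solver queries the schedule
// once per iteration and scales its parameter update by the result.
template <typename Dtype>
void ApplyUpdate(LearningRate<Dtype>& lr, const int iteration) {
  const Dtype rate = lr.schedule(iteration);
  // ... scale the computed gradients by `rate` and update the weights ...
}
```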
Having a multistep decrease is definitely useful. The only thing I'd like to add is that having an unlimited number of steps makes parametrizing Caffe more difficult. (The reason I bring this up is that I am running hyperparameter optimization on Caffe.) So maybe instead of having to set each step, the stepsize could, just like the learning rate, follow a parametric function, e.g. decay exponentially or linearly. Let me know what you think.
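(One concrete form of this suggestion: let the i-th stepsize be s0 * alpha^i, with 0 < alpha < 1 for exponential decay, so the step positions are the cumulative sums of these sizes and the whole schedule needs only two hyperparameters instead of an open-ended list. This is just an illustration of the idea, not anything implemented in this PR.)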
@tdomhan I will first fix the current new_lr_decay policies and will add more later.
@sguada it'd be great to include these policies, and multistep would simplify the cifar-10 example.
Hi Caffe! Just a heads up to say I did try the merge on my own repo and ran the tests just as Travis did, and I am not getting any errors. FYI, Travis reports this after all tests are successful:
Approximately when will this commit be available?
@Mezn it is available, let me know if you have any problems.
@sguada my understanding is that stepearly is not part of the commit. Also, the *.prototxt files for mnist are in examples/lenet instead of examples/mnist. My 2 cents :) and thanks for this.
@sguada thanks for the explanations. I'm into stochastic optimization, so I'd be interested in looking at the old stepearly code. FYI, I am experimenting with a 'stagnation' policy relying on the median losses and/or tests in order to speed up the overall training time.
Let's remove `examples/lenet/lenet_stepearly_solver.prototxt`.
This `examples/lenet/lenet_stepearly_solver.prototxt` was introduced in BVLC#190 by mistake, since stepearly was never actually merged.
`lenet_multistep_solver.prototxt`
Allows multiple steps to be defined in the solver.prototxt by setting `lr_policy: "multistep"` and by defining a `stepvalue` for each point at which the learning rate should be decreased. This allows steps that are not evenly distributed. One should define the sequence of `stepvalue` entries in increasing order, as in the sketch below.
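A minimal sketch of such a solver definition (the values here are illustrative, not taken from the actual `lenet_multistep_solver.prototxt`):

```
# Drop the learning rate by a factor of gamma at each listed stepvalue.
base_lr: 0.01
lr_policy: "multistep"
gamma: 0.1
stepvalue: 5000   # stepvalue entries must appear in increasing order
stepvalue: 7000
stepvalue: 8000
max_iter: 10000
```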
`lenet_stepearly_solver.prototxt`
Allows the `lr_rate` to be decreased dynamically based on the behaviour of the test accuracy. The learning rate will be decreased when the maximum accuracy has not increased for a number of tests defined by `stepearly`.
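A hypothetical sketch of such a solver (stepearly was never actually merged, so the field names below are assumptions based on the description above, not real Caffe syntax):

```
# Hypothetical stepearly solver: decrease the rate by gamma whenever
# stepearly consecutive tests pass without a new maximum accuracy.
base_lr: 0.01
lr_policy: "stepearly"
gamma: 0.1
stepearly: 5      # assumed field: tests without improvement before decreasing
test_interval: 500
```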