Join GitHub today
GitHub is home to over 20 million developers working together to host and review code, manage projects, and build software together.
Deduplicate solver regularization, logging, and local rates and decays #2518
Conversation
shelhamer
added JL ready for review
labels
May 27, 2015
|
I just looked over this -- looks great! This refactoring was much needed. Thanks @cypof and @shelhamer. Only issue I see is I'm not sure about the verb in |
|
Actually, I guess it is replacing |
|
No the net_->Update() needs to be in, so that it is not executed by solvers that are not of type SGDSolver. This way, only the root solver in a parallel setup will apply the update. That was actually the race I introduced when I split the big PR into small ones. Another name could be ApplyGradients? |
|
Okay, I see -- thanks for the explanation @cypof. I like |
|
Thanks for the comments @cypof and @jeffdonahue. I went with |
|
p.s. ignore the travis push check -- the travis Pr check is the one to heed. The push check was triggered by my accidental push to BVLC/caffe and then my deleting the branch made it fail. |
shelhamer
referenced
this pull request
May 27, 2015
Merged
Decouple the computational batch size and minibatch size by accumulating gradients #1977
|
Cool, LGTM |
jeffdonahue
added a commit
that referenced
this pull request
May 27, 2015
|
|
jeffdonahue |
b12c171
|
shelhamer commentedMay 27, 2015
This simplifies the solver code by de-duplicating shared logic.
Solver::Iteration()toSolver::MakeUpdate()to verb.
I plan to merge this shortly to make way for an updated #1977.