
Add Adadelta optimizer #644

Merged
merged 1 commit into tensorflow:master on Mar 22, 2016

Conversation

Mistobaan
Contributor

This is my attempt at #516. Looking for some early feedback, as this is my first attempt.

@wchan

wchan commented Jan 2, 2016

The GPU impl is missing -- did you forget to git add the file? The code references it but the file isn't there. ;)

Also, I think a few lines are over 80 characters, but otherwise LGTM.

@vrv

vrv commented Jan 6, 2016

@zffchen78: when you get a chance can you take a look at this?

@tensorflow-jenkins
Collaborator

Can one of the admins verify this patch?

@zffchen78
Contributor

LGTM

@Mistobaan
Contributor Author

@vrv OK, I will add the CUDA part and make the edits!

@osdf

osdf commented Jan 8, 2016

Two suggestions:

  • Could you add a learning rate parameter lr? The original publication does not have one, but flexibility here is a good thing.
  • Could you rename decay_rate to rho, following the paper and the rmsprop impl in the same file?

I think there is a mistake:
https://github.com/Mistobaan/tensorflow/blob/7a262ee6467c909cae723e0de5fb87a2a7e9a664/tensorflow/core/kernels/training_ops.cc#L53
This line should either use accum = ... (instead of +=) or follow the corresponding line in the rmsprop implementation further down (https://github.com/Mistobaan/tensorflow/blob/7a262ee6467c909cae723e0de5fb87a2a7e9a664/tensorflow/core/kernels/training_ops.cc#L133).
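For reference, both accumulators in Zeiler's Adadelta paper are exponential moving averages, which is why the line above needs accum = rho * accum + ... rather than accum +=. Below is a minimal, illustrative C++ sketch of the per-element update (not the PR's actual Eigen kernel); the function name, the epsilon parameter, and the extra lr scaling are assumptions for the example, with lr and rho named as suggested above.

#include <cmath>
#include <cstddef>

// Illustrative sketch of one Adadelta step (Zeiler 2012), with the optional
// lr scaling discussed above. Both accumulators are exponential moving
// averages, hence "accum = rho * accum + ..." rather than "accum += ...".
// Names (AdadeltaStep, accum_update, epsilon) are chosen for this example.
void AdadeltaStep(float* var, float* accum, float* accum_update,
                  const float* grad, std::size_t n,
                  float lr, float rho, float epsilon) {
  for (std::size_t i = 0; i < n; ++i) {
    const float g = grad[i];
    // E[g^2]_t = rho * E[g^2]_{t-1} + (1 - rho) * g^2
    accum[i] = rho * accum[i] + (1.0f - rho) * g * g;
    // Update scaled by RMS of previous updates over RMS of gradients.
    const float update = std::sqrt(accum_update[i] + epsilon) /
                         std::sqrt(accum[i] + epsilon) * g;
    // E[dx^2]_t = rho * E[dx^2]_{t-1} + (1 - rho) * dx_t^2
    accum_update[i] = rho * accum_update[i] + (1.0f - rho) * update * update;
    var[i] -= lr * update;
  }
}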

@wchan

wchan commented Jan 16, 2016

FYI, there are a couple of bugs -- namely, the += should have been =.

My impl is here (GPU support, a separate lr, and sparse support included):
https://github.com/wchan/tensorflow/blob/master/tensorflow/core/kernels/training_ops.cc

@zffchen78
Contributor

Hi, Mistobaan, please let us know how you plan to proceed on this. I can see the following work items:

  1. Address the accum issue raised by osdf and wchan. I do not have the background to judge whether it's a bug or your design choice; I can only review whether the implementation matches the specification (which currently says accum +=), not whether that matches the intended algorithm. It's up to you three to judge.
  2. Decide whether to add the lr option. This is also up to you and the others to discuss and agree on.

1 & 2 must be settled before we can add this op, since they affect the op's interface / specification and will be hard to fix later.

  3. GPU support and test.
  4. Sparse support and test.

You can get 1 & 2 done and merged first and collaborate with others to get 3 & 4 done, or you can get them done incrementally later.

@Mistobaan
Contributor Author

Hi!
I am working on a patch that incorporates all of the above suggestions and the GPU code from William Chan.


@Mistobaan Mistobaan force-pushed the master branch 3 times, most recently from 18dfed6 to 16a7c89 on January 20, 2016 at 23:39
@vrv

vrv commented Feb 1, 2016

@Mistobaan: any updates?

@Mistobaan
Contributor Author

@vrv I think the operations themselves are up to date and correct. I was not able to create solid testing for the GPU, as I don't have a supported GPU environment at the moment. I just rebased the patch.

@vrv

vrv commented Feb 1, 2016

OK, we'll probably have GPU testing soon, right @martinwicke? :) Then we can try pulling this in.

@vrv

vrv commented Feb 16, 2016

@tensorflow-jenkins: test this please

@tensorflow-jenkins
Collaborator

Can one of the admins verify this patch?

@vrv

vrv commented Feb 16, 2016

tensorflow/core/kernels/training_ops.cc:338:8: error: template-id 'operator()<>' for 'void tensorflow::functor::ApplyAdadelta<Eigen::GpuDevice, double>::operator()(const GPUDevice&, tensorflow::TTypes<double, 1, long int>::Flat, tensorflow::TTypes<double, 1, long int>::Flat, tensorflow::TTypes<double, 1, long int>::Flat, tensorflow::TTypes<double, 1, long int>::ConstScalar, tensorflow::TTypes<double, 1, long int>::ConstScalar, tensorflow::TTypes<double, 1, long int>::ConstScalar, tensorflow::TTypes<double, 1, long int>::ConstFlat)' does not match any template declaration
void ApplyAdadelta<GPUDevice, T>::operator()(
^
tensorflow/core/kernels/training_ops.cc:348:1: note: in expansion of macro 'DECLARE_GPU_SPEC'
DECLARE_GPU_SPEC(double);
^
tensorflow/core/kernels/training_ops.cc:345:41: note: saw 1 'template<>', need 2 for specializing a member function template
typename TTypes::ConstFlat grad);
^
tensorflow/core/kernels/training_ops.cc:348:1: note: in expansion of macro 'DECLARE_GPU_SPEC'
DECLARE_GPU_SPEC(double);
^
tensorflow/core/kernels/training_ops.cc:352:17: error: expected constructor, destructor, or type conversion before '(' token
REGISTER_KERNELS(GPU, float);
^
tensorflow/core/kernels/training_ops.cc:353:17: error: expected constructor, destructor, or type conversion before '(' token
REGISTER_KERNELS(GPU, double);

@vrv

vrv commented Feb 18, 2016

(Ping this thread when this is ready -- it looks like you added a commit, but I don't know if it's ready.)

@bernardopires
Contributor

@Mistobaan, any update on this? I'm also really interested in trying AdaDelta. Thanks!

# limitations under the License.
# ==============================================================================

"""Tests for Momentum."""


Docstring references Momentum instead of AdaDelta.

@Mistobaan Mistobaan force-pushed the master branch 2 times, most recently from 7753631 to ad01e6f on March 14, 2016 at 04:21
@Mistobaan Mistobaan changed the title from "[WIP] Add Adadelta optimizer" to "Add Adadelta optimizer" on Mar 14, 2016
@vrv

vrv commented Mar 17, 2016

@tensorflow-jenkins: test this please

@vrv

vrv commented Mar 17, 2016

@Mistobaan: I think this looks good pending the GPU tests finishing -- the only other thing I think we need is for you to run

bazel-bin/tensorflow/core/ops/compat/update_ops tensorflow/core/ops

and add that updated file to your commit for tracking backwards compatibility.

@Mistobaan
Contributor Author

@vrv I ran it and added the modified files as you requested.

@vrv

vrv commented Mar 21, 2016

@tensorflow-jenkins: test this please

@vrv

vrv commented Mar 21, 2016

Looks like nothing built -- can you double check and verify that your commit compiles and passes in at least one config?

@Mistobaan
Contributor Author

@vrv let's try again -- I had run the update_ops command on an old branch and have force-pushed.

@vrv

vrv commented Mar 21, 2016

@tensorflow-jenkins: test this please

vrv pushed a commit that referenced this pull request Mar 22, 2016
@vrv vrv merged commit a71e1ff into tensorflow:master Mar 22, 2016
darkbuck pushed a commit to darkbuck/tensorflow that referenced this pull request Jan 23, 2020: enable hipOccupancyMaxPotentialBlockSize