hessian free optimizer needed #2682

Closed
rajarsheem opened this issue Jun 6, 2016 · 14 comments
Labels
stat:contribution welcome Status - Contributions welcome type:feature Feature requests

Comments

@rajarsheem

Hessian-free optimizers have been successfully applied to neural networks (especially RNNs). TensorFlow currently needs one!

@girving
Contributor

girving commented Jun 7, 2016

PRs welcome!

@girving girving added stat:contribution welcome Status - Contributions welcome triaged labels Jun 7, 2016
@ajaybhat

I would like to work on this.

@girving
Contributor

girving commented Jun 10, 2016

@ajaybhat Let us know if you have questions or issues!

@ajaybhat

@girving Could you let me know which classes to look at as a reference for implementing optimizers? I'm having a bit of trouble finding them.

@girving
Contributor

girving commented Jun 13, 2016

@ajaybhat Search for classes that inherit from Optimizer. However, note that for Hessian-free methods the standard split into compute_gradients and apply_gradients won't work as-is, since you need to compute partial second-order gradients. There are a few different ways one could handle that; the simplest would be to compute gradients as normal during compute_gradients and do the higher-order work in apply_gradients, as sketched below.
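A minimal sketch of that split, assuming the TF 1.x tf.train.Optimizer API. The class name HessianFreeOptimizer and the _second_order_direction helper are hypothetical placeholders, not part of TensorFlow; sparse (IndexedSlices) gradients and resource variables are not handled here.

```python
import tensorflow as tf

class HessianFreeOptimizer(tf.train.Optimizer):
    """Skeleton only: the second-order logic is left as a placeholder."""

    def __init__(self, learning_rate=0.1, use_locking=False, name="HessianFree"):
        super(HessianFreeOptimizer, self).__init__(use_locking, name)
        self._lr = learning_rate

    # compute_gradients is inherited unchanged: it returns ordinary
    # first-order (gradient, variable) pairs via backprop.

    def apply_gradients(self, grads_and_vars, global_step=None, name=None):
        # Hook the higher-order work in here, as suggested above: e.g.
        # run conjugate gradient on Hessian-vector products to turn each
        # first-order gradient into a second-order update direction.
        updated = [(self._second_order_direction(g, v), v)
                   for g, v in grads_and_vars if g is not None]
        return super(HessianFreeOptimizer, self).apply_gradients(
            updated, global_step=global_step, name=name)

    def _second_order_direction(self, grad, var):
        # Placeholder: a real HF optimizer would return the CG solution
        # of (H + damping * I) p = -g; here the gradient passes through.
        return grad

    def _apply_dense(self, grad, var):
        # Plain step along the (already transformed) update direction.
        return tf.assign_sub(var, self._lr * grad)
```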

@ajaybhat

Thanks! I'll let you know if I have any more questions.

@Fhrozen

Fhrozen commented Jun 14, 2016

I also tried to implement Hessian-free (HF) optimization in another framework, but did not succeed. You might be able to use this repo (https://github.com/drasmuss/hessianfree) as a reference or ask its author for help. The implementation is already in Python and CUDA, but it does not cover convolutional layers. If you have code to test, let me know so I can compare it with my other results.

@aselle aselle removed the triaged label Jul 28, 2016
@aselle aselle added type:feature Feature requests and removed enhancement labels Feb 9, 2017
@wangzt2012

Is anyone still trying to implement HF optimization in the TensorFlow framework?

@WihanB

WihanB commented Apr 20, 2017

I was attempting to implement HF optimisation and Saddle Free Newton. These algorithms are doable in an FFN framework. Unfortunately, for RNNs, as discussed in #5985, it is currently not possible to calculate second-order derivatives through DynamicRNNs due to their use of a while loop. The current workaround is to use StaticRNN. This while-loop second-derivative limitation remains the major obstacle to implementing a general Hessian-free optimisation algorithm, or any other general second-order method that requires Hessian-vector products.
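For reference, a sketch of the standard double-backprop construction of the Hessian-vector product mentioned above, assuming the TF 1.x graph API; `loss`, `params`, and `vec` are placeholder names. This is exactly the computation that fails through tf.while_loop, because it differentiates a gradient graph a second time.

```python
import tensorflow as tf

def hessian_vector_product(loss, params, vec):
    """Compute H*v via double backprop: H*v = d(g . v)/dw, where g = dL/dw."""
    # First backward pass: gradients of the loss w.r.t. the parameters.
    grads = tf.gradients(loss, params)
    # Inner product g . v, with v held constant so it is not differentiated.
    gv = tf.add_n([tf.reduce_sum(g * tf.stop_gradient(v))
                   for g, v in zip(grads, vec)])
    # Second backward pass: differentiating g . v w.r.t. w yields H*v.
    return tf.gradients(gv, params)
```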

@itsmeolivia
Contributor

Automatically closing due to lack of recent activity. Since this issue is old at this point, please reopen it if the request still applies with the latest version of TensorFlow. Thank you.

@alberduris

It's been more than a year since the issue was closed, TF 1.3 was recently released, and I still think a Hessian-free optimizer implementation for TF would be great.

Consider reopening?

@lixilinx

lixilinx commented Apr 11, 2018

Pardon me for promoting my own second-order optimization methods here. If you are interested, please check my TensorFlow package at https://github.com/lixilinx/psgd_tf
It provides second-order optimization with five different preconditioners, along with RNN/CNN examples, and it works for both FNNs and RNNs with while loops. tf.while_loop still does not support second-order derivatives; to work around this, you can use a perturbation of the gradient to approximate the Hessian-vector product you need, as in the sketch below.
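A minimal sketch of that finite-difference workaround, written in plain NumPy for clarity; `grad_fn` is an assumed callable returning the gradient at a given parameter vector, not part of the linked package.

```python
import numpy as np

def approx_hessian_vector_product(grad_fn, w, v, eps=1e-4):
    """Approximate H @ v by a finite difference of two gradient evaluations:
    H @ v ~= (g(w + eps*v) - g(w)) / eps."""
    g0 = grad_fn(w)             # gradient at the current parameters
    g1 = grad_fn(w + eps * v)   # gradient at the perturbed parameters
    return (g1 - g0) / eps      # secant approximation of the HVP
```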

As for HF optimization, its damping factor and line-search step size are obtained by trial and error, which could cause further trouble for a TensorFlow implementation. A second-order method without line search is preferable, and the methods in the link above are such examples.

@dave-fernandes

Here is an implementation of the Saddle Free method:
https://github.com/dave-fernandes/SaddleFreeOptimizer

@JaeDukSeo

amazing
