L-BFGS for HessianUpdateStrategy #14046

MarcYin · 2021-05-15T01:25:36Z

When the input vector is large, the BFGS and SR1 HessianUpdateStrategy implementations raise a memory error. Would it be possible to use L-BFGS for the Hessian update? That should reduce the memory required for the Hessian.

MarcYin · 2021-08-30T17:36:47Z

Any updates?

andyfaff · 2021-08-30T23:37:17Z

First, I assume that you are using trust-constr, as HessianUpdateStrategy is only available for that method. Only SR1 and BFGS are available at this time. If you were to implement a class that inherits HessianUpdateStrategy and implements its methods then you would also be able to use that.
Is it imperative that you use trust-constr?
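
To make the subclassing suggestion concrete, here is a minimal sketch rather than a tested SciPy feature. The class name `LimitedMemoryBFGS` and its `memory` and `min_curvature` parameters are hypothetical; only the `initialize`, `update`, `dot` and `get_matrix` method names come from `HessianUpdateStrategy`. It stores the last few `(s, y)` pairs and applies the direct BFGS update recursively, starting from a scaled identity, so the product with a vector never forms an n-by-n matrix.

```python
from collections import deque

import numpy as np
from scipy.optimize import HessianUpdateStrategy


class LimitedMemoryBFGS(HessianUpdateStrategy):
    """Hypothetical limited-memory BFGS Hessian approximation (sketch)."""

    def __init__(self, memory=10, min_curvature=1e-8):
        self.memory = memory                # assumed name: pairs to keep
        self.min_curvature = min_curvature  # assumed name: skip threshold

    def initialize(self, n, approx_type):
        if approx_type != "hess":
            raise NotImplementedError("only approx_type='hess' is sketched")
        self.n = n
        self.approx_type = approx_type
        self.pairs = deque(maxlen=self.memory)  # most recent (s, y) pairs
        self.scale = 1.0                        # B_0 = scale * I
        self._terms = []                        # cached (a, s.a, y, s.y)

    def update(self, delta_x, delta_grad):
        s = np.asarray(delta_x, dtype=float)
        y = np.asarray(delta_grad, dtype=float)
        sy = s @ y
        # Skip pairs that violate the curvature condition s.y > 0.
        if sy <= self.min_curvature * np.linalg.norm(s) * np.linalg.norm(y):
            return
        self.pairs.append((s.copy(), y.copy()))
        self.scale = (y @ y) / sy
        self._rebuild()

    def _rebuild(self):
        # Recompute a_i = B_i s_i for every stored pair: O(memory^2 * n).
        self._terms = []
        for s, y in self.pairs:
            a = self.dot(s)  # uses only the terms appended so far
            self._terms.append((a, s @ a, y, s @ y))

    def dot(self, p):
        # B p = B_0 p + sum_i [ y_i (y_i.p)/(s_i.y_i) - a_i (a_i.p)/(s_i.a_i) ]
        p = np.asarray(p, dtype=float)
        out = self.scale * p
        for a, sa, y, sy in self._terms:
            out = out + y * (y @ p) / sy - a * (a @ p) / sa
        return out

    def get_matrix(self):
        # Dense n-by-n matrix, for debugging on small problems only.
        return np.column_stack([self.dot(e) for e in np.eye(self.n)])
```

In principle an instance could be passed as `hess=LimitedMemoryBFGS(memory=10)` to `minimize(..., method='trust-constr')`. Whether trust-constr only ever calls `dot` for a given problem, and so avoids a dense matrix, is an assumption that would need checking against the SciPy version in use.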

MarcYin · 2021-08-31T10:41:27Z

Hi, yes, I am using trust-constr, and the existing Hessian update strategies can easily exceed the memory of an ordinary computer when large-scale optimisation is required. Adding an L-BFGS Hessian update would make trust-constr a more general optimisation algorithm. I tried to code it up but have not managed to. I am just wondering whether anyone familiar with the L-BFGS Hessian update could help with this.

andyfaff · 2021-11-12T04:33:25Z

@MarcYin, is there a reason why you can't use L-BFGS-B directly?
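
For reference, using L-BFGS-B directly looks roughly like the sketch below. The Rosenbrock test function and the starting point are stand-ins chosen for illustration, not the problem from this issue; `maxcor` is the L-BFGS-B option that sets how many correction pairs are kept.

```python
import numpy as np
from scipy.optimize import minimize, rosen, rosen_der

# Illustrative starting point; a real problem would supply its own fun/jac.
x0 = np.array([1.3, 0.7, 0.8, 1.9, 1.2])

res = minimize(
    rosen,
    x0,
    jac=rosen_der,
    method="L-BFGS-B",
    options={"maxcor": 10},  # number of stored correction pairs
)
print(res.x, res.nit)
```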

MarcYin · 2021-11-16T18:49:43Z

Hi, it seems that the trust region method is more robust than L-BFGS-B when the optimised variables are highly correlated, according to this comparison: http://fa.bianp.net/blog/2013/numerical-optimizers-for-logistic-regression/.

In my case, I have a large number of variables (more than 300,000) that are highly correlated, so I want to try the trust region method with a limited-memory Hessian update, to see whether it reduces the number of iterations or the total optimisation time.
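
As a rough back-of-the-envelope estimate of the memory involved, assuming float64 storage and a hypothetical limit of 10 stored correction pairs (neither figure is stated in this thread):

```python
n = 300_000          # number of variables mentioned above
bytes_per_float = 8  # assumes float64
m = 10               # assumed number of stored (s, y) pairs

dense_bytes = n * n * bytes_per_float      # full n-by-n Hessian approximation
lbfgs_bytes = 2 * m * n * bytes_per_float  # m pairs of length-n vectors

print(f"dense Hessian: {dense_bytes / 1e9:.0f} GB")  # 720 GB
print(f"L-BFGS pairs:  {lbfgs_bytes / 1e6:.0f} MB")  # 48 MB
```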

paulestano · 2023-08-28T14:51:23Z

Hi, what is the status of this? If the documentation is correct, many of the scipy.optimize optimizers now support HessianUpdateStrategy. There is also now some evidence in the literature that the L-BFGS Hessian update is robust enough to be used in scenarios other than the one in the L-BFGS-B paper (there is, for instance, this paper dealing with this). I think it could be useful to port it into HessianUpdateStrategy. I'd be happy to submit something if that's useful.

dschmitz89 · 2023-08-28T19:52:02Z

Hi again @paulestano, in general you can pick up any issue as long as there is no open PR yet. :)

AtsushiSakai added the scipy.optimize and enhancement labels May 15, 2021

MarcYin closed this as completed Aug 30, 2021

MarcYin reopened this Aug 30, 2021
