
[ML] Avoid zero size steps in L-BFGS #2078


Merged
merged 7 commits into elastic:main on Oct 20, 2021

Conversation

@tveasey (Contributor) commented Oct 19, 2021

Our implementation could generate zero-size steps and fail to converge. There were a couple of different scenarios in which this was possible:

  1. The function gradient was smaller than double epsilon times the norm of the argument vector, in which case x - g(x) = x to working precision (see the sketch after this list).
  2. The gradient function returned zero at the initial point.
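
For scenario 1, here is a minimal standalone C++ sketch (not taken from the ml-cpp sources, and using a scalar x purely for illustration) of how a gradient much smaller than machine epsilon times |x| produces a step that vanishes entirely in double precision:

```cpp
#include <iostream>
#include <limits>

int main() {
    double x = 1.0e8;
    // std::numeric_limits<double>::epsilon() is ~2.2e-16, so epsilon * |x|
    // is ~2.2e-8; a gradient much smaller than this cannot move x at all.
    double gradient = 1.0e-9;
    double xNew = x - gradient;
    // Prints "true": x - g(x) == x to working precision, i.e. a zero-size step.
    std::cout << std::boolalpha << (xNew == x) << std::endl;
    return 0;
}
```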

We were also checking a strict inequality for convergence, which failed to identify that we'd converged when we were taking zero-size steps.
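
The convergence test itself isn't shown in this PR, so the following is only a hedged sketch of the general pattern it describes, with illustrative names: when every step has zero size the observed decrease is exactly zero, and if the tolerance bound also evaluates to zero a strict '<' never fires, whereas '<=' does.

```cpp
#include <cmath>

// Illustrative only; the exact form of the ml-cpp convergence check is an
// assumption here. With zero-size steps fOld == fNew, so the decrease is
// exactly zero. If the bound is also zero (e.g. the minimum value of f is
// zero), then "decrease < bound" is 0 < 0 == false and we never report
// convergence; "decrease <= bound" correctly reports it.
bool converged(double fOld, double fNew, double relativeTolerance) {
    double decrease = std::fabs(fOld - fNew);
    double bound = relativeTolerance * std::fabs(fOld);
    return decrease <= bound;
}
```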

To handle both cases I've added a fallback which performs a more elaborate line search whenever we try to take a step that is too small; this ensures we test steps which are larger than epsilon * x. If the gradient function returns zero we also now try some random probes to see if we can find a direction in which the function decreases (this can be useful when the function is used to find local minima of non-convex functions).
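
The new fallback lives in the ml-cpp line search code and isn't reproduced in this PR; below is a minimal self-contained sketch of just the random-probe part of the idea. All names here (findDescentDirection, scale, numberProbes) are illustrative assumptions rather than the real API.

```cpp
#include <cmath>
#include <cstddef>
#include <functional>
#include <random>
#include <vector>

using TVector = std::vector<double>;

// Hypothetical sketch, not the actual ml-cpp implementation: if the gradient
// at the starting point is exactly zero, try a few random directions of
// length "scale" and return one along which the objective decreases, or an
// empty vector if none is found.
TVector findDescentDirection(const std::function<double(const TVector&)>& f,
                             const TVector& x,
                             double scale,
                             std::size_t numberProbes = 10) {
    std::mt19937 rng{42}; // fixed seed purely for reproducibility of the sketch
    std::normal_distribution<double> normal{0.0, 1.0};
    double fx{f(x)};
    for (std::size_t probe = 0; probe < numberProbes; ++probe) {
        TVector direction(x.size());
        double norm{0.0};
        for (auto& di : direction) {
            di = normal(rng);
            norm += di * di;
        }
        norm = std::sqrt(norm);
        TVector candidate(x);
        for (std::size_t i = 0; i < x.size(); ++i) {
            direction[i] *= scale / norm;
            candidate[i] += direction[i];
        }
        if (f(candidate) < fx) {
            return direction; // found a direction in which f decreases
        }
    }
    return {}; // no decrease found: treat x as a (local) minimum
}
```

In the actual change the probes are only attempted when the gradient at the start point is zero, and the more elaborate line search separately guarantees that the steps it tests are larger than epsilon * x.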

@edsavage (Contributor) left a comment

LGTM

@tveasey merged commit b5dcc59 into elastic:main on Oct 20, 2021
@tveasey deleted the lbfgs-divide-by-zero branch on October 20, 2021 at 17:46
tveasey added a commit to tveasey/ml-cpp-1 that referenced this pull request Oct 28, 2021