Remove extra check for conclusion of Bisection. #57
Conversation
This will fix #53. PTAL @vladimir-ch
// TODO: Should iterate be updated? Maybe find a function where it needs it.
return math.Abs(l.Derivative) < b.GradConst*math.Abs(b.initGrad)
}

return StrongWolfeConditionsMet(l.F, l.Derivative, b.initF, b.initGrad, b.currStep, 0, b.GradConst)
Not related to the proposed changes, but why is the Armijo constant equal to 0?
Strictly, the strong Wolfe conditions only hold if FunConst > 0. However, it can be any number greater than zero, including very small values. In finite precision, when we get close to the minimum there are large regions where the function value is "constant" as far as representable float64 values are concerned. In such a region the line search has to fail, because there are no locations with a smaller function value, even though with infinite precision there would be many steps satisfying the condition. Setting C1 to zero avoids those problems. On a conceptual level, we care about the function not increasing and the gradient getting smaller. Since these methods assume smooth, continuous functions, driving the gradient to zero is the same as finding the location with minimum F. We're happy stopping the line search with a sufficient decrease in the gradient norm.
OK, thanks.
LGTM. You may already know this, but you can close issue #53 by modifying the commit message, as described in https://help.github.com/articles/closing-issues-via-commit-messages/. It is quite useful and leaves an explicit reference to the issue in the commit message. Basically, you would have to do something like this:
The original code for Bisection included a secondary check to the strong Wolfe conditions to see if the optimization was finished. The idea was to help mitigate floating point noise and allow for stronger convergence of the gradient. Unfortunately, all this does is add complexity. The parameters are ad hoc, and trade off floating point noise for actual function modulation. For more complicated functions (especially concurrent ones) the noise will be higher, while other functions may have modulations that are very small. It is impossible to design a tradeoff that is good for all functions. Instead, keep the code simple. This also fixes issues with Bisection and the outer OptLoc disagreeing on the optimum location. Closes #53