Conversation

@timholy commented Jan 24, 2013

I made some improvements to the finite differencing. Here are some brief explanations:

  • Note that the new formula, 2 * sqrt(1e-12) * (1 + norm(x)), is approximately 2e-6 for abs(x) < 1. The formula I used previously, eps(max(1.0, abs(x)))^(1/3), equals roughly 6e-6 for abs(x) < 1, which is not very different. So, contrary to your in-code comment, the small-x behavior of these two actually can't be very different (see the quick check after this list).
  • 1e-12 implicitly assumes Float64, whereas using the type-dispatched version of eps does not. I definitely run optimization with Float32s sometimes (when using huge datasets where all the computations are Float32).
  • The proper power scaling depends on the order of the derivative and on whether centered or forward differencing is used. You'll see that the power in the revised version is 1/2, 1/3, or 1/4 depending on the circumstance; a sketch of the idea follows this list. This is particularly complex for the Hessian.
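
To make the first and third bullets concrete, here is a quick numeric check plus a minimal sketch of a type-aware step-size helper. The name fdstep and its exact form are illustrative assumptions, not the code in this PR:

2 * sqrt(1e-12)    # ≈ 2.0e-6  (the new formula's small-x value)
eps(1.0)^(1/3)     # ≈ 6.06e-6 (the old formula's small-x value)

# eps(T) adapts automatically to Float32 vs. Float64. The exponent p would be
# 1/2 for forward differences, 1/3 for central differences, and 1/4 for the
# second differences used in direct Hessian evaluation.
fdstep(x::T, p) where {T<:AbstractFloat} = oftype(x, eps(T)^p * max(one(T), abs(x)))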

I also removed the dtype parameter from finite_difference_hessian(f, x) because you basically have to use centered differencing for direct Hessian evaluation.
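
For reference, the scalar analogue of the central second difference that direct Hessian evaluation relies on looks like this (a sketch, not the package's exact implementation):

# O(h^2)-accurate central second difference; forward differencing offers no
# comparably accurate direct formula, which is why the dtype option went away.
d2_central(f, x, h) = (f(x + h) - 2 * f(x) + f(x - h)) / h^2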

For your cases in test/finite_difference.jl, I think you'll mostly see improved accuracy. For central differencing the change is quite modest (usually less than one order of magnitude), but for forward differencing and the Hessian it can be dramatic. For example, with the old version

julia> norm(Calculus.finite_difference_hessian(fx, gx, [0.0, 0.0], :central) - [-sin(0.0) 0.0; 0.0 -cos(0.0)])
6.938866148331613e-6

and with the new

julia> norm(Calculus.finite_difference_hessian(fx, [0.0, 0.0]) - [-sin(0.0) 0.0; 0.0 -cos(0.0)])
7.450580596923828e-9

Presumably it should also be more robust on real-world problems, although that remains to be seen in practice.

@timholy commented Jan 24, 2013

I should have added one more point: despite their mathematical equivalence, there can be real-world differences between (f(x+epsilon)-f(x))/epsilon and xp = x+epsilon; (f(xp)-f(x))/(xp-x). The latter uses numbers that are exactly represented by the machine, whereas in the former the gap in the numerator might not equal the gap in the denominator. In particular, this can be measurably important when doing computations in Float32.
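
In code, the distinction looks like this (the function names are mine, purely for illustration):

# Naive: h in the denominator may differ from the gap actually realized
# in the numerator after x + h is rounded.
fd_naive(f, x, h) = (f(x + h) - f(x)) / h

# Exact-step: xp and x are both machine numbers, and xp - x is computed
# exactly, so numerator and denominator see the same gap.
function fd_exact(f, x, h)
    xp = x + h
    (f(xp) - f(x)) / (xp - x)
end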

One concern, however, is whether the compiler might optimize this fine distinction away. CCing @JeffBezanson and @StefanKarpinski for guidance on whether this is a likely problem and, if so, whether there are tricks we can use to keep it from happening.

@JeffBezanson

My guess is it will be fine; we won't optimize that away, and LLVM tends to respect float arithmetic by default.

@timholy commented Jan 24, 2013

> I would recommend sqrt and cbrt over ^ here if that works.

Good catch, thanks!
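
Concretely, that suggestion amounts to, e.g.:

sqrt(eps(Float64))    # rather than eps(Float64)^(1/2)
cbrt(eps(Float64))    # rather than eps(Float64)^(1/3)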

johnmyleswhite added a commit that referenced this pull request Jan 25, 2013
Improved finite differencing
@johnmyleswhite johnmyleswhite merged commit e66cee4 into JuliaMath:master Jan 25, 2013
@johnmyleswhite

Thanks for these patches, Tim. I'm going to merge them now because it's fairly clear that you're more knowledgeable on this topic than I am. At some point, I'll try to circle back and see whether the example I previously used to select between values of epsilon was flawed in some way.

Sorry for the delay. I'm desperately trying to finish my dissertation in the next month, so I'll be slow with things.

@timholy commented Jan 25, 2013

Wow, good luck with the dissertation!

@johnmyleswhite

Thanks. I'll need it!
