Improve accuracy of logistic #94

johnmyleswhite · 2020-05-06T01:28:47Z

Change the logistic function to increase accuracy for subnormal values (e.g. logistic(-740.0)). See my notes for extended details.
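To make the subnormal case concrete, here is a minimal sketch; it is not the code from this PR. It assumes the earlier formulation was of the common `inv(1 + exp(-x))` kind. `naive_logistic` and `alt_logistic` are hypothetical names used only for illustration.

```julia
# Hypothetical names for illustration; neither is the implementation in this PR.
naive_logistic(x) = inv(one(x) + exp(-x))      # exp(-x) overflows to Inf for very negative x
alt_logistic(x) = exp(x) / (one(x) + exp(x))   # exp(x) underflows gradually instead

x = -740.0
println(naive_logistic(x))               # inv(Inf) flushes to zero
println(alt_logistic(x))                 # a tiny positive value
println(issubnormal(alt_logistic(x)))    # whether that value is subnormal
```

On its own, `alt_logistic` breaks for large positive inputs, where `exp(x)` overflows and `Inf / Inf` is `NaN`, so any real implementation along these lines needs a guard on that side.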

quinnj · 2020-05-06T03:40:01Z

Impressive notes!

johnmyleswhite · 2020-05-06T10:00:10Z

Thanks! Ok to merge this?

andreasnoack · 2020-05-06T11:39:03Z

Calling float-specific functions like nextfloat and prevfloat makes the function less generic, as can be seen from JuliaDiff/ForwardDiff.jl#453. I also tried this version with ArbFloats, and it fails there too. I'd suggest restricting the new version to Union{Float16,Float32,Float64} and keeping the old version for Real.
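The suggested split can be sketched with two methods and ordinary dispatch. This is only an illustration under stated assumptions: `sketch_logistic` and `generic_logistic` are hypothetical names, the float-specific body is a placeholder rather than the PR's implementation, and `Rational` stands in for a non-float `Real` so that no extra packages are needed.

```julia
# Hypothetical names; the method bodies are placeholders, not the code in this PR.
const IEEEFloats = Union{Float16, Float32, Float64}

# Generic fallback: only needs exp, one, and inv, so it works for any Real that defines them.
generic_logistic(x::Real) = inv(one(x) + exp(-x))

# Float-specific method: free to rely on IEEE behaviour (and, in a real
# implementation, on helpers such as nextfloat/prevfloat).
function sketch_logistic(x::T) where {T<:IEEEFloats}
    e = exp(x)
    # exp overflows to Inf for large x, and Inf / Inf would be NaN
    isinf(e) ? one(T) : e / (one(T) + e)
end

# Everything else falls back to the generic formulation.
sketch_logistic(x::Real) = generic_logistic(x)

println(sketch_logistic(0.0))       # dispatches to the float-specific method
println(sketch_logistic(1 // 2))    # dispatches to the generic fallback
println(hasmethod(nextfloat, Tuple{Rational{Int}}))  # shows why a Rational cannot take the float path
```

Julia always picks the most specific applicable method, so the `Real` fallback never shadows the method for the three IEEE types.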

johnmyleswhite · 2020-05-06T13:06:55Z

Ok, I'll make that change.

johnmyleswhite · 2020-05-06T13:08:03Z

Do you have an example of it failing with ArbFloats, so I can add a test to make sure future changes don't make this function less generic by accident?

andreasnoack · 2020-05-06T14:03:58Z

> Do you have an example of failing with ArbFloats

I used logistic(ArbFloat(1.1)).

KristofferC · 2020-05-07T15:27:07Z

src/basicfuns.jl

+function logistic(x::Real)
+    e = exp(x)
+    lower, upper = _logistic_bounds(x)
+    ifelse(


This seems like quite a convoluted way of writing it, compared with a plain if/elseif/else:

function logistic(x::Real)
    e = exp(x)
    lower, upper = _logistic_bounds(x)
    if x < lower
        return zero(x)
    elseif x > upper
        return one(x)
    else
        return e / (one(x) + e)
    end
end

quinnj · 2020-05-07

Probably worth checking @code_llvm; it's true that LLVM does the if-else => ifelse conversion automatically pretty well these days.

johnmyleswhite · 2020-05-08

They seem to produce different output at the @code_llvm and @code_native stages, consistent with my earlier experience that ifelse generates an LLVM select and if generates an LLVM br. However, benchmarking suggests their performance is very similar in both scalar applications and broadcasting: https://gist.github.com/johnmyleswhite/548f4eb18a028a237d52ae06811ca33c

Note that the benchmarks also suggest that changing from the old implementation to the new one moves the slowdown caused by processing subnormals to the mirror-image input: the old formulation was fast for z = -710.0 and slow for z = +710.0, whereas the new formulation is slow for z = -710.0 and fast for z = +710.0.
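One way to see where the mirror image comes from is to look at the intermediate value each formulation computes. The sketch below assumes the old form evaluates `exp(-z)` and the new form evaluates `exp(z)`, which matches the snippet quoted in review but is an assumption about the old code; the function names are hypothetical.

```julia
# Hypothetical helpers: the intermediate each formulation is assumed to compute.
old_intermediate(z) = exp(-z)   # assumed inv(1 + exp(-z)) style
new_intermediate(z) = exp(z)    # exp(z) / (1 + exp(z)) style

for z in (-710.0, 710.0)
    println("z = ", z,
            ": old intermediate subnormal? ", issubnormal(old_intermediate(z)),
            ", new intermediate subnormal? ", issubnormal(new_intermediate(z)))
end
```

Arithmetic on subnormal values is slow on many CPUs, so whichever side yields a subnormal intermediate is plausibly the slow one, while the opposite side overflows to `Inf` and stays cheap.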

Happy to choose whichever form people prefer. I tend to err on the side of assuming select is more likely to play well with SIMD, but that doesn't seem to matter for the benchmarks I've run.
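For anyone who wants to repeat the comparison, here is a minimal sketch of the two forms side by side. `_logistic_bounds` is not shown in this thread, so `sketch_bounds` is a hypothetical stand-in; its constants are borrowed from the evaluation ranges quoted later in the conversation and may not match the real bounds.

```julia
# Hypothetical stand-in for _logistic_bounds; the constants are an assumption.
sketch_bounds(::Float64) = (-744.4400719213812, 36.7368005696771)

# Branchless form: ifelse evaluates all of its arguments and then selects one.
function logistic_select(x::Float64)
    e = exp(x)
    lower, upper = sketch_bounds(x)
    ifelse(x < lower, zero(x), ifelse(x > upper, one(x), e / (one(x) + e)))
end

# Branching form: ordinary control flow.
function logistic_branch(x::Float64)
    e = exp(x)
    lower, upper = sketch_bounds(x)
    if x < lower
        return zero(x)
    elseif x > upper
        return one(x)
    else
        return e / (one(x) + e)
    end
end

# The two forms should return identical results on finite inputs.
xs = -800.0:0.5:800.0
println(all(logistic_select(x) == logistic_branch(x) for x in xs))
```

The generated code can then be compared with `@code_llvm logistic_select(0.0)` and `@code_llvm logistic_branch(0.0)`, which are available in the REPL or after `using InteractiveUtils`.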

KristofferC · 2020-05-09

> Happy to choose whichever form people prefer.

Personally, I don't really care. I just felt the if version was easier to read, and I couldn't find a benchmark where it was slower. I guess if the branch is predicted correctly, the branching version is faster, and otherwise the branchless one is faster? 🤷‍♂️

johnmyleswhite · 2020-05-09

Ok, I think in the absence of useful evidence either way, I'll maintain my existing superstitious faith in the importance of the LLVM select instruction. Revising in the future would be trivial.

johnmyleswhite · 2020-05-08T17:22:09Z

Made the changes requested by @andreasnoack. Didn't add a test, to avoid adding more dependencies.

johnmyleswhite · 2020-05-09T11:59:43Z

Any objections to merging? I have the permissions to do it for myself, but given my general disengagement from the repo, it doesn't feel fair for me to use those permissions without approval from those who are more engaged than I am.

cossio · 2020-05-15T23:09:55Z

Can this be simplified to this?

function logistic(x::Real)
    t = exp(-abs(x))
    ifelse(x ≥ 0, inv(one(t) + t), t / (one(t) + t))
end

This simple version passes all the tests given in this PR. If there is no test that can distinguish the two, then I guess I can make a PR with this simpler version (which automatically supports e.g. BigFloats). What do you think?
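The suggestion works because of a standard identity: with t = exp(-|x|), the logistic is 1/(1 + t) for x ≥ 0, and for x < 0 we have t = exp(x), so t/(1 + t) = exp(x)/(1 + exp(x)) = 1/(1 + exp(-x)). A small sketch for trying it on a few input types follows; `simple_logistic` is a hypothetical name chosen to avoid clashing with the package's `logistic`.

```julia
# Same body as the suggestion above, under a hypothetical name.
function simple_logistic(x::Real)
    t = exp(-abs(x))   # the argument is never positive, so exp cannot overflow
    ifelse(x ≥ 0, inv(one(t) + t), t / (one(t) + t))
end

println(simple_logistic(0.0))
println(simple_logistic(-740.0))     # stays positive instead of flushing to zero
println(simple_logistic(800.0))
println(simple_logistic(big"1.1"))   # BigFloat input works without a special method
println(simple_logistic(1 // 2))     # so does a Rational
```

Because the body only uses `exp`, `abs`, `one`, `inv`, and a comparison, it needs nothing float-specific, which is what makes it generic.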

cossio · 2020-05-15T23:28:14Z

Using the (very nice!) function evaluate_errors, I find for the implementation I suggest:

julia> evaluate_errors(logistic, range(-744.4400719213812, -log(floatmax(Float64)), length=10_000))
Frequency of Exact Results:       0.9964
Average Error:                    0.0
Maximum Error:                    5.0e-324
Average Number of Incorrect Bits: 0.0079

julia> evaluate_errors(logistic, range(-log(floatmax(Float64)), -log(eps(1.0)), length=10_000))
Frequency of Exact Results:       0.8654
Average Error:                    2.727970920400209e-18
Maximum Error:                    1.1102230246251565e-16
Average Number of Incorrect Bits: 0.2609

julia> evaluate_errors(logistic, range(-log(eps(1.0)), 36.7368005696771, length=10_000))
Frequency of Exact Results:       0.415
Average Error:                    6.494804694057165e-17
Maximum Error:                    1.1102230246251565e-16
Average Number of Incorrect Bits: 0.585

In the last two regions the implementation of this PR is better. I'm gonna leave this comment here anyway for future reference.
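The `evaluate_errors` function used above comes from the PR author's notes and is not reproduced in this thread. For readers who want a rough equivalent, the sketch below compares a candidate against a `BigFloat` reference rounded to `Float64`. The names `reference_logistic` and `sketch_error_summary` are hypothetical, and the statistics are a simplified subset of those reported above, so numbers from it need not match.

```julia
# Hypothetical helpers; this is not the evaluate_errors function from the author's notes.

# Reference value: evaluate in BigFloat, then round once to Float64.
reference_logistic(x::Float64) = Float64(inv(one(BigFloat) + exp(-big(x))))

# Summarise how often f matches the reference exactly and the largest absolute error.
function sketch_error_summary(f, xs)
    exact = 0
    max_err = 0.0
    for x in xs
        err = abs(f(x) - reference_logistic(x))
        exact += (err == 0.0)
        max_err = max(max_err, err)
    end
    (frequency_exact = exact / length(xs), max_error = max_err)
end

# Example candidate: the textbook formulation.
candidate(x) = inv(1 + exp(-x))
println(sketch_error_summary(candidate, range(-5.0, 5.0, length=101)))
```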

KristofferC reviewed May 7, 2020

Improve accuracy of logistic

307c314

andreasnoack merged commit e9c2bb2 into JuliaStats:master May 10, 2020
