invlink for simplex distributions #17

trappmartin · 2019-02-22T17:33:05Z

Hi,

I've played around with the invlink function for simplex distributions and checked the correctness of the current implementation against the code used in Stan. (https://github.com/stan-dev/math/blob/502487c511594ccb93eb979df5b8fe163becb417/stan/math/rev/mat/fun/simplex_constrain.hpp)

The reimplementation I used is:

function invlink_simplex(y::AbstractVector{T}) where {T<:Real}
    N = length(y)
    stick = one(T)
    x = zeros(T, length(y))
    for k in 1:(N-1)
        lk = map(T, log(N - k))
        z = logistic(y[k] - lk)
        x[k] = stick * z
        stick -= x[k]
    end
    x[N] = stick
    return x
end

and I found quite a bit of speed improvement if this implementation is used...

K = 1000
y = rand(K)
d = Dirichlet(K, 1.0)

@btime invlink_simplex(y);
@btime invlink(d, y);
  77.077 μs (1 allocation: 7.94 KiB)
  369.305 μs (1 allocation: 7.94 KiB)

Should I make a PR for this or do we think there are stability issues with the code?

The text was updated successfully, but these errors were encountered:

cpfiffer · 2019-02-22T18:07:19Z

It looks to me like it's stable, but Mohamed the Master of Type Stability probably has a far better eye for that than me.

mohamed82008 · 2019-02-22T21:51:18Z

Hi @trappmartin !

Your implementation looks type stable but I think we will run into numerical issues when inverting this. The main numerical stability problems were from link, invlink was just modified accordingly to be its inverse. I do think however there is a performance issue in Bijectors. I looked into it, and it seems the @debug sentences are causing some type instability. When removing them, and benchmarking I get:

julia> @btime invlink_simplex($y);
39.384 μs (1 allocation: 7.94 KiB)

julia> @btime invlink($d, $y);
43.395 μs (1 allocation: 7.94 KiB)

I assume the extra time is from the extra epsilon arithmetic that is done. The real difference is probably more when you use @inbounds. So I guess what it boils down to is if we are willing to throw away the inverse property between link and invlink, then we can gain some extra performance with your implementation of invlink.

mohamed82008 · 2019-02-22T21:59:26Z

I will also make a PR to fix the @debug issues.

trappmartin · 2019-02-23T00:06:54Z

Sounds good. I wasn’t aiming for performance with my implementation but was surprised that the invlink is so much slower.

trappmartin · 2019-02-23T00:13:45Z

Maybe as a side note, it might be good to have internal functions that do not require to pass a distribution object. This would allow a knowledgeable user to use the transformations without instantiating a Distributions object. Similar to the StatsFuns package.

trappmartin · 2019-02-23T00:19:34Z

One last note. I think the invlink should be at least as fast as my code as I don’t optimise anything and invlink looks rather tuned. Your test shows that even after removing the @debug the invlink is still slower. I guess we should think about improving the code as this gets called frequently during sampling.

mohamed82008 · 2019-02-23T22:59:12Z

@trappmartin The slowdown is from the epsilons to make invlink a proper inverse of the numerically stable link. Are you proposing making invlink as fast as possible even if it is not a good inverse of the numerically stable link?

trappmartin · 2019-02-24T11:04:19Z

No, I propose we try to get to code faster while keeping it numerical stable.

mohamed82008 · 2019-02-24T13:27:55Z

Done, #20. The main improvement came from replacing log(1/x) with -log(x) thus eliminating an unnecessary division. New benchmarks:

julia> @btime invlink_simplex($y);
39.384 μs (1 allocation: 7.94 KiB)

julia> @btime invlink($d, $y);
39.020 μs (1 allocation: 7.94 KiB)

mohamed82008 mentioned this issue Feb 22, 2019

@debug causing type instability and performance hit #18

Closed

mohamed82008 mentioned this issue Feb 24, 2019

Squeeze some more performance out #20

Merged

mohamed82008 closed this as completed in #20 Feb 25, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

invlink for simplex distributions #17

invlink for simplex distributions #17

trappmartin commented Feb 22, 2019

cpfiffer commented Feb 22, 2019 •

edited

Loading

mohamed82008 commented Feb 22, 2019 •

edited

Loading

mohamed82008 commented Feb 22, 2019

trappmartin commented Feb 23, 2019 •

edited

Loading

trappmartin commented Feb 23, 2019

trappmartin commented Feb 23, 2019 •

edited

Loading

mohamed82008 commented Feb 23, 2019

trappmartin commented Feb 24, 2019

mohamed82008 commented Feb 24, 2019 •

edited

Loading

invlink for simplex distributions #17

invlink for simplex distributions #17

Comments

trappmartin commented Feb 22, 2019

cpfiffer commented Feb 22, 2019 • edited Loading

mohamed82008 commented Feb 22, 2019 • edited Loading

mohamed82008 commented Feb 22, 2019

trappmartin commented Feb 23, 2019 • edited Loading

trappmartin commented Feb 23, 2019

trappmartin commented Feb 23, 2019 • edited Loading

mohamed82008 commented Feb 23, 2019

trappmartin commented Feb 24, 2019

mohamed82008 commented Feb 24, 2019 • edited Loading

cpfiffer commented Feb 22, 2019 •

edited

Loading

mohamed82008 commented Feb 22, 2019 •

edited

Loading

trappmartin commented Feb 23, 2019 •

edited

Loading

trappmartin commented Feb 23, 2019 •

edited

Loading

mohamed82008 commented Feb 24, 2019 •

edited

Loading