Proposal: generalise `logsumexp` slightly. #69

willtebbutt · 2019-03-07T18:53:48Z

Will add thorough testing if this is deemed reasonable to include. This adds a dims keyword argument to logsumexp that behaves in the same way as the dims argument of sum and maximum.

e.g.

```julia
B = randn(5, 4)
A = logsumexp(B; dims=1)
```

returns a 1 x 4 matrix A, where A[1, k] = logsumexp(B[:, k]).
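For readers who want to check the intended semantics without this branch, here is a minimal sketch. The names `logsumexp_dims` and `logsumexp_vec` are hypothetical stand-ins written in plain Julia; they are not the StatsFuns implementation.

```julia
# Hypothetical stand-in for the proposed method; not the StatsFuns code.
function logsumexp_dims(X::AbstractArray{<:Real}; dims)
    u = maximum(X; dims=dims)   # per-slice maximum, subtracted for numerical stability
    return log.(sum(exp.(X .- u); dims=dims)) .+ u
end

# Reference reduction over a single vector (also a hypothetical helper).
logsumexp_vec(x) = (m = maximum(x); m + log(sum(exp.(x .- m))))

B = randn(5, 4)
A = logsumexp_dims(B; dims=1)

size(A)                                              # (1, 4)
all(A[1, k] ≈ logsumexp_vec(B[:, k]) for k in 1:4)   # true
```

The result keeps the reduced dimension with length 1, which is the same convention that sum and maximum follow.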

ararslan · 2019-03-19T16:49:22Z

src/basicfuns.jl

@@ -190,7 +190,13 @@ function logsumexp(X)
    isempty(X) && return log(sum(X))
    reduce(logaddexp, X)
 end
-function logsumexp(X::AbstractArray{T}) where {T<:Real}
+function logsumexp(X::AbstractArray{T}; dims=nothing) where {T<:Real}
+    dims isa Nothing && return _logsumexp(X)


I think it's more conventional to make this something like

Suggested change

-    dims isa Nothing && return _logsumexp(X)
+    dims === nothing && return _logsumexp(X)

though it doesn't really matter; they're entirely equivalent and should perform the same. Just figured I'd note it.

cossio · 2019-05-21

isnothing(dims)?

ararslan · 2019-05-21

=== is special-cased in the compiler and is generally more efficient for checking nothing (and indeed, even missing).

cossio · 2019-05-21

I think isnothing is also elided since it's dealt with at dispatch time, and it was the recommended way to check for nothingness? The only thing is that it requires Julia 1.1.

ararslan · 2019-05-21

> it was the recommended way to check for nothingness?

How did you come to that conclusion?

> I think isnothing is also elided

It is not, see JuliaLang/julia#27681.

cossio · 2019-05-21

> How did you come to that conclusion?

Well, it's an exported function named isnothing.
In reality I wasn't aware of that issue. Thanks for pointing it out.
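To make the point of this thread concrete, here is a small sketch of the `dims=nothing` default pattern. The helper `describe_dims` is hypothetical and exists only for illustration; the loop checks that the two spellings from the review agree on a few sample values.

```julia
# Hypothetical helper illustrating the `dims=nothing` default discussed above.
describe_dims(dims) = dims === nothing ? "reduce over everything" : "reduce over dims=$(dims)"

describe_dims(nothing)   # "reduce over everything"
describe_dims(1)         # "reduce over dims=1"

# `isa Nothing` and `=== nothing` give the same answer for each sample value.
for d in (nothing, 1, (1, 2), Colon())
    @assert (d isa Nothing) == (d === nothing)
end
```

`isnothing` is left out of the sketch because, as noted above, it requires Julia 1.1.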

ararslan · 2019-03-19T16:50:31Z

This certainly looks like a worthwhile feature to me!

andreasnoack

Just two minor comments.

andreasnoack · 2019-03-19T18:08:43Z

src/basicfuns.jl

-function logsumexp(X::AbstractArray{T}) where {T<:Real}
+function logsumexp(X::AbstractArray{T}; dims=nothing) where {T<:Real}
+    dims isa Nothing && return _logsumexp(X)
+    isempty(X) && return log(zero(T))


Isn't this type unstable? I think you'd still have to consider dimensions as in

```julia
julia> sum(zeros(0, 0), dims=1)
1×0 Array{Float64,2}
```
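One possible way to address this, sketched under the assumption that the empty case should mirror what sum does: return an array of the reduced shape instead of a scalar. The name `logsumexp_dims` is hypothetical and this is not the code in the PR.

```julia
# Hypothetical sketch, not the PR's code: always return an array, even for empty input.
function logsumexp_dims(X::AbstractArray{T}; dims) where {T<:Real}
    # The sum over an empty slice is zero, so each entry is log(0) = -Inf,
    # and the shape matches what `sum(X; dims=dims)` would produce.
    isempty(X) && return log.(sum(X; dims=dims))
    u = maximum(X; dims=dims)
    return log.(sum(exp.(X .- u); dims=dims)) .+ u
end

size(logsumexp_dims(zeros(0, 0); dims=1))   # (1, 0)
logsumexp_dims(zeros(0, 3); dims=1)         # 1×3 matrix filled with -Inf
```

Both branches then return an array with the same number of dimensions as the input, so the return type no longer depends on whether X is empty.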

andreasnoack · 2019-03-19T18:09:21Z

src/basicfuns.jl

+    dims isa Nothing && return _logsumexp(X)
+    isempty(X) && return log(zero(T))
+    u = maximum(X; dims=dims)
+    return log.(sum(exp.(X .- u); dims=dims)) .+ u


Maybe reuse the array created by sum(exp.(X .- u); dims=dims) to avoid a temporary.
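A minimal sketch of what that could look like, assuming the array returned by sum is not needed afterwards. The function name `logsumexp_dims` is hypothetical.

```julia
# Hypothetical sketch of the suggestion above; `logsumexp_dims` is an illustrative name.
function logsumexp_dims(X::AbstractArray{<:Real}; dims)
    u = maximum(X; dims=dims)
    out = sum(exp.(X .- u); dims=dims)   # freshly allocated, so it is safe to overwrite
    out .= log.(out) .+ u                # fused in-place broadcast, no extra result array
    return out
end

X = randn(6, 3)
logsumexp_dims(X; dims=2) ≈ log.(sum(exp.(X); dims=2))   # true
```

Note that `exp.(X .- u)` still allocates a full-size temporary here; the sketch only removes the allocation of the final result.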

tpapp · 2019-03-20T11:04:14Z

Is this something that cannot be addressed with mapslices and similar?

willtebbutt · 2019-03-20T11:29:25Z

Are there not performance issues associated with mapslices and related?

andreasnoack · 2019-03-20T12:29:34Z

> Are there not performance issues associated with mapslices and related?

Unfortunately, there are. At least there used to be. Could you benchmark it?

willtebbutt · 2019-03-20T14:12:33Z

Here are some timings:

```julia
using BenchmarkTools, StatsFuns

N1, M1 = 1000, 2;
x = randn(N1, M1);
@btime logsumexp($x);
  20.012 μs (0 allocations: 0 bytes)
@btime logsumexp($x; dims=1);
  25.779 μs (13 allocations: 16.23 KiB)
@btime mapslices(logsumexp, $x; dims=1);
  25.318 μs (56 allocations: 10.53 KiB)
@btime logsumexp($x; dims=2);
  33.160 μs (31 allocations: 40.33 KiB)
@btime mapslices(logsumexp, $x; dims=2);
  401.447 μs (9507 allocations: 236.17 KiB)

# Second round of tests with different memory layout.
x2 = Matrix(x');
@btime logsumexp($x2);
  19.984 μs (0 allocations: 0 bytes)
@btime logsumexp($x2; dims=2);
  31.323 μs (31 allocations: 16.80 KiB)
@btime mapslices(logsumexp, $x2; dims=2);
  25.503 μs (56 allocations: 10.53 KiB)
@btime logsumexp($x2; dims=1);
  38.780 μs (13 allocations: 39.77 KiB)
@btime mapslices(logsumexp, $x2; dims=1);
  373.496 μs (9507 allocations: 236.17 KiB)
```

When each slice is small, the overhead associated with mapslices is rather large.

tpapp · 2019-03-20T14:47:18Z

Thanks for the benchmarks. This is because mapslices is poorly implemented. Cf.

```julia
using BenchmarkTools, StatsFuns, JuliennedArrays

N1, M1 = 1000, 2;
x = randn(N1, M1);
@btime mapslices(logsumexp, $x; dims=2);
@btime map(logsumexp, Slices($x, False(), True()));
```

with

```julia
julia> @btime mapslices(logsumexp, $x; dims=2);
  704.918 μs (9507 allocations: 236.17 KiB)

julia> @btime map(logsumexp, Slices($x, False(), True()));
  52.297 μs (1003 allocations: 54.84 KiB)
```

I think it would be better to propagate the use of sane slice iteration constructs instead of adding a (; dims = ...) method to everything.
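As a rough illustration of the slice-iteration style, here is a sketch that uses only Base instead of JuliennedArrays. It assumes Julia 1.1 or later, where `eachrow` and `eachcol` are available, and `logsumexp_vec` is a hypothetical stand-in for `StatsFuns.logsumexp` applied to one slice. No timings are claimed for it.

```julia
# Assumes Julia ≥ 1.1, where Base provides `eachrow` and `eachcol`.
# `logsumexp_vec` is a hypothetical stand-in for `StatsFuns.logsumexp` on one slice.
logsumexp_vec(x) = (m = maximum(x); m + log(sum(exp.(x .- m))))

x = randn(1000, 2)

per_row = map(logsumexp_vec, eachrow(x))   # 1000-element Vector, one value per row
per_col = map(logsumexp_vec, eachcol(x))   # 2-element Vector, one value per column
```

Unlike a dims keyword, this returns a plain vector and does not keep the reduced dimension, so callers who need a 1000×1 or 1×2 matrix would have to reshape the result.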

willtebbutt · 2019-06-06T13:16:12Z

Closing for now as this has gone stale and a more generic mechanism of the form @tpapp suggests would probably be better. Happy to re-open if anyone feels strongly about this.

Basic implementation. Poor testing.

b13b1c7
