Very slow loss aggregation #172

MilesCranmer · 2023-08-24T07:31:25Z

The mean and sum implementations in this package are extremely slow as they rely on generators. Could this be changed to a faster implementation? For example, here is how the current implementation would compute sum-squared-error:

square(x) = x ^ 2

# example of generator-based approach:
function sse(outputs, targets)
    sum(square(ŷ - y) for (ŷ, y) in zip(outputs, targets))
end

(i.e., like this code)

which gives us the following time:

julia> @btime sse(outputs, targets) setup=(outputs=randn(100_000); targets=randn(100_000))
  92.833 μs (0 allocations: 0 bytes)

but if we change this to an approach using sum(<function>, <indices>), it's much faster:

function sse2(outputs, targets)
    sum(i -> square(outputs[i] - targets[i]), eachindex(outputs, targets))
end

julia> @btime sse2(outputs, targets) setup=(outputs=randn(100_000); targets=randn(100_000))
  26.708 μs (0 allocations: 0 bytes)

which is a 3.5x speedup.

Could this be implemented as the default loss calculation? I thought this was the method that used to be used. Perhaps it got changed in the recent refactoring?

The text was updated successfully, but these errors were encountered:

juliohm · 2023-08-24T08:12:33Z

I don't remember using this feature anywhere else in downstream packages. If you can submit a PR dropping support for generators, we can review and merge it. Em qui., 24 de ago. de 2023 04:31, Miles Cranmer ***@***.***> escreveu:

…

The mean and sum implementations in this package are extremely slow as they rely on generators. Could this be changed to a faster implementation? For example, here is how the current implementation would compute sum-squared-error: square(x) = x ^ 2 # example of generator-based approach:function sse(outputs, targets) sum(square(ŷ - y) for (ŷ, y) in zip(outputs, targets))end (i.e., like this code <https://github.com/JuliaML/LossFunctions.jl/blob/7318c582874988fcc325464831aa2ce94eb36ff2/src/losses.jl#L37-L39> ) which gives us the following time: julia> @Btime sse(outputs, targets) setup=(outputs=randn(100_000); targets=randn(100_000)) 92.833 μs (0 allocations: 0 bytes) but if we change this to an approach using sum(<function>, <indices>), it's much faster: function sse2(outputs, targets) sum(i -> square(outputs[i] - targets[i]), eachindex(outputs, targets))end julia> @Btime sse2(outputs, targets) setup=(outputs=randn(100_000); targets=randn(100_000)) 26.708 μs (0 allocations: 0 bytes) which is a 3.5x speedup. Could this be implemented as the default loss calculation? I thought this was the method that used to be used. Perhaps it got changed in the recent refactoring? — Reply to this email directly, view it on GitHub <#172>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAZQW3KI56D2NUGNSS7FNQLXW37NRANCNFSM6AAAAAA34SKW4E> . You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

MilesCranmer · 2023-08-24T08:48:08Z

I don't remember using this feature anywhere else in downstream packages.

Sorry I don't think I was clear. This is the main sum function in LossFunctions.jl which is the main interface to the package. It seems to be using a generator (e.g., sum(f(x) for x in X)) for summing, which is very slow:

LossFunctions.jl/src/losses.jl

Lines 37 to 39 in 7318c58

    
           function sum(loss::SupervisedLoss, outputs, targets) 
        
             sum(loss(ŷ, y) for (ŷ, y) in zip(outputs, targets)) 
        
           end

If you can submit a PR dropping support for generators, we can review and merge it.

Sure!

MilesCranmer · 2023-08-24T08:49:20Z

Implemented in #173

juliohm · 2023-08-24T09:59:58Z

Yes, i meant using the version with generators in downstream packages that depend on LossFunctions.jl Em qui., 24 de ago. de 2023 05:49, Miles Cranmer ***@***.***> escreveu:

…

Implemented in #173 <#173> — Reply to this email directly, view it on GitHub <#172 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAZQW3OTKEZIWOCK55VHWWTXW4IRZANCNFSM6AAAAAA34SKW4E> . You are receiving this because you commented.Message ID: ***@***.***>

MilesCranmer · 2023-08-24T12:09:51Z

Got it, thanks!

MilesCranmer mentioned this issue Aug 24, 2023

Improve aggregation speeds by using eachindex instead of iterate #173

Merged

juliohm closed this as completed in #173 Aug 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Very slow loss aggregation #172

Very slow loss aggregation #172

MilesCranmer commented Aug 24, 2023

juliohm commented Aug 24, 2023 via email

MilesCranmer commented Aug 24, 2023

MilesCranmer commented Aug 24, 2023

juliohm commented Aug 24, 2023 via email

MilesCranmer commented Aug 24, 2023

Very slow loss aggregation #172

Very slow loss aggregation #172

Comments

MilesCranmer commented Aug 24, 2023

juliohm commented Aug 24, 2023 via email

MilesCranmer commented Aug 24, 2023

MilesCranmer commented Aug 24, 2023

juliohm commented Aug 24, 2023 via email

MilesCranmer commented Aug 24, 2023