Cache initialization of default algorithms #442

SKopecz · 2023-12-08T17:22:00Z

The following example demonstrates for StaticArrays that DefaultAlgorithmChoice.LUFactorization is significantly slower than explicitly requesting an LUFactorization. The main reason for this is that in

LinearSolve.jl/src/default.jl

Line 296 in e60a10a

caches = map(first.(EnumX.symbol_map(DefaultAlgorithmChoice.T))) do alg

caches for all possible default algorithms are initialized and not just for the algorithm actually used. The flame graph below shows that over 40% of the time is unnecessarily spent in init_cacheval initializing GMRES, Cholesky-Factorization, SVD and so on. The problem also exists with other types of A and b, albeit to a lesser extent.

It would be great if only the cache for the actual algorithm could be initialized.

I would also be interested to know why a distinction is made between DefaultAlgorithmChoice.LUFactorization and LUFactorization at all.

julia> using StaticArrays, LinearSolve, LinearAlgebra, BenchmarkTools

julia> A = @SMatrix [1.0 2.0; 3.0 4.0];
julia> b = @SVector [3.0; 7.0];
julia> prob = LinearProblem(A,b);

julia> sol1 = solve(prob);
julia> sol1.alg
LinearSolve.DefaultLinearSolver(LinearSolve.DefaultAlgorithmChoice.LUFactorization)

julia> solver = LUFactorization();
julia> sol2 = solve(prob, solver);
julia> sol2.alg
LUFactorization{RowMaximum}(RowMaximum())

julia> @btime solve($prob)
  3.637 μs (41 allocations: 4.27 KiB)
retcode: Default
u: 2-element Vector{Float64}:
 0.9999999999999997
 1.0000000000000002

julia> @btime solve($prob, $solver)
  84.237 ns (2 allocations: 288 bytes)
retcode: Default
u: 2-element Vector{Float64}:
 0.9999999999999997
 1.0000000000000002

CC @ranocha

The text was updated successfully, but these errors were encountered:

ChrisRackauckas · 2023-12-09T16:14:21Z

It's done in that way because with arrays it would be type-unstable to choose the method based on size and such. But with StaticArrays, you have the information. I think we could specialize SArray/MArray directly to LUFactorization/SVDFactorization based on size @avik-pal ?

avik-pal · 2023-12-09T19:09:44Z

Ah that is the reason to create a cache for all types. Yeah I will get it fixed.

avik-pal · 2023-12-14T00:23:14Z

Should be fixed now

SKopecz · 2023-12-22T14:39:57Z

Great, thanks!

avik-pal closed this as completed Dec 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache initialization of default algorithms #442

Cache initialization of default algorithms #442

SKopecz commented Dec 8, 2023

ChrisRackauckas commented Dec 9, 2023 •

edited

avik-pal commented Dec 9, 2023

avik-pal commented Dec 14, 2023

SKopecz commented Dec 22, 2023

Cache initialization of default algorithms #442

Cache initialization of default algorithms #442

Comments

SKopecz commented Dec 8, 2023

ChrisRackauckas commented Dec 9, 2023 • edited

avik-pal commented Dec 9, 2023

avik-pal commented Dec 14, 2023

SKopecz commented Dec 22, 2023

ChrisRackauckas commented Dec 9, 2023 •

edited