
performance improvement for Symmetric/Hermitian of sparse times vector #30018

Merged
merged 32 commits into from
Jan 18, 2019
Conversation

KlausC
Contributor

@KlausC KlausC commented Nov 13, 2018

If A is a sparse matrix (Float64, 10_000x10_000, nnz=10^6) and B is a dense vector, Symmetric(A) * B currently takes 3 s to execute. This PR reduces the execution time to 1.8 ms, a factor of more than 1500!
For element type BigFloat the times are 19.8 s and 360 ms, still a factor of 55.
The same holds for all variants of Symmetric/Hermitian, :U/:L, and Real/Complex.

Benchmark values current situation:

julia> n = 10000
10000

julia> Random.seed!(0); A = sprandn(n, n, 0.01); nnz(A)
1000598

julia> B = randn(n);

julia> Asym = Symmetric(A); As = sparse(Asym);

julia> @benchmark $Asym * $B
BenchmarkTools.Trial: 
  memory estimate:  390.77 KiB
  allocs estimate:  10004
  --------------
  minimum time:     3.055 s (0.00% GC)
  median time:      3.082 s (0.00% GC)
  mean time:        3.082 s (0.00% GC)
  maximum time:     3.110 s (0.00% GC)
  --------------
  samples:          2
  evals/sample:     1

Benchmark with this PR included:

julia> @benchmark $Asym * $B
BenchmarkTools.Trial: 
  memory estimate:  78.20 KiB
  allocs estimate:  2
  --------------
  minimum time:     1.826 ms (0.00% GC)
  median time:      1.858 ms (0.00% GC)
  mean time:        1.890 ms (0.09% GC)
  maximum time:     2.566 ms (0.00% GC)
  --------------
  samples:          2633
  evals/sample:     1
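For context (not part of the PR text): the speedup comes from multiplying in O(nnz) time by visiting only the stored entries of one triangle and mirroring each off-diagonal entry, instead of falling back to the generic O(n²) AbstractArray matvec. A minimal sketch of such a kernel for the :U case, assuming sorted row indices (which SparseMatrixCSC guarantees); `symmul` is a hypothetical name, and this is an illustration, not the PR's actual implementation:

```julia
using LinearAlgebra, SparseArrays

# Sketch of an O(nnz) matvec for Symmetric(A, :U) with A::SparseMatrixCSC.
# Illustrative only; the real kernel must also cover :L and Hermitian.
function symmul(A::SparseMatrixCSC, x::AbstractVector)
    n = size(A, 1)
    y = zeros(promote_type(eltype(A), eltype(x)), n)
    rows, vals = rowvals(A), nonzeros(A)
    for j in 1:n
        for k in nzrange(A, j)
            i = rows[k]
            i > j && break               # rows are sorted: the rest is lower triangle
            v = vals[k]
            y[i] += v * x[j]             # upper-triangle (and diagonal) entry
            i < j && (y[j] += v * x[i])  # mirrored lower-triangle entry
        end
    end
    return y
end

A = sprandn(100, 100, 0.05); x = randn(100)
symmul(A, x) ≈ Symmetric(A, :U) * x      # should hold
```

Only the nonzero entries of one triangle are ever touched, which is why the cost scales with nnz(A) rather than n².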

@kshyatt added the labels performance (Must go faster), domain:linear algebra, domain:arrays:sparse on Nov 14, 2018
@KlausC
Contributor Author

KlausC commented Nov 23, 2018

The immense improvement factor greatly widens the range of matrix computations that are feasible on given hardware. In this case, I think "performance improvement" is a slight understatement; I wonder why I cannot attract more attention. (@StefanKarpinski, @andreasnoack).

@chriscoey

I'm with you! I wish there was a regular linear algebra triage call.

@andreasnoack
Member

Please see #22200. It's always a good idea to ask before implementing.

I wish there was a regular linear algebra triage call.

I'd be okay with trying that out. I think the main limiting factor is still reviews, though.

@KristofferC
Sponsor Member

FWIW, comparing the speed improvement against something that hits the abstractarray fallback is not terribly interesting since you can gain arbitrary speedup by changing the sparsity density. Comparing to the non-symmetric sparse case is perhaps more relevant.

@KlausC
Contributor Author

KlausC commented Nov 24, 2018

comparing the speed improvement against something that hits the abstractarray fallback is not terribly interesting

I don't think so, because currently there is no good way to get the job done.
IMO the question is how to get Asym * b done fast when Asym isa Symmetric{T,<:SparseMatrixCSC} and length(Asym) is big while nnz(Asym.data) is small. Fast means O(nnz(Asym.data)), in contrast to O(A.n^2).
The most obvious approach, Asym * b, should not lead into the "abstractarray fallback" trap, because the naive user is not aware of it.
The smart user could try to find a work-around, but will have a hard time and no real success:

julia> @btime $Asym * $b;            # naive user is disappointed with the current situation
  3.016 s (2 allocations: 78.20 KiB)

julia> @btime $Asym * sparse($b);      # uses the SuiteSparse.CHOLMOD C-library
  24.539 ms (50 allocations: 31.80 MiB)

julia> @btime $(sparse(Asym)) * $b;    # only multiplication measured
  1.949 ms (2 allocations: 78.20 KiB)

julia> @btime sparse($Asym) * $b; # included time for sparse(Asym) (BTW: reducible to 10 ms !)
  2.809 s (37 allocations: 40.10 MiB)

after the PR is merged:

julia> @btime $Asym * $b;           # the most natural approach is convincing now
  1.835 ms (2 allocations: 78.20 KiB)

As a consequence, users are forced to avoid the Symmetric/Hermitian wrappers of sparse matrices if they want good performance. But that is not desirable, or even possible, in all cases.
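To make the "abstractarray fallback" trap concrete: the generic matvec reads every (i, j) pair through scalar indexing, and each scalar read on a sparse matrix is itself a search through the stored column. A hypothetical illustration of that cost model (not the actual fallback code):

```julia
using LinearAlgebra, SparseArrays

n = 4
Asym = Symmetric(sprandn(n, n, 0.5))
b = randn(n)

# Roughly what the generic fallback does: n^2 scalar getindex calls,
# each a binary search within a sparse column.
y = zeros(n)
for j in 1:n, i in 1:n
    y[i] += Asym[i, j] * b[j]
end
y ≈ Matrix(Asym) * b    # same result, but O(n^2) work regardless of nnz
```

For n = 10_000 that is 10^8 sparse lookups per multiply, which matches the seconds-long timings above.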

you can gain arbitrary speedup by changing the sparsity density

Of course that is true. But I don't think the example is unrealistic. Actually I looked for an example that clearly demonstrates the difference between O(nnz(A)) and O(n²).
In the SuiteSparse Matrix Collection, more than 50% of the symmetric examples fall into this category:

julia> using MatrixDepot   # v"0.7.0"
julia> length(mdlist(sp(:) & issymmetric))
1148
julia> length(mdlist(sp(:) & issymmetric & @pred(n >= 10^4 && nnz/n^2 <= 0.01)))
587

@KlausC
Contributor Author

KlausC commented Nov 24, 2018

Please see #22200. It's always a good idea to ask before implementing.

I looked at #22200 and it is a pity that I didn't see it before. I would not have started my own effort, though, if that PR had been merged to master before I was stopped in my own project by the missing linalg operation on wrapped sparse matrices. I do not know what prevented #22200 from being accepted. Now we have two implementations for Symmetric{T,<:SparseMatrixCSC} * StridedVector; we should select whichever is better and merge it into master immediately.

@andreasnoack andreasnoack self-requested a review December 12, 2018 18:21
@KlausC KlausC closed this Dec 16, 2018
@KlausC KlausC reopened this Dec 16, 2018
@ViralBShah
Member

ViralBShah commented Dec 17, 2018

@KlausC Please bump often to make sure we get this done, if you don't mind.

@ViralBShah
Member

@andreasnoack Do we have a preference for which of the two PRs to merge? Perhaps this one, being newer, may be less work?

@ViralBShah
Member

Bump

@KlausC
Contributor Author

KlausC commented Jan 10, 2019

@KlausC Please bump often to make sure we get this done, if you don't mind.

I will do...

@StefanKarpinski StefanKarpinski added this to the 1.2 milestone Jan 10, 2019
@KlausC
Contributor Author

KlausC commented Jan 18, 2019

humbly knocking at the door...

@ViralBShah
Member

This looks good to me.

@ViralBShah ViralBShah merged commit 30c6ee7 into JuliaLang:master Jan 18, 2019
@ViralBShah
Member

I guess we can backport this to 1.1.
