Improve slicing of lazily transposed sparse matrices (WIP) #28654
Conversation
This PR uses the code for getting slices in sparsevector.jl to get slices of sparse matrices with lazy adjoint and transpose wrappers. Before, the fallback code in abstractarray.jl created the slices inefficiently.
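A minimal illustration of the behavior this PR speeds up (the matrix size and indices here are only for the example):

```julia
using SparseArrays, LinearAlgebra

A = sprand(10^3, 10^3, 0.01)

# A column of the lazy transpose is a row of the parent matrix. Before
# this PR the generic fallback in abstractarray.jl assembled such slices
# element by element; the PR routes them through the specialized
# sparsevector.jl indexing code instead.
v = transpose(A)[:, 7]
@assert v == A[7, :]

# The adjoint wrapper behaves analogously, with conjugation.
w = A'[2, :]
@assert w == conj.(A[:, 2])
```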
This tests slicing (indexing) of sparse matrices wrapped with Transpose or Adjoint.
Testing with code coverage showed that before this PR there were no tests of indexing sparse matrices wrapped with `Transpose` or `Adjoint`. This commit adds tests for all of the new indexing methods.
…rixCSC

* Only do a recursive transpose if the element type of the SparseMatrixCSC is not a Number. Some tests on 1000x1000 and 100x100 matrices showed at most approximately a 10% decrease in time for slicing transposed, numerical SparseMatrixCSC.
* Rename _colptr_range to _row_index_range and improve comments.
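The `Number` special case in the commit above can be sketched with dispatch; `_maybe_transpose` is a hypothetical name for illustration, not an identifier from the PR:

```julia
using LinearAlgebra

# Hypothetical helper sketching the commit's idea: scalar element types
# need no recursive transpose, while matrix-valued elements do.
_maybe_transpose(x::Number) = x          # a Number is its own transpose
_maybe_transpose(x) = transpose(x)       # e.g. elements that are matrices

@assert _maybe_transpose(3.0) === 3.0
@assert _maybe_transpose([1 2; 3 4]) == [1 3; 2 4]
```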
Force-pushed from 8438f75 to e5d2acd.
Commit e5d2acd decreases the benchmark time consistently by about 10% for some test matrices. I think the compiler has enough information to do the same optimization without this commit. But, that optimization might not come soon.
Nice! I'm looking forward to this being merged. Is it still WIP?
It is finished. It just needs to be reviewed. I changed existing indexing routines, but I checked that the changes are optimized away and perform exactly as before. Still, someone should trigger nanosoldier.
@nanosoldier
Where can I find the output from nanosoldier? ... or maybe it failed to run.
I think it was on strike when I tried last time but should be back on duty now, so let's try again @nanosoldier
Hm. It looks like I've been misinformed. It still isn't running.
Sorry about your wasted time mucking with this. I doubt it will show a performance hit, but the modification really should be tested properly. I prefer to wait a bit rather than run them locally.
@nanosoldier
Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @ararslan
Looks like there may be a regression. Reviewing my own benchmarks in the gist, I also see one:
These should be unchanged, but in fact there is an additional allocation. I thought I took care of this before the PR. It can definitely be fixed by copying code, but I want to avoid that if possible. I will look into it...
Let's try to get this in.
Bump.
I have four PRs to Julia in the works. I'll prioritize this one. But at the moment, my time budget is pretty tight. Thanks for the reminder.
Just checking to see if it is worth revisiting now.
We have moved the SparseArrays stdlib to an external repository. Please open this PR against that repository: https://github.com/JuliaLang/SparseArrays.jl Thank you!
Huh. I completely forgot about this. I'll look into it.
This PR uses the code for getting slices in sparsevector.jl to get slices of sparse matrices
that have lazy adjoint and transpose wrappers. Before, the fallback code in abstractarray.jl created the
slices inefficiently.
Here is an example with a sparse matrix created with `sprand(10^4, 10^4, 0.01)`, which shows an increase in speed of 10x to 1000x.
The test code.
Benchmark before the PR.
Benchmark after the PR
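The benchmarks linked above can be reproduced roughly as follows; this is only a sketch of the kind of timing run, assuming the BenchmarkTools package (the specific slice indices are made up for illustration):

```julia
using SparseArrays, BenchmarkTools

S = sprand(10^4, 10^4, 0.01)
St = transpose(S)

# Slices of the lazy wrapper: before the PR these went through the
# generic AbstractArray fallback, after it through the sparse routines.
@btime $St[:, 500];
@btime $St[500, :];
@btime $St[100:200, 300:400];
```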
The changes should have no effect on the efficiency of slicing unwrapped (i.e. `SparseMatrixCSC`) matrices.

The new slicing calls `adjoint` and `transpose` recursively. I don't know of any cases where that is useful, although there may be. Consider, for example, a sparse matrix whose elements are dense matrices. If you want to interpret this as a block matrix, then `transpose(S)[1, :]` doesn't give you the correct result. (Neither does `S[1, :]`.) Maybe restricting the recursion to structures that are intended to be used for linear algebra (e.g. `BlockArray`), rather than any ad-hoc array, makes sense. The reason for this restriction would be to reduce complexity. I'm sure this has been discussed elsewhere, but I have not yet read about it.

There could be further optimizations. EDIT: The following has been done and committed to this branch. For instance, check if the elements of the parent matrix are numbers (via dispatch), in which case it is not necessary to call `transpose`.

EDIT: Tests have been added to this branch. This PR needs tests; there are none. I would not be surprised if some or all slicing of wrapped arrays is not covered by existing tests.
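The block-matrix caveat discussed above can be illustrated with a dense matrix of matrices (a sketch for illustration, not code from the PR):

```julia
using LinearAlgebra

B = [1 2; 3 4]
# A 2x2 matrix whose elements are themselves 2x2 matrices.
M = reshape([B, 3B, 2B, 4B], 2, 2)   # column-major: M[1,1]=B, M[2,1]=3B, ...

# The lazy transpose is recursive: it also transposes each element.
@assert transpose(M)[1, 2] == transpose(3B)

# So a slice of transpose(M) holds transposed blocks, not the row of
# blocks [B, 3B] expected from a purely structural (block) transpose.
@assert transpose(M)[1, :] == [transpose(B), transpose(3B)]
```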