ENH: sparse: speed up LIL indexing + assignment via Cython #3356

pv · 2014-02-19T22:20:09Z

This PR speeds up LIL matrix setitem/getitem by 5x to 50x by writing the fancy indexing code more carefully. It also adds fast paths for scalar indexing (~5x faster).

Fancy getitem speed is now close to that of CSR/CSC. Fancy setitem is also close to CSR/CSC, and when an assignment changes the sparsity pattern, LIL can now be much faster than CSR/CSC.

Benchmarks: https://gist.githubusercontent.com/pv/9102868/raw/a256bfe058b44131f8acba0f41ab90eafacbe127/gistfile1.txt
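For reference, the operations being benchmarked look roughly like the following. This is only an illustrative sketch, assuming a SciPy version that includes this change; the matrix shape and values are made up for the example and are not taken from the benchmark script.

```python
from scipy.sparse import lil_matrix

A = lil_matrix((3, 5))

# Scalar assignment and lookup (the case covered by the scalar fast paths).
A[0, 1] = 2.0
x = A[0, 1]

# Fancy assignment: paired row/column index lists, one value per pair.
A[[0, 2], [3, 4]] = [5.0, 6.0]

# Fancy lookup with the same index lists returns a sparse result.
picked = A[[0, 2], [3, 4]].toarray().ravel().tolist()

# LIL keeps one sorted column list and one value list per row, so adding
# a new nonzero only touches the lists of the affected row.
A[1, 4] = 7.0
row1_cols = list(A.rows[1])
row1_vals = list(A.data[1])
```

The last assignment is the "changing sparsity pattern" case: in LIL it is a list insertion in a single row, whereas CSR/CSC have to rebuild their index arrays.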

coveralls · 2014-02-19T23:35:41Z

Coverage remained the same when pulling c5e3ae4 on pv:lil-speed-cy into 32cd96d on scipy:master.

jnothman · 2014-02-19T23:45:41Z

I think you can switch off bounds checking and wraparound for a few more functions...

pv · 2014-02-20T00:09:08Z

I didn't get a measurable performance improvement from disabling wraparound/bounds checking. bisect_left, however, does matter, as it's the innermost loop.

It's also possible that someone passes in wrong-sized rows/data arrays, so if I add the decorators, more checks would be needed. I'd rather keep it simpler.
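To illustrate why the binary search sits in the innermost loop, here is a pure-Python sketch of LIL lookup and insertion for a single entry. It is not the Cython code from this PR: the helper names `lil_get` and `lil_insert` are made up for the example, it uses the standard library `bisect.bisect_left`, and it skips negative-index handling and bounds checks.

```python
from bisect import bisect_left


def lil_get(rows, data, i, j):
    """Return the value stored at (i, j), or 0 if the entry is absent."""
    row, vals = rows[i], data[i]
    pos = bisect_left(row, j)  # binary search in the sorted column list
    if pos < len(row) and row[pos] == j:
        return vals[pos]
    return 0


def lil_insert(rows, data, i, j, x):
    """Set entry (i, j) to x, keeping the column list of row i sorted."""
    row, vals = rows[i], data[i]
    pos = bisect_left(row, j)
    if pos < len(row) and row[pos] == j:
        vals[pos] = x  # overwrite an existing entry
    else:
        row.insert(pos, j)  # new entry: the sparsity pattern changes
        vals.insert(pos, x)


# A 2x6 matrix in LIL form: one sorted column list and one value list per row.
rows = [[1, 4], []]
data = [[10.0, 40.0], []]
lil_insert(rows, data, 0, 2, 20.0)
lil_insert(rows, data, 1, 5, 50.0)
```

Every element touched by a fancy get or set goes through one such search, so its cost dominates. The sketch also relies on `rows` and `data` having matching lengths, which is the kind of input assumption that unchecked indexing would no longer catch.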

jnothman · 2014-02-20T00:46:54Z

Okay.


juliantaylor · 2014-02-21T00:18:20Z

.gitattributes

@@ -30,6 +30,7 @@ scipy/io/matlab/mio_utils.c binary
 scipy/io/matlab/mio5_utils.c binary
 scipy/io/matlab/streams.c binary
 scipy/signal/_spectral.c binary
+scipy/sparse/_csparsetools.c


I assume that should have gone into .gitignore?

Aren't the .gitattributes entries for the Cython files now obsolete, as they are not included in the repository anymore?

Indeed, I intended to put it in .gitignore. Fixed.

coveralls · 2014-02-21T18:26:07Z

Coverage remained the same when pulling fee97bd on pv:lil-speed-cy into 32cd96d on scipy:master.

coveralls · 2014-02-21T18:26:28Z

Coverage remained the same when pulling fee97bd on pv:lil-speed-cy into 32cd96d on scipy:master.

rgommers · 2014-02-23T20:52:09Z

@jnothman did you finish your review? This could still make it into 0.14.x if merged in time.

jnothman · 2014-02-23T23:48:43Z

scipy/sparse/lil.py

+        # Scalar fast path first
+        if isinstance(index, tuple) and len(index) == 2:
+            i, j = index
+            if ((isinstance(i, int) or isinstance(i, np.integer)) and


Other implementations use sputils.isintlike, which has slightly different semantics. For consistency, it should apply here.

Using isintlike or isscalarlike slows the fast path down by 25-50%.
Hence the splitting of it into two parts.
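A rough sketch of the trade-off discussed here, in plain Python. The names are hypothetical: `isintlike_sketch` is only a stand-in for a permissive "int-like" check and is not the actual sputils function, the toy `getitem` works on a list of lists rather than a LIL matrix, and the `np.integer` half of the real check is left out to keep the example free of dependencies.

```python
def is_plain_int(x):
    """Cheap check in the spirit of the fast path: integer types only."""
    return isinstance(x, int)


def isintlike_sketch(x):
    """Hypothetical permissive check: anything equal to its int() value."""
    try:
        return bool(int(x) == x)
    except (TypeError, ValueError):
        return False


def getitem(matrix, index):
    """Toy getitem over a list of lists, with a scalar fast path tried first."""
    if isinstance(index, tuple) and len(index) == 2:
        i, j = index
        if is_plain_int(i) and is_plain_int(j):
            return matrix[i][j]  # fast path: no conversion, no try/except
    # General path: accept anything int-like, at the cost of extra work.
    i, j = index
    if isintlike_sketch(i) and isintlike_sketch(j):
        return matrix[int(i)][int(j)]
    raise IndexError("unsupported index in this sketch")
```

The semantic difference is visible with an index such as `1.0`: the isinstance check rejects it and the call falls through to the general path, while the permissive check accepts it.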

jnothman · 2014-02-24T00:00:40Z

Sorry for so many minor comments. Really, none should be a blocker, and this LGTM!


pv · 2014-02-24T00:51:21Z

Thanks, revised. Getitem is now ~25% faster due to the faster insertion.

jnothman · 2014-02-24T01:15:53Z

Great! I hope you haven't missed the 0.14 boat :)

coveralls · 2014-02-24T01:22:56Z

Coverage remained the same when pulling 6270da3 on pv:lil-speed-cy into 0da153e on scipy:master.

rgommers · 2014-02-24T20:38:51Z

To make sure no boats are missed, let's push the merge button.

ENH: sparse: speed up LIL indexing + assignment via Cython

rgommers · 2014-02-24T20:39:02Z

Thanks @pv, @jnothman

pv added 5 commits February 19, 2014 23:30

ENH: sparse: rewrite some lil methods in Cython for speedups

d3ec1b1

ENH: sparse: implement cythonized fancy setitem

5ba9fa8

ENH: sparse: use memoryview in _csparsetools

4bca4f3

ENH: sparse: add scalar indexing fast paths to LIL

43ac432

MAINT: sparse: document the _csparsetools.pyx file appropriately

c5e3ae4

pv mentioned this pull request Feb 19, 2014

fancy indexation is terrible slow (Trac #1071) #1598

Closed

pv added PR labels Feb 19, 2014

juliantaylor reviewed Feb 21, 2014

pv added 2 commits February 21, 2014 19:38

MAINT: add _csparsetools to .gitignore

a9b564b

MAINT: drop Cython from .gitattributes (no longer in repo)

fee97bd

jnothman reviewed Feb 23, 2014

ENH: sparse/lil: minor improvements in LIL fancy indexing

6270da3

As per review comments:
- explain and rename prepare_index_arrays
- indicate out-of-bounds column indices
- faster insertion in lil_fancy_get
- explain reason for preferring isinstance in scalar fast paths

rgommers added a commit that referenced this pull request Feb 24, 2014

Merge pull request #3356 from pv/lil-speed-cy

6bea3d0

ENH: sparse: speed up LIL indexing + assignment via Cython

rgommers merged commit 6bea3d0 into scipy:master Feb 24, 2014

rgommers added this to the 0.14.0 milestone Feb 24, 2014

rgommers added the enhancement label Feb 24, 2014
