
ENH: sparse.linalg: Implement matrix_power() #18544

Merged: 13 commits merged into scipy:main on Aug 12, 2023

Conversation

@ljwolf (Contributor) commented May 26, 2023

Reference issue

No specific reference issue

What does this implement/fix?

This adds scipy.sparse.linalg.matrix_power(), an analogue of numpy.linalg.matrix_power(). It uses the old _spmatrix.__pow__() recursive implementation, since a MatrixPowerOperator()-based version was quite a bit slower.
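For context, a minimal usage sketch of the new function, assuming a SciPy build that includes this change; the random test matrix, its size, and the exponent are purely illustrative:

import numpy as np
from scipy import sparse
from scipy.sparse.linalg import matrix_power

# Illustrative input: a sparse 1000 x 1000 matrix with ~0.1% nonzeros.
A = sparse.random(1000, 1000, density=0.001, format="csr", random_state=0)

A3 = matrix_power(A, 3)  # mathematically equivalent to A @ A @ A

# Cross-check against the dense NumPy analogue.
assert np.allclose(A3.toarray(), np.linalg.matrix_power(A.toarray(), 3))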

@j-bowhay added the enhancement (A new feature or improvement) and scipy.sparse.linalg labels on May 26, 2023
@j-bowhay changed the title from "implement scipy.sparse.linalg.matrix_power()" to "ENH: sparse.linalg: Implement matrix_power()" on May 26, 2023
@ljwolf (Contributor, Author) commented May 26, 2023

Whitespace issues fixed. 37 tests fail because they expect _spmatrix.__pow__ to raise an exception when a sparse matrix is raised to a negative power.

@ljwolf marked this pull request as ready for review on May 26, 2023 17:27
@ljwolf (Contributor, Author) commented May 26, 2023

We decided to keep the default behaviour of both MatrixPowerOperator() and _spmatrix.__pow__, which raise an error when the user raises a matrix to a negative power.

If we allowed the user to pass a negative power, we would have to invert the input array A, which usually produces a dense matrix unless the input has special structure. It's easy enough for an interested user to invert the matrix themselves and use abs(power), so we keep the behaviour consistent by raising an error for a negative power.
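For illustration, a hedged sketch of the workaround described above; the test matrix (made nonsingular by adding the identity) and its size are assumptions for the example:

from scipy import sparse
from scipy.sparse.linalg import inv, matrix_power

# Assumed example input.
A = sparse.random(200, 200, density=0.05, format="csc", random_state=0) + sparse.eye(200)
power = -3

# matrix_power raises for power < 0, so invert explicitly and use abs(power).
A_inv = inv(A)                     # the inverse is typically much denser than A
result = matrix_power(A_inv, abs(power))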

@rossbar (Contributor) commented May 26, 2023

Just to note: it looks like #513 introduced a recursive implementation that saves on the number of matmuls for larger exponents. It would be worthwhile to have a benchmark in place for sparse matrix power so we can confirm there are no performance regressions with the updated implementation. @perimosocordiae is working on a benchmark in a separate PR!

@ljwolf (Contributor, Author) commented May 26, 2023

Happy to pull in the old implementation, too. I figured that the MatrixPowerOperator() implementation would be faster than the recursive call, especially if structure is known.

@perimosocordiae (Member):

I just opened gh-18553 with a benchmark covering this case. Once that merges, it'll be really easy to evaluate the impact of this PR's changes using dev.py bench --compare main.
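For readers unfamiliar with SciPy's asv benchmarks, here is a rough sketch of what a benchmark covering this case could look like. This is illustrative only; the file name, class name, and parameter grid are assumptions, and the actual benchmark added in gh-18553 may be organised differently:

# hypothetical file: benchmarks/benchmarks/sparse_matrix_power.py
from scipy import sparse


class MatrixPowerBench:
    # Parameter grid mirrors the result tables reported later in this thread.
    params = [[0, 1, 2, 3, 8, 9], [1e-6, 1e-3]]
    param_names = ["power", "density"]

    def setup(self, power, density):
        self.A = sparse.random(1000, 1000, density=density,
                               format="csr", random_state=42)

    def time_matrix_power(self, power, density):
        # Exercise the existing ** operator on a square sparse matrix.
        self.A ** power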

@perimosocordiae (Member):

My benchmark PR is now merged.

@ljwolf (Contributor, Author) commented May 31, 2023

Yeah, this is not gonna fly. The LinearOperator() version is an order of magnitude slower than the recursive solution on my computer:

On main:

=== ============== ==============
--           N / density
--- -----------------------------
 x   1000 / 1e-06   1000 / 0.001
=== ============== ==============
 0     49.7±3μs       52.0±6μs
 1    66.7±20μs       58.9±7μs
 2     209±20μs       268±10μs
 3     338±20μs       451±10μs
 8     516±50μs       662±50μs
 9     697±50μs       856±20μs
=== ============== ==============

In this PR:

=== ============== ==============
--           N / density
--- -----------------------------
 x   1000 / 1e-06   1000 / 0.001
=== ============== ==============
 0     424±10μs       428±30μs
 1     826±30μs       845±40μs
 2     932±20μs     1.05±0.06ms
 3   1.13±0.06ms     1.52±0.3ms
 8   1.76±0.05ms     2.13±0.1ms
 9    2.11±0.2ms     2.35±0.2ms
=== ============== ==============

I'll swap back to the _spmatrix.__pow__ implementation, rather than scipy.sparse.linalg.MatrixPowerOperator().

@ljwolf (Contributor, Author) commented Jun 1, 2023

OK, this now uses the older recursive __pow__() implementation for scipy.sparse.linalg.matrix_power(), and transitions _spmatrix.__pow__ to use scipy.sparse.linalg.matrix_power(), so the benchmark is about the same as on main:

=== ============== ==============
--           N / density
--- -----------------------------
 x   1000 / 1e-06   1000 / 0.001
=== ============== ==============
 0     56.2±5μs       52.1±4μs
 1     60.2±8μs      70.8±20μs
 2     197±20μs       254±30μs
 3     351±20μs       442±30μs
 8     533±50μs      769±200μs
 9     627±80μs      905±100μs
=== ============== ==============

@perimosocordiae (Member):

Commenting here to say that I haven't forgotten about this PR! I still need to do one more read through the changes, but in the meantime, could you update the PR description to reflect the current state of the change?

@ljwolf (Contributor, Author) commented Jun 22, 2023

Great, the PR description and docstring are now fixed.

@jjerphan (Contributor) left a comment:

Thank you, @ljwolf!

Here are a few comments.

Resolved review threads: scipy/sparse/_matrix.py; scipy/sparse/linalg/_matfuncs.py (two threads)
Comment on lines +902 to +906
    # Exponentiation by squaring: one recursive call on power // 2,
    # plus an extra multiply by A when the power is odd.
    tmp = matrix_power(A, power // 2)
    if power % 2:
        return A @ tmp @ tmp
    else:
        return tmp @ tmp
@jjerphan (Contributor):

What is the benefit of proceeding this way here?

@ljwolf (Contributor, Author):

This is part of the existing matrix_power implementation, which (as I understand it) was used to reduce the number of required multiplications. This performance improvement was significant then, and seems so again.

@jjerphan (Contributor):

Ah yes, that's just a recursive implementation.

We could further reduce the cost of this function with an inline (iterative) computation of roughly np.log2(power) steps, so that the checks just above are not re-run on every recursive call.

What do you think?
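For concreteness, a hedged sketch of the inline alternative being suggested; the function name and structure are illustrative, not the PR's code, and the validation of A and power is assumed to run once before this loop:

def _matrix_power_inline(A, power):
    # Iterative exponentiation by squaring: multiply `result` by the current
    # square of A whenever the corresponding bit of `power` is set.
    # Assumes power >= 1 and that A has already been validated.
    result = None
    base = A
    while True:
        if power % 2:
            result = base if result is None else result @ base
        power //= 2
        if power == 0:
            return result
        base = base @ base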

Resolved review threads: scipy/sparse/linalg/_matfuncs.py (two); scipy/sparse/linalg/tests/test_matfuncs.py; scipy/sparse/_matrix.py (two, outdated)
ljwolf and others added 2 commits on July 28, 2023 16:30 (Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>)
ljwolf and others added 3 commits on July 28, 2023 16:30 (Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>)
@ljwolf (Contributor, Author) commented Jul 28, 2023

OK, I took all of @jjerphan's suggestions and clarified the comments! Let me know where to take this further.

@jjerphan (Contributor) left a comment:

Here is a second pass.

I just have another comment regarding the implementation of matrix_power, which I think can be improved; it repeats the inline-computation suggestion quoted above.


Resolved review threads: scipy/sparse/linalg/_matfuncs.py (two, one outdated)
@@ -861,3 +862,47 @@ def _ell(A, m):
    log2_alpha_div_u = np.log2(alpha/u)
    value = int(np.ceil(log2_alpha_div_u / (2 * m)))
    return max(value, 0)

def matrix_power(A, power, structure=None):
    """
Member (reviewer):

A couple docstring issues:

  • As @jjerphan already noted, structure must be described in the "Parameters" section. If there is nothing implemented yet, then remove the parameter.
  • Add an "Examples" section (see DOC: Add "Examples" to docstrings #7168).
  • Add a "Notes" section with the appropriate versionadded markup.

For comparison, see the expm docstring in this file. A "Notes" section with just the versionadded markup is fine.
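To make the requested layout concrete, here is a hedged sketch of a numpydoc docstring with the sections listed above. It is illustrative only: the wording in the merged PR may differ, the structure parameter is omitted since it was later removed, and the doctest is an assumed example rather than the docstring actually committed:

def matrix_power(A, power):
    """
    Raise a square sparse array to the integer power `power`.

    Parameters
    ----------
    A : (M, M) square sparse array
        The matrix to be raised to the power `power`.
    power : int
        The exponent; must be a nonnegative integer.

    Returns
    -------
    (M, M) sparse array
        ``A`` raised to the power `power`.

    Notes
    -----
    .. versionadded:: 1.12.0

    Examples
    --------
    >>> from scipy import sparse
    >>> from scipy.sparse.linalg import matrix_power
    >>> A = sparse.eye(4, format="csr")
    >>> (matrix_power(A, 3) != A).nnz == 0
    True
    """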

@ljwolf (Contributor, Author):

Thanks! Done. "Structure" was a holdover from the MatrixPowerOperator implementation, and it has been removed. I also now refer to the point from your stackoverflow comment in the Notes section... I think it's pretty important to disclose this to the user.

@WarrenWeckesser (Member):

The recursive/logarithmic implementation is probably a good default, and I don't suggest we change it, but folks involved in this PR might be interested to know that it is not always the best approach. Its performance can be substantially worse than simple iteration, depending on how the number of nonzero values increases as powers are computed. See my answer to a question on stackoverflow for an example where the recursive calculation of A**16 is much slower than computing A*A*A*A*A*A*A*A*A*A*A*A*A*A*A*A.
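To illustrate the comparison Warren describes, a small hedged sketch; the matrix size and density are arbitrary, and which approach wins depends entirely on how quickly the powers fill in:

import time
from scipy import sparse

A = sparse.random(2000, 2000, density=0.001, format="csr", random_state=0)

t0 = time.perf_counter()
result = A.copy()
for _ in range(15):                 # A**16 as fifteen A @ (previous) products
    result = A @ result
t_iterative = time.perf_counter() - t0

t0 = time.perf_counter()
result = A.copy()
for _ in range(4):                  # A**16 as four squarings: A^2, A^4, A^8, A^16
    result = result @ result
t_squaring = time.perf_counter() - t0

print(f"iterative: {t_iterative:.3f} s, repeated squaring: {t_squaring:.3f} s")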

@ljwolf (Contributor, Author) commented Aug 4, 2023

Thanks for the comment, @WarrenWeckesser! I think I've addressed the concerns mentioned. The point from your stackoverflow post is useful to disclose (imho), so I've added the gist of it to the notes.

@ljwolf (Contributor, Author) commented Aug 4, 2023

I should say, I'm happy to move forward with an inline-based computation, but my goal here was only to bring forward the existing implementation. The recursive strategy was adopted in #513, and we should probably develop a more stringent benchmark to characterise the relative performance of recursive vs. inline strategies over different sizes and sparsity levels. For now, we should proceed with the current _spmatrix.__pow__ implementation and optimise it once it has been brought forward to sparse arrays.

@jjerphan (Contributor) left a comment:

LGTM.

I am also in favor of studying both possible implementations in a dedicated PR.

I just have one comment regarding the docstring.

Resolved review thread: scipy/sparse/linalg/_matfuncs.py (outdated)
A commit applying the suggestion was added (Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>)
@perimosocordiae (Member):

As discussed, there are a few ways that this implementation could be made more efficient, but as it stands this is already a very nice improvement to the library: without a standalone matrix_power function, users of sparse arrays would be out of luck if they were migrating old sparse matrix code using the ** operator.

Thanks for the PR, @ljwolf !

@perimosocordiae merged commit 1ef84bc into scipy:main on Aug 12, 2023
20 of 22 checks passed
@j-bowhay added this to the 1.12.0 milestone on Aug 12, 2023
lucascolley pushed a commit to lucascolley/scipy that referenced this pull request Aug 12, 2023
* add matrix_power() using MatrixPowerOperator()
* remove unused imports in _matrix.py
* move back to recursive matrix_power implementation
* fix lint issues
* add matrix_power to __init__ autosummary
* fix docstring explaining negative powers
* Update scipy/sparse/linalg/_matfuncs.py (Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>)
* Update scipy/sparse/_matrix.py (Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>)
* Update scipy/sparse/linalg/_matfuncs.py (Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>)
* Update scipy/sparse/_matrix.py (Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>)
* Update scipy/sparse/linalg/_matfuncs.py (Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>)
* update docstring and remove structure param
* Update scipy/sparse/linalg/_matfuncs.py (Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>)

---------

Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>
Labels: enhancement (A new feature or improvement), scipy.sparse.linalg
Projects: none yet
Linked issues that may be closed by this pull request: none
7 participants