ENH: Use buffered space in linalg.matrix_power #18137
Conversation
The linalg.matrix_power function allocates new space for each matrix multiplication that it performs. For large matrices, creating and using a buffer can lead to performance benefits.
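As a rough sketch of the idea (using `np.matmul` directly; the actual change operates on the `fmatmul` binding inside `matrix_power`):

```python
import numpy as np

a = np.random.rand(500, 500)

# Unbuffered: each matmul allocates a fresh result array.
a_cubed = np.matmul(np.matmul(a, a), a)

# Buffered: reuse the intermediate array via the `out=` argument.
buffer = np.matmul(a, a)
a_cubed_buffered = np.matmul(buffer, a, out=buffer)

assert np.allclose(a_cubed, a_cubed_buffered)
```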
```diff
@@ -1039,6 +1039,18 @@ def tz(mat):
         if dt != object:
             tz(self.stacked.astype(dt))
 
+    def test_power_is_three(self, dt):
```
The case of the exponent being 3 (a hard-coded case) was actually untested before.
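A minimal standalone version of such a test might look like this (my sketch, not the PR's actual test body):

```python
import numpy as np
from numpy.linalg import matrix_power
from numpy.testing import assert_equal

def test_power_is_three():
    # n == 3 exercises the previously untested hard-coded branch.
    mat = np.arange(9).reshape(3, 3)
    expected = np.matmul(np.matmul(mat, mat), mat)
    assert_equal(matrix_power(mat, 3), expected)
```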
Thanks @yunfeim. It would be helpful if you could add a benchmark for `matrix_power`.
```diff
-        return fmatmul(fmatmul(a, a), a)
+        # create and use buffered space
+        buffer = fmatmul(a, a)
+        return fmatmul(buffer, a, out=buffer)
```
Seems clear that this will help.
Do we even support in-place matrix multiply? (Does BLAS even support it?) I honestly expect the `matmul` ufunc (and `np.dot`) will just run overlap detection and create an additional internal buffer here. (I am having problems with `timeit` not running my setup code right now.)

Unless there are some clear timings, we should probably keep the new tests and maybe see how to improve the `n > 3` case, where I expect something might work (if it is worth it). Otherwise, we first need to dig into avoiding the additional copy in `np.dot` and `np.matmul` (and making sure that is correct).
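For what it is worth, the kind of timing comparison being asked for could be collected along these lines (a sketch; matrix size and repeat count are arbitrary choices of mine):

```python
import timeit

setup = "import numpy as np; a = np.random.rand(1000, 1000); buf = np.empty_like(a)"

# Fresh allocation on every call.
t_alloc = timeit.timeit("np.matmul(a, a)", setup=setup, number=20)
# Writing into a preallocated, non-overlapping buffer.
t_out = timeit.timeit("np.matmul(a, a, out=buf)", setup=setup, number=20)
# Output aliases the input; overlap detection may copy internally.
# (Note: `a` is overwritten across iterations, which is fine for a
# rough timing but makes the resulting values meaningless.)
t_alias = timeit.timeit("np.matmul(a, a, out=a)", setup=setup, number=20)

print(f"fresh allocation:    {t_alloc:.3f}s")
print(f"out=, no overlap:    {t_out:.3f}s")
print(f"out= aliasing input: {t_alias:.3f}s")
```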
```diff
     # Use binary decomposition to reduce the number of matrix multiplications.
     # Here, we iterate over the bits of n, from LSB to MSB, raise `a` to
     # increasing powers of 2, and multiply into the result as needed.
     z = result = None
     while n > 0:
-        z = a if z is None else fmatmul(z, z)
+        z = a.copy() if z is None else fmatmul(z, z, out=z)
```
This is a little less obvious.
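For context, here is a self-contained sketch of the binary-decomposition loop this hunk amends (simplified; `fmatmul` stands in for the operator bound inside `matrix_power`, and the copies guard against `out=` aliasing):

```python
import numpy as np

def matrix_power_sketch(a, n, fmatmul=np.matmul):
    # Exponentiation by squaring: z holds a**(2**i) at bit i of n.
    z = result = None
    while n > 0:
        # Copy `a` on the first pass so the caller's array is never
        # used as an in-place `out=` buffer; afterwards square z into
        # its own storage.
        z = a.copy() if z is None else fmatmul(z, z, out=z)
        if n & 1:
            # `result` must not alias `z`, since z is squared in place
            # on later iterations; hence an extra copy relative to the
            # unbuffered version.
            result = z.copy() if result is None else fmatmul(result, z)
        n >>= 1
    return result

# Quick check against the library implementation.
m = np.random.rand(4, 4)
assert np.allclose(matrix_power_sketch(m, 5), np.linalg.matrix_power(m, 5))
```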
This looks like a fairly clear win to me, although in principle this now does an extra copy that would be possible to avoid.
@yunfeim is there a benchmark that is improved by this code? If not, please add one. In any case, you should show a before/after comparison of speed or memory usage.
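For instance, an ASV-style benchmark along these lines could measure it (a sketch following the conventions of NumPy's `benchmarks/` suite; the class name and parameters are my assumptions):

```python
import numpy as np

class MatrixPower:
    # Exponents chosen to hit both the hard-coded (n <= 3) and the
    # binary-decomposition (n > 3) paths.
    params = [2, 3, 5, 9, 16]
    param_names = ['n']

    def setup(self, n):
        self.a = np.random.rand(300, 300)

    def time_matrix_power(self, n):
        np.linalg.matrix_power(self.a, n)
```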
This benchmark shows no change in this PR. I think the ufunc overlap checks make a copy of the overlapping operand anyway. Unfortunately, I think we should close this PR.
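One way to see that overlap handling in action (my own check, not from the thread): if `matmul` wrote directly into an aliased `out`, partially written values would corrupt the input mid-computation, so a matching result implies an internal temporary was used.

```python
import numpy as np

z = np.random.rand(200, 200)
reference = np.matmul(z, z)
np.matmul(z, z, out=z)  # out aliases both inputs
assert np.allclose(reference, z)
```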
I agree, the current state is not useful. Parts of it could be, but there has been no follow-up in a long time.