[MRG] Use GEMM in _update_dict #11420

jakirkham · 2018-07-03T22:31:33Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Avoids a copy in _update_dict and fuses two operations together using the BLAS GEMM operation.

Any other comments?

This fuses the multiplication and addition together into the same computation. Also includes the sign change as well. Not to mention that SciPy linalg routines return Fortran ordered arrays (even if they were C-ordered originally) unlike NumPy's `dot`. So this avoids a copy as well.

jnothman

Is it worth showing a benchmark?

jnothman · 2018-07-03T23:28:51Z

sklearn/decomposition/dict_learning.py

    ger, = linalg.get_blas_funcs(('ger',), (dictionary, code))
+    # Residuals, computed with BLAS for speed and efficiency
+    # R <- -1.0 * U * V^T + 1.0 * Y
+    R = gemm(-1.0, dictionary, code, 1.0, Y)


This produces a Fortran array I presume?

Maybe it would be worth making it explicit that we expect a fortran array in the comment.

Interestingly, the SciPy linalg functions correctly handle C ordered arrays as input.

In [1]: import numpy as np In [2]: import scipy.linalg as linalg In [3]: np.random.seed(0) In [4]: a = np.random.random((2, 3)) In [5]: b = np.random.random((3, 4)) In [6]: c = np.random.random((2, 4)) In [7]: gemm, = linalg.get_blas_funcs(('gemm',), (a, b, c)) In [8]: gemm(1.0, a, b, 1.0, c) Out[8]: array([[1.62736178, 1.79020759, 1.92593582, 2.17344607], [1.08121316, 1.54678646, 0.89707181, 1.77876957]]) In [9]: gemm(1.0, a, b, 1.0, c).flags Out[9]: C_CONTIGUOUS : False F_CONTIGUOUS : True OWNDATA : True WRITEABLE : True ALIGNED : True WRITEBACKIFCOPY : False UPDATEIFCOPY : False

That said, we appear to already be forcing dictionary and code to Fortran ordered arrays anywhere _update_dict is called.

jakirkham · 2018-07-04T00:52:03Z

This is the main take away.

In [1]: import numpy as np

In [2]: a = np.random.random((5, 6)).copy(order="F")

In [3]: b = np.random.random((6, 7)).copy(order="F")

In [4]: np.isfortran(np.dot(a, b))
Out[4]: False

Meaning calling np.asfortranarray before was copying the array. This avoids that copy.

jnothman · 2018-07-04T01:36:53Z

Ah, I get it now. LGTM. But I'm no blas expert, so let's get a second opinion.

jakirkham · 2018-07-10T13:40:18Z

Any ideas who might be a good person to ask to look at this?

jakirkham · 2018-07-11T15:22:52Z

Would you be able to review, @ogrisel?

jakirkham · 2018-07-17T14:33:55Z

Friendly nudge 😉

ogrisel

This looks good. Out of curiosity could you please run a quick benchmark to evaluate the performance impact of that fix on a typical (smallish) problem of yours?

ogrisel · 2018-07-17T15:50:44Z

sklearn/decomposition/dict_learning.py

    ger, = linalg.get_blas_funcs(('ger',), (dictionary, code))
+    # Residuals, computed with BLAS for speed and efficiency
+    # R <- -1.0 * U * V^T + 1.0 * Y
+    R = gemm(-1.0, dictionary, code, 1.0, Y)


Maybe it would be worth making it explicit that we expect a fortran array in the comment.

jakirkham · 2018-07-18T06:39:50Z

Benchmarking shows the time change is pretty close to negligible, which is expected since the same operations are done in either case. Though memory usage should be reduced as we are dealing only in Fortran arrays now instead of converting as before thus avoiding a copy.

agramfort · 2018-07-18T08:03:18Z

@ogrisel I added the comment you suggested.

Merging.

jakirkham · 2018-07-18T14:59:12Z

Thanks @agramfort. Sorry @ogrisel, thought you were concerned about the requirements of input arrays.

Get BLAS functions as part of prep

1985177

jakirkham force-pushed the use_gemm__update_dict branch from 23228fe to e48f0f1 Compare July 3, 2018 22:53

jakirkham force-pushed the use_gemm__update_dict branch from e48f0f1 to 78c0570 Compare July 3, 2018 22:57

jakirkham changed the title ~~Use GEMM in _update_dict~~ [MRG] Use GEMM in _update_dict Jul 3, 2018

jnothman reviewed Jul 3, 2018

View reviewed changes

jnothman approved these changes Jul 4, 2018

View reviewed changes

ogrisel approved these changes Jul 17, 2018

View reviewed changes

add comment [ci skip]

4da1193

agramfort merged commit fc3a6cc into scikit-learn:master Jul 18, 2018

jakirkham deleted the use_gemm__update_dict branch July 18, 2018 14:58

Uh oh!

[MRG] Use GEMM in _update_dict #11420

[MRG] Use GEMM in _update_dict #11420

Uh oh!

Conversation

jakirkham commented Jul 3, 2018

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

jnothman Jul 3, 2018

Choose a reason for hiding this comment

Uh oh!

jakirkham Jul 4, 2018

Choose a reason for hiding this comment

Uh oh!

ogrisel Jul 17, 2018

Choose a reason for hiding this comment

Uh oh!

jakirkham Jul 17, 2018

Choose a reason for hiding this comment

Uh oh!

jakirkham commented Jul 4, 2018

Uh oh!

jnothman commented Jul 4, 2018

Uh oh!

jakirkham commented Jul 10, 2018

Uh oh!

jakirkham commented Jul 11, 2018

Uh oh!

jakirkham commented Jul 17, 2018

Uh oh!

ogrisel left a comment

Choose a reason for hiding this comment

Uh oh!

ogrisel Jul 17, 2018

Choose a reason for hiding this comment

Uh oh!

jakirkham commented Jul 18, 2018

Uh oh!

agramfort commented Jul 18, 2018

Uh oh!

jakirkham commented Jul 18, 2018

Uh oh!

Uh oh!