
Dispatch to generic matrix product rather than gemm for reshaped transposed arrays #30988

Open
ettersi opened this issue Feb 7, 2019 · 5 comments
Labels
domain:linear algebra Linear algebra performance Must go faster

Comments

@ettersi (Contributor) commented Feb 7, 2019

julia> n = 1000;
       @time rand(n,n) * reshape(rand(n,n)', (n,n))
       @time rand(n,n) * Matrix( reshape(rand(n,n)', (n,n)) )
       ;
  0.834937 seconds (20 allocations: 22.889 MiB, 1.68% gc time)
  0.022891 seconds (16 allocations: 30.518 MiB)

Profiling indicates that the difference in runtimes comes about because the first call dispatches to _generic_matmul while the second dispatches to gemm!.
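The dispatch difference can be seen directly from the types involved. A minimal sketch (plain Julia with the LinearAlgebra stdlib): reshaping an Adjoint/Transpose wrapper produces a `Base.ReshapedArray`, which is not a `Matrix`, so the product misses the BLAS-backed methods.

```julia
# Sketch: reshape of a wrapper type yields Base.ReshapedArray, not Matrix,
# so A * B below falls back to the generic matrix product instead of gemm!.
using LinearAlgebra

n = 4
A = rand(n, n)
B = reshape(A', (n, n))   # wraps the Adjoint in a Base.ReshapedArray
C = reshape(A, (n, n))    # reshape of a plain Array stays an Array

println(typeof(B))        # Base.ReshapedArray{Float64, 2, ...}
println(typeof(C))        # Matrix{Float64}
```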

Related issues:

@ararslan ararslan added the domain:linear algebra Linear algebra label Feb 7, 2019
@andreasnoack andreasnoack added the performance Must go faster label Feb 17, 2019
@andreasnoack (Member)

It's generally best to avoid ReshapedArrays since they almost always end up dispatching to very slow methods. I wouldn't like to add ReshapedArrays methods for all functions that accept AbstractMatrix. Eventually, it might become possible to do something smarter with ReshapedArrays than fetching elements one-by-one but for now, the best thing you can do is to materialize a ReshapedArray.

@ettersi (Contributor, Author) commented Feb 18, 2019

I think it would be justified to specialise * such that it materialises reshaped arrays before multiplying. It's a common enough operation, the performance penalty is huge, and figuring out that reshape(transpose(A), (n,n)) is the problem is rather tricky (both reshape(A, (n,n)) and transpose(A) on their own would be fine).
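Such a specialization could look roughly like the following hypothetical sketch. Nothing like this exists in Base; the signature and element-type constraint are illustrative only, and a real method would have to consider ambiguities and non-BLAS element types.

```julia
# Hypothetical sketch only: copy the ReshapedArray into a dense Matrix first,
# so the actual multiplication dispatches to the fast BLAS gemm! path.
import Base: *

function *(A::StridedMatrix{T}, B::Base.ReshapedArray{T,2}) where {T<:Union{Float32,Float64}}
    return A * Matrix(B)   # Matrix(B) materializes; dense * dense hits gemm!
end
```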

In case anyone else wonders why Julia has a lazy ReshapedArray type that only seems to make things slower: reshape is required to maintain the link to the underlying data, so this function cannot materialise.
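The constraint can be demonstrated in a few lines: the reshaped object must alias the parent's memory, so writes through it are visible in the original array, which a copying `reshape` would silently break.

```julia
# reshape must return a view-like object sharing the parent's storage.
A = zeros(2, 2)
v = reshape(A, 4)   # shares data with A; no copy is allowed here
v[1] = 42.0
println(A[1, 1])    # 42.0 — mutation through v is visible in A
```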

@andreasnoack (Member)

> I think it would be justified to specialise * such that it materialises reshaped arrays before multiplying.

This is a problem all over the place, and I'm not sure reshaping a transposed array is necessarily the most common place where it causes trouble. Consider, e.g., things like

julia> @time sum(spzeros(10000,10000))
  0.000014 seconds (12 allocations: 78.672 KiB)
0.0

julia> @time sum(vec(spzeros(10000,10000)))
  0.230635 seconds (17 allocations: 78.797 KiB)
0.0

and similar things happen for most array types.

> reshape is required to maintain the link to the underlying data, so this function cannot materialise.

That was indeed the conclusion in #4211. However, I think it's worth reconsidering for Julia 2, see #4211 (comment)

@ettersi (Contributor, Author) commented Feb 18, 2019

I'm not sure I understand. Are you suggesting not to materialise ReshapedArrays because for some array types (e.g. sparse arrays) materialising them to dense arrays could make things more costly?

@andreasnoack (Member)

No. I'm saying that the problems with ReshapedArrays are all over the place, so we could either add materializing methods for ReshapedArrays to every function that has an AbstractArray method, or we could ask users to materialize them explicitly. I think the latter makes more sense.
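Concretely, the explicit materialization the thread converges on is a one-line change at the call site (a sketch; `Matrix(...)` or `copy(...)` both produce a plain dense array here):

```julia
# Workaround from this thread: materialize the ReshapedArray explicitly so the
# multiplication dispatches to BLAS instead of the generic fallback.
n = 1000
A = rand(n, n)
B = reshape(rand(n, n)', (n, n))   # slow path if multiplied directly
C = A * Matrix(B)                  # fast: dense * dense hits gemm!
```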
