check sizes of arguments in dot; fixes #28617 #28666

ranocha · 2018-08-15T09:37:52Z

As discussed in #28617 and https://discourse.julialang.org/t/efficient-trace-of-product-of-matrices/13313/12, dot(A::AbstractArray, B::AbstractArray) should check whether size(A) == size(B) instead of length(A) == length(B).

andreasnoack · 2018-08-15T09:50:32Z

I think the check should actually be axes(A) == axes(B) to handle non-standard indices correctly.

ranocha · 2018-08-15T10:19:29Z

I think the check should actually be axes(A) == axes(B) to handle non-standard indices correctly.

I thought custom indices should only be some kind of simplification for special purposes. Thus, I would not forbid having two arrays of the same size but with different axes and calculating the dot product of them. Could you please explain why we should disallow something as

julia> using LinearAlgebra, OffsetArrays
                                                                                    
julia> A = [1 2; 3 4]; B = OffsetArray(A, (-1,-1))
OffsetArray(::Array{Int64,2}, 0:1, 0:1) with eltype Int64 with indices 0:1×0:1:
 1  2
 3  4

julia> dot(A, A)
30

julia> dot(A, B)
30

andreasnoack · 2018-08-15T12:49:11Z

The idea is that the indices of A and B should match. So for Array, we'd disallow dot(A, B) for A = randn(4) and B = randn(2,2) because vec(CartesianIndices(A)) != vec(CartesianIndices(B)). The generalization of this requirement to AbstractArray is to require axes(A) == axes(B). I'd also expect this to hold when working with OffsetArrays but I might, of course, be wrong. Eventually, I'd also think it would be reasonable to require that axes(A, 2) == axes(B, 1) in the matrix multiplication A*B.

ranocha · 2018-08-15T12:55:38Z

Thank you for the explanation. If this is the basic idea behind the checks (and A*B will work if axes(A, 2) == axes(B, 1)), there should be checks using axes, of course. I will update the PR accordingly.

garrison · 2018-08-15T13:59:37Z

stdlib/LinearAlgebra/src/generic.jl

-Compute the dot product between two vectors. For complex vectors, the first
-vector is conjugated. When the vectors have equal lengths, calling `dot` is
-semantically equivalent to `sum(dot(vx,vy) for (vx,vy) in zip(x, y))`.
+Compute the dot product between two arrays of the same size as if they were


"of the same size" -> "with the same axes" ? The "equal sizes" language is also used a few lines below...

I will change that.

garrison · 2018-08-15T14:01:06Z

stdlib/SparseArrays/src/linalg.jl

@@ -206,7 +206,9 @@ end
 # Frobenius dot/inner product: trace(A'B)
 function dot(A::SparseMatrixCSC{T1,S1},B::SparseMatrixCSC{T2,S2}) where {T1,T2,S1,S2}
    m, n = size(A)
-    size(B) == (m,n) || throw(DimensionMismatch("matrices must have the same dimensions"))
+    if size(B) != (m,n)


Is axes purposely not used here for some reason?

I thought SparseMatrixCSC uses definitely classical indexing as in line 208. Thus, I've just used size instead of axes. Should I change that?

For now, the SparseArrays code has not been modified to support arbitrary indices.
I suppose it would be generally less useful than for dense arrays.

ranocha · 2018-08-16T14:16:16Z

We should note somewhere that these changes are breaking. Since the NEWS.md file seems to be reserved for the official release of Julia v1.0, I don't know where I should add a comment. Could you help me, please?

stevengj · 2018-08-16T17:17:05Z

I guess the question is whether we consider this to be a bugfix or a change in the documented API.

ranocha · 2018-08-16T17:20:15Z

Following the discussion above, it seems that most people see this as a bugfix. Should I modify this PR in some way before it can be merged?

KristofferC · 2018-08-16T17:25:47Z

This is not a simple bug fix.

For any iterable containers x and y (including arrays of any dimension) of numbers (or any element
type for which dot is defined), compute the dot product (or inner product or scalar product), i.e. the
sum of dot(x[i],y[i]), as if they were vectors.

It works exactly like documented. We can't label things bugfixes just because we change what we like about the behavior. Also, master is currently closed for breaking changes.

ranocha · 2018-08-16T17:48:51Z

Okay. Do I have to anything now in that case?

KristofferC · 2018-08-16T17:56:17Z

This LGTM so I think this can rest for a while until we open up master or implement versioning in for the stdlibs.

garrison · 2018-12-02T05:10:51Z

Bump. Any chance this qualifies as a "minor change"?

StefanKarpinski · 2018-12-03T17:40:56Z

I've marked it as a "minor change" and put the "triage" label on it and 1.1 milestone; we can discuss whether to include it on Thursday's triage call.

JeffBezanson · 2018-12-06T19:50:02Z

I'm not sure this is useful at all --- it just adds errors to more cases. Very much not sure it's worth the potential breakage.

JeffBezanson · 2018-12-06T19:57:26Z

Also not obvious to me whether comparing axes or just sizes is the right thing.

StefanKarpinski · 2018-12-06T20:17:25Z

The general triage sentiment is that by default we need to lean heavily in favor of not breaking code. So this could happen but the burden is on those proposing this that this is (a) the right thing to do and (b) that it won't break any packages or applications. Running PkgEval on this branch would be a good first step to inform that decision—if nothing breaks, we can consider it.

JeffBezanson · 2019-02-14T20:33:32Z

Triage says 👍 but we need a PkgEval run.

marius311 · 2019-06-25T20:50:40Z

I'm for the behavior in this PR. Also, It should be marked as fixing #32395 (which I opened).

andreasnoack · 2019-07-24T21:35:42Z

@ararslan Would you be able to run PkgEval here?

ararslan · 2019-07-25T15:36:38Z

I won't likely be able to get to it for a while.

maleadt · 2019-12-19T06:47:42Z

@nanosoldier runtests(ALL, vs = ":master")

nanosoldier · 2019-12-19T22:23:11Z

Your test job has completed - possible new issues were detected. A full report can be found here. cc @maleadt

JeffBezanson · 2019-12-19T22:27:33Z

Ok, this does indeed cause some package test failures. That matches my intuition that, given that we allow passing arbitrary iterators to dot, being permissive here can be useful.

andreasnoack · 2021-09-28T07:27:29Z

I just read through the error logs here and I think we have drawn the wrong conclusion from them. Most of the failures are unrelated to this PR. They are either segfaults or look intermittent. Then three of them fail because of an unfortunate sum method in StatsBase that was removed in JuliaStats/StatsBase.jl#526. Finally, it looks like there are just a few cases where somebody ends up dotting a vector and a one-column matrix. The latter might be something that people actually find convenient but let's rebase one and rerun PkgEval to get an updated view on this.

…but keep the tests

maleadt · 2021-09-29T16:24:01Z

Third time's the charm (and comes with a prevind fix):

@nanosoldier runtests(ALL, vs = ":master")

nanosoldier · 2021-09-29T23:02:48Z

Your package evaluation job has completed - possible new issues were detected. A full report can be found here.

andreasnoack · 2021-09-30T13:01:13Z

I've tried to summarize the reason for the failures in the table below. No packages rely on dotting arrays of different shape except for singleton dimensions. A few packages rely on dotting one column matrix with a vector and a few packages dots a one row matrix with a vector. There are also some numerical differences which might be because some calls now use our dot instead of the dot from BLAS. We can revert that but I think the tests might just be too strict and that it would be better to go through the packages and adjust the tests.

I think it's unfortunate that we don't provide an error in the motivating example in #28617 but maybe it's not a sufficiently common case to warrant the the effort required for introducing the check.

Package	Reason	src or test
AeroMDAO	One column matrix	src
CompressiveLearning	One column matrix	src
CopEnt	Unrelated	N/A
EnhancedGJK	One column matrix	src
ExponentialUtilities	Unrelated	N/A
FaultDetectionTools	Unrelated	N/A
FilesystemDatastructures	Unrelated	N/A
GridArrays	Unrelated	N/A
ImageDistances	Unrelated	N/A
ImmersedLayers	Marginally different results	N/A
IndependentComponentAnalysis	Unrelated	N/A
KissMCMC	Marginally different results	N/A
LearningHorse	One row(?!?) matrix	src
MultinomialRegression	Marginally different results	N/A
MutableArithmetics	One column matrix	test
OpSel	One column matrix	src
QuantumTomography	Unrelated	N/A
RegressionDiscontinuity	One row(?!?) matrix	src
SolverTools	Marginally different results	N/A
Symbolic	Unrelated	N/A
TensorCore	One row(?!?) matrix	Test
UnbalancedOptimalTransport	One column matrix	src
WaveFD	Marginally different results	N/A
Widgets	Unrelated	N/A

KristofferC added the kind:breaking This change will break code label Aug 15, 2018

garrison reviewed Aug 15, 2018

View reviewed changes

garrison added the domain:linear algebra Linear algebra label Aug 15, 2018

StefanKarpinski added kind:minor change Marginal behavior change acceptable for a minor release status:triage This should be discussed on a triage call labels Dec 3, 2018

StefanKarpinski added this to the 1.1 milestone Dec 3, 2018

JeffBezanson modified the milestones: 1.1, 1.2 Dec 7, 2018

JeffBezanson added needs pkgeval Tests for all registered packages should be run with this change and removed status:triage This should be discussed on a triage call labels Feb 14, 2019

JeffBezanson modified the milestones: 1.2, 1.3 Apr 25, 2019

fredrikekre mentioned this pull request Jun 24, 2019

dot(x::Adjoint, y::AbstractVector) gives inconsistent answer #32395

Open

JeffBezanson modified the milestones: 1.3, 1.4 Aug 15, 2019

JuliaLang deleted a comment from nanosoldier Dec 19, 2019

JuliaLang deleted a comment from JeffBezanson Dec 19, 2019

JeffBezanson removed this from the 1.4 milestone Dec 20, 2019

ranocha added 3 commits September 28, 2021 09:42

check sizes of arguments in dot; fixes JuliaLang#28617

6e97d2a

check for axes instead of size in dot

5f2b9e1

adapt docstring of dot: size -> axes

2bdedc1

andreasnoack force-pushed the dot_check_sizes branch from 5d48509 to 2bdedc1 Compare September 28, 2021 07:43

andreasnoack added 2 commits September 28, 2021 10:48

Remove redundant dot definitions in matmul.jl

3dca92e

Remove redundant BLAS.dot(c/u) methods introduced in JuliaLang#39751 …

8a5200e

…but keep the tests

check sizes of arguments in dot; fixes #28617 #28666

Are you sure you want to change the base?

check sizes of arguments in dot; fixes #28617 #28666

Conversation

ranocha commented Aug 15, 2018

andreasnoack commented Aug 15, 2018

ranocha commented Aug 15, 2018

andreasnoack commented Aug 15, 2018

ranocha commented Aug 15, 2018

garrison Aug 15, 2018

Choose a reason for hiding this comment

ranocha Aug 15, 2018

Choose a reason for hiding this comment

garrison Aug 15, 2018

Choose a reason for hiding this comment

ranocha Aug 15, 2018

Choose a reason for hiding this comment

jebej Aug 15, 2018

Choose a reason for hiding this comment

ranocha commented Aug 16, 2018

stevengj commented Aug 16, 2018

ranocha commented Aug 16, 2018 • edited

KristofferC commented Aug 16, 2018

ranocha commented Aug 16, 2018

KristofferC commented Aug 16, 2018

garrison commented Dec 2, 2018 • edited

StefanKarpinski commented Dec 3, 2018

JeffBezanson commented Dec 6, 2018

JeffBezanson commented Dec 6, 2018

StefanKarpinski commented Dec 6, 2018

JeffBezanson commented Feb 14, 2019

marius311 commented Jun 25, 2019

andreasnoack commented Jul 24, 2019

ararslan commented Jul 25, 2019

maleadt commented Dec 19, 2019

nanosoldier commented Dec 19, 2019

JeffBezanson commented Dec 19, 2019

andreasnoack commented Sep 28, 2021

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

maleadt commented Sep 29, 2021

nanosoldier commented Sep 29, 2021

andreasnoack commented Sep 30, 2021

ranocha commented Aug 16, 2018 •

edited

garrison commented Dec 2, 2018 •

edited