Define factorize(Adjoint) to make e.g. inv(Adjoint) work (in most cases) #26302

andreasnoack · 2018-03-02T21:51:42Z

Fixes #26299. It will still fail for complex symmetric matrices and it will require a few more definitions to handle that (relatively rare) case so I've added a @test_broken for now.

Sacha0 · 2018-03-02T22:04:40Z

stdlib/LinearAlgebra/src/dense.jl

@@ -1222,6 +1222,7 @@ function factorize(A::StridedMatrix{T}) where T
    end
    qrfact(A, Val(true))
 end
+factorize(A::Adjoint) = adjoint(factorize(parent(A)))


Why prefer adjoint(factorize(parent(A))) over factorize(copy(A))? Prefer avoiding an allocation to generating an object that may dispatch more nicely downstream?

As you say, it saves an allocation and I think this within the core business of Adjoint since the main thing to do after factorize is solving and most solves ought to have methods for Adjoint.

Sacha0

Thanks Andreas! :)

garrison · 2018-03-02T23:01:48Z

Thanks! Probably makes sense to do a similar thing for Transpose as well?

stevengj · 2018-03-03T04:19:02Z

And ConjMatrix…

andreasnoack · 2018-03-03T16:02:53Z

@garrison I've added a version for Transpose as well.
@stevengj ConjArray was deprecated in #25217

fredrikekre · 2018-03-03T16:39:27Z

I think the user should do inv(copy(A')) (or inv(A)' here instead of doing the materialization automatically.

mbauman · 2018-03-05T16:41:37Z

Could you expand on that, @fredrikekre? Adjoints are themselves AbstractMatrixes, supporting all sorts of abstract implementations beyond those specifically targeting ::Adjoint.

mbauman · 2018-03-05T16:44:45Z

stdlib/LinearAlgebra/test/dense.jl

+    B = complex.(A, randn(n, n))
+    B = B + transpose(B)
+
+    @test_broken inv(B')*B'                     ≈ I


Could you add a note about why this is broken and what would be required to fix it?

fredrikekre · 2018-03-05T21:23:39Z

Could you expand on that, @fredrikekre?

There is just so many cases where Adjoint/Tranpose does not work right now (see e.g. #25331), and is the solution really to add methods for these in all cases? For #25331 this would require many combination of methods.

In this particular case the 0.7 equivalent of inv(A') is inv(copy(A')) and I think it might be nice to be explicit about the materialization of the lazy adjoint here, and not "fool" the user that inv can somehow take advantage of a lazy transpose. But on the other hand Adjoint <: AbstractArray so we can not stop them from dispatching to methods accepting AbstractArrays.

Perhaps it would have been different if Adjoint/Transpose weren't AbstractArrays, in that case we could choose to implement only methods were we can take advantage of the lazyness.

andreasnoack · 2018-03-05T21:49:37Z

I think it might be nice to be explicit about the materialization of the lazy adjoint here, and not "fool" the user that inv can somehow take advantage of a lazy transpose.

Generally, there is a tension here between more and less experienced users. Many non-expert users would like to avoid taking Adjoint into account and just get a result even if it has some unnecessary allocations. People more interested in the implementation might prefer an error here but I'm not completely convinced. Specifically for this PR, there is actually an advantage from using Adjoint over the inv(copy(A')) so I think the critique doesn't apply in this particular case.

mbauman · 2018-03-05T22:40:52Z

For #25331 this would require many combination of methods.

That's not true at all. Check out #26331.

The totally generic AbstractArray implementations will be pretty terrible until we have a way to improve iteration order over row-major arrays. But I believe that, too, will come in time.

work (in most cases)

Jutho · 2018-03-07T02:08:59Z

I think the user should do inv(copy(A')) (or inv(A)' here instead of doing the materialization automatically.

I am confused. The current implementation does not materialize A', is that correct?

On the Adjoint <: AbstractArray issue being raised in this topic, one inconsistency with this is that Factorization is not a subtype of AbstractArray, but then if you take its adjoint, it gets wrapped in a Adjoint object and so the adjoint of a Factorization suddenly does become a subtype of AbstractArray.

andreasnoack · 2018-03-07T07:29:35Z

I am confused. The current implementation does not materialize A', is that correct?

That is correct.

Your second point is also correct. Although a bit annoying, it only seems to be a theoretical and not a practical concern. If it becomes a real problem, I think we should just make Factorization a subtype of AbstractMatrix. Otherwise, we'd have to give up adjoint solves which would be unfortunate.

StefanKarpinski · 2018-03-07T14:45:53Z

I think we should just make Factorization a subtype of AbstractMatrix.

Any reason not to do this?

stevengj · 2018-03-07T14:49:35Z

Making it a subtype of AbstractMatrix would require that we have getindex, but for an n×n Factorization object getindex would be O(n) rather than O(1).

In general, making Adjoint and Transpose subtypes of AbstractArray seems problematic for any kind of "matrix-free" data structure, where you want to be able to take the adjoint but don't have explicit matrix storage or any efficient way to obtain individual entries.

andreasnoack · 2018-03-07T15:08:41Z

Ref: #1412 and #10064

For QR, I think indexing would as costly as O(n^2).

In general, making Adjoint and Transpose subtypes of AbstractArray seems problematic for any kind of "matrix-free" data structure, where you want to be able to take the adjoint but don't have explicit matrix storage or any efficient way to obtain individual entries.

Which potential issues are there here? Isn't it just that you can hit an operation that takes forever or maybe consumes all your memory. We already have these issues with other AbstractArrays such as SparseMatrixCSC and DistributedArrays. Is there a feasible alternative? If we don't have Adjoint<:AbstractMatrix then Adjoitn{<:AbstractMatrix} can't benefit from AbstractMatrix methods.

Jutho · 2018-03-07T15:23:09Z

Are there, except for the <:Factorizations, other types of objects which are wrapped in Adjoint but are not themselves <:AbstractArray?

Theoretically, it should be possible to make the adjoint of a qr factorization be such that it is the lq factorization with matrices L=Adjoint(R), Q=Adjoint(Q) and vice versa. But I am not sure if that is easy to express in the way the information of the QR factorization is encoded.

The adjoint of an SVD factorization is again an SVD factorization. Not sure about the others though (pivoted LU, ...).

andreasnoack · 2018-03-07T15:42:44Z

Theoretically, it should be possible to make the adjoint of a qr factorization be such that it is the lq factorization with matrices L=Adjoint(R), Q=Adjoint(Q) and vice versa. But I am not sure if that is easy to express in the way the information of the QR factorization is encoded.

Initially, I thought this couldn't work but now I'm in doubt. At least for the factorizations that are parametric on the array type this might actually work. I'll have to try it out. Our QR is probably a good test case.

andreasnoack · 2018-03-07T15:53:17Z

I've just tried to play around with this and, unfortunately, I don't think this can really work for the QR. It would require more type parameters for the factorization since the field for storing the blocked loadings isn't parametric. Even if we did that, it would be quite awkward compared to just wrapping the factorization in Adjoint.

Sacha0 reviewed Mar 2, 2018

View reviewed changes

Sacha0 approved these changes Mar 2, 2018

View reviewed changes

andreasnoack force-pushed the anj/adjfact branch from b4a3d2d to cfee1b8 Compare March 3, 2018 15:53

mbauman reviewed Mar 5, 2018

View reviewed changes

Define factorize(Adjoint/Transpose) to make e.g. inv(Adjoint/Transpose)

0f180c7

work (in most cases)

andreasnoack force-pushed the anj/adjfact branch from cfee1b8 to 0f180c7 Compare March 6, 2018 08:23

mbauman approved these changes Mar 7, 2018

View reviewed changes

andreasnoack merged commit 8613942 into master Mar 7, 2018

andreasnoack deleted the anj/adjfact branch March 7, 2018 12:58

StefanKarpinski mentioned this pull request Jul 9, 2019

Inverse broken for adjoint matrices #32537

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Define factorize(Adjoint) to make e.g. inv(Adjoint) work (in most cases) #26302

Define factorize(Adjoint) to make e.g. inv(Adjoint) work (in most cases) #26302

andreasnoack commented Mar 2, 2018

Sacha0 Mar 2, 2018

andreasnoack Mar 3, 2018

Sacha0 left a comment

garrison commented Mar 2, 2018 •

edited

stevengj commented Mar 3, 2018

andreasnoack commented Mar 3, 2018

fredrikekre commented Mar 3, 2018

mbauman commented Mar 5, 2018

mbauman Mar 5, 2018

fredrikekre commented Mar 5, 2018

andreasnoack commented Mar 5, 2018 •

edited

mbauman commented Mar 5, 2018

Jutho commented Mar 7, 2018

andreasnoack commented Mar 7, 2018

StefanKarpinski commented Mar 7, 2018

stevengj commented Mar 7, 2018 •

edited

andreasnoack commented Mar 7, 2018

Jutho commented Mar 7, 2018

andreasnoack commented Mar 7, 2018

andreasnoack commented Mar 7, 2018

Define factorize(Adjoint) to make e.g. inv(Adjoint) work (in most cases) #26302

Define factorize(Adjoint) to make e.g. inv(Adjoint) work (in most cases) #26302

Conversation

andreasnoack commented Mar 2, 2018

Sacha0 Mar 2, 2018

Choose a reason for hiding this comment

andreasnoack Mar 3, 2018

Choose a reason for hiding this comment

Sacha0 left a comment

Choose a reason for hiding this comment

garrison commented Mar 2, 2018 • edited

stevengj commented Mar 3, 2018

andreasnoack commented Mar 3, 2018

fredrikekre commented Mar 3, 2018

mbauman commented Mar 5, 2018

mbauman Mar 5, 2018

Choose a reason for hiding this comment

fredrikekre commented Mar 5, 2018

andreasnoack commented Mar 5, 2018 • edited

mbauman commented Mar 5, 2018

Jutho commented Mar 7, 2018

andreasnoack commented Mar 7, 2018

StefanKarpinski commented Mar 7, 2018

stevengj commented Mar 7, 2018 • edited

andreasnoack commented Mar 7, 2018

Jutho commented Mar 7, 2018

andreasnoack commented Mar 7, 2018

andreasnoack commented Mar 7, 2018

garrison commented Mar 2, 2018 •

edited

andreasnoack commented Mar 5, 2018 •

edited

stevengj commented Mar 7, 2018 •

edited