(H+μI) \ x solvers for Hessenberg factorizations #31853

stevengj · 2019-04-27T03:46:41Z

As discussed in #31738, this adds a few more methods for the Hessenberg type:

The Hessenberg type now includes a shift μ, so that you can form H+μ*I without making a copy.
It also includes an O(n²) ldiv! solver for (H+μI) \ x using an algorithm by Henry (1994) that requires only O(n) auxiliary storage and does not modify H.
also fast det, logabsdet/logdet, and multiplication by scalars

(I wasn't entirely sure whether to store the shift as H+μI or H-μI … computationally it makes no difference, but it changes what the user gets if they look at H.μ.)

Possible to-dos:

One thing I would like to support is a complex shift μ even for a real H, since this shows up in many real problems and it would be nice to support efficiently (i.e. without forcing you to complexify H). Unfortunately the LAPACK *ormhr can't multiply a real Q by a complex vector, so it seems like we'd have to copy the real/imaginary parts to separate arrays to copy.
I didn't implement rdiv!. Should be straightforward but seems a bit tedious to rederive Henry's algorithm for this case, ~~so I'm inclined to leave it for a hypothetical future PR.~~ Done.
Would be good to pretty-print Hessenberg objects as currently the output is rather inscrutable.
adjoints/transposes of Hessenberg factorizations.
fast det(H+µI) and logabsdet(H+µI)
UpperHessenberg matrix type (analogous to UpperTriangular) for storing upper-Hessenberg matrices ~~(possibly including shifts)~~. F.H now returns an UpperHessenberg matrix.
specialized Hessenberg factorization for Hermitian matrices, in which case H is real SymTridiagonal.

cc @jiahao since we just chatted about this.

StefanKarpinski

I’m not really qualified to review but this is an impressively thorough PR. @jiahao, maybe you could review?

andreasnoack · 2019-04-29T11:44:36Z

I'll try to wrap my head around this tonight.

stevengj · 2019-04-29T12:20:31Z

One related change that I was considering: change Hessenberg(A) to simply assume that A is already upper Hessenberg (either calling triu!(A,-1) or simply ignoring the lower triangle), so that you need to use hessenberg(A) or hessenberg!(A) if you want to compute the Hessenberg factorization.

Rationale:

If you have a matrix that is already upper-Hessenberg for some reason, you might want to take advantage of solvers for things like (H+µI) \ x without going through LAPACK's factorization routine.
This makes the Hessenberg(A) constructor more like UpperTriangular(A) etcetera.
The Hessenberg(A) constructor is currently undocumented (we only documented hessenberg and hessenberg!), so it doesn't seem problematic to change.
Currently, Hessenberg(A) and hessenberg!(A) are redundant — if we have two functions, why not get two different functionalities?

Alternatively, it could be a new function called Hessenberg!(A) since it modifies A. (I recall that there was a related discussion about Hermitian!(A), but I can't find it…)

But that could be a separate PR.

StefanKarpinski · 2019-04-30T13:24:56Z

I resisted the String!, Symmetric!, Hermitian! and now Hessenberg! pattern but it seems like many people have naturally converged on this same convention and it makes intuitive sense to people, so I've come around to the idea of doing this. I know that @JeffBezanson was also skeptical about this previously—have you had any further thoughts?

stevengj · 2019-04-30T13:55:34Z

Probably in this case we should avoid Hessenberg! simply by having H = Hessenberg(A) ignore the lower triangle of A, similar to UpperTriangular(A).

(This could be signified in the data structure in various ways, most simply by having an empty H.τ array.)

andreasnoack

Great upgrade of the Hessenberg factorization.

Did you consider a separate HessenbergMatrix? I'm not sure it's worth the trouble but it would allow us to e.g. just overload ldiv! instead of introducing special solve functions. It might also be useful for the eigenvalue problem.

Probably in this case we should avoid Hessenberg! simply by having H = Hessenberg(A) ignore the lower triangle of A, similar to UpperTriangular(A).

I agree. I think it would be a good idea to make Hessenberg just wrap the input matrix with an empty τ vector.

stdlib/LinearAlgebra/src/hessenberg.jl

stevengj · 2019-04-30T21:07:06Z

Did you consider a separate HessenbergMatrix? I'm not sure it's worth the trouble but it would allow us to e.g. just overload ldiv! instead of introducing special solve functions. It might also be useful for the eigenvalue problem.

I didn't think about this… it's an interesting thought, but I have a couple of questions:

If we add Hessenberg(H) for a matrix that is already in upper-Hessenberg form (i.e. Q=I), then a separate HessenbergMatrix type seems a bit redundant? Presumably we would support HessenbergMatrix(H) instead of Hessenberg(H) with an empty τ array?
Having both HessenbergMatrix and Hessenberg might be a bit confusing? On the other hand, this allows us to expose the AbstractMatrix interface (getindex etc.) for HessenbergMatrix.
I see that https://github.com/JuliaLinearAlgebra/GenericLinearAlgebra.jl/blob/31d2d8012347de93611902a8579659b7a4dc480c/src/eigenGeneral.jl defines a HessenbergMatrix type, so if we defined our own and exported it it might be breaking … I guess we would have to not export it? On the other hand, it wouldn't be conflicting if we called the new type UpperHessenberg as I suggest below, as that name seems currently unused by any registered Julia package.
Would HessenbergMatrix also include the shift μ?

I would tend to call it UpperHessenberg in analogy with UpperTriangular, rather than HessenbergMatrix.

stevengj · 2019-05-01T18:00:24Z

Update: added an UpperHessenberg matrix type analogous to UpperTriangular. For a F::Hessenberg factorization object, F.H now returns this type, which means that F.H is now allocation-free. For example:

julia> A = rand(6,6)
6×6 Array{Float64,2}:
 0.107756   0.181333  0.366188   0.441492   0.0903661  0.342221  
 0.236713   0.121664  0.769639   0.878645   0.774642   0.464094  
 0.204347   0.110714  0.993152   0.0257284  0.408132   0.00811852
 0.0485218  0.618574  0.856544   0.498366   0.109      0.485984  
 0.0248759  0.652369  0.0578524  0.522572   0.615913   0.655619  
 0.0747064  0.344599  0.123562   0.915485   0.732657   0.752754  

julia> F = hessenberg(A);

julia> F.H
6×6 UpperHessenberg{Float64,Array{Float64,2},Bool}:
  0.107756  -0.512071   0.394723  -0.081845   0.076756    0.23782  
 -0.326105   1.48647   -1.21715    0.341651   0.680446   -0.364656 
   ⋅        -1.25486    0.833861   0.357741   0.214807    0.120456 
   ⋅          ⋅         1.11037    0.478931   0.217121   -0.121404 
   ⋅          ⋅          ⋅        -0.31514    0.143539    0.219257 
   ⋅          ⋅          ⋅          ⋅        -0.0626903   0.0390469

stevengj · 2019-05-01T21:08:10Z

I made the τ type even more generic in the Hessenberg and HessenbergQ objects. Hopefully it is generic enough now for you to use in your GenericLinearAlgebra package instead of your own HessenbergFactorization and HessenbergMatrix types (where τ is a Vector{Householder{T}}) @andreasnoack?

stevengj · 2019-05-01T22:44:25Z

Upon reflection, I'm not happy with including a shift µ in the UpperHessenberg matrix type. Pretty soon I would like to have hessenberg(A::Hermitian) call LAPACK zhetrd etc. to produce a SymTridiagonal matrix instead of an upper-Hessenberg matrix, with corresponding efficient shifted solvers, and I don't want to add a shift to SymTridiagonal. I see two options:

Introduce a ShiftedMatrix{T, S<:AbstractMatrix, V} type that wraps a matrix data::S and a shift μ::V, then have specialized routines acting on this. I'm reluctant to do this because it contributes to the combinatorial explosion of matrix types that we need to handle.
Store the shift μ in the Hessenberg factorization type, and add optional shift keyword arguments to ldiv!, det, etcetera for the UpperHessenberg and SymTridiagonal matrix types so that Hessenberg solves can pass in the shift.

I'm inclined towards the latter, since copy-free shifts mainly seem useful in re-using Hessenberg factorizations.

stevengj · 2019-05-06T15:57:41Z

AppVeyor failure is #29880. Travis is GitError(Code:ERROR, Class:Net, curl error: Could not resolve host: github.com

stevengj · 2019-05-11T00:23:03Z

I updated it to add specialized Hessenberg factorizations for Hermitian matrices, in which case F.H is real SymTridiagonal. This seemed like a good test of generality, and forced me to make a few changes to the types.

In the SymTridiagonal case, the solves are already O(n) with O(n) allocation, so it seems like there is not too much to be gained by a specialized solver ala Henry, especially since multiplying by F.Q is still O(n²). I did add a shift keyword in a couple of places so that the LDLᵀ factorization can be formed without allocating a second copy of the matrix A+μI, and to simplify the Hessenberg methods.

stevengj · 2019-05-11T12:32:00Z

Travis osx failure is unrelated Error in testset FileWatching.

stevengj · 2019-05-12T01:42:31Z

@andreasnoack, I've added quite a bit since the last time you reviewed; any comments prior to merging?

stevengj · 2019-05-15T12:46:06Z

Will merge at the end of the week if there are no further comments.

…ation for SymTridiagonal factorization of Hermitian A

stdlib/LinearAlgebra/docs/src/index.md

stevengj · 2019-05-17T18:13:12Z

Tests are passing again except for the unrelated "failed running test Profile" on win32.

Good to merge?

andreasnoack · 2019-08-28T08:55:38Z

@stevengj While updating GenericLinearAlgebra to work with this change I wondered why you added the sym parameter. Isn't the information already reflected in the type of the wrapped matrix, i.e. SymTridiagonal?

stevengj · 2019-08-28T12:40:20Z

@andreasnoack, for hessenberg(A::Hermitian), the SymTridiagonal type is stored only in the type parameter SH of the Hessenberg object. It is not in any of the type parameters of HessenbergQ, which is why I needed a sym parameter there. (Q.factors and Q.τ are ordinary Arrays, but are interpreted differently if they came from a Hermitian factorization.)

For example:

julia> H = hessenberg(Hermitian(rand(4,4)))
Hessenberg{Float64,SymTridiagonal{Float64,Array{Float64,1}},Array{Float64,2},Array{Float64,1},Bool}
Q factor:
4×4 LinearAlgebra.HessenbergQ{Float64,Array{Float64,2},Array{Float64,1},true}:
  0.857333   0.327767  -0.396924  0.0
  0.135862  -0.88782   -0.439679  0.0
 -0.496509   0.323024  -0.805688  0.0
  0.0        0.0        0.0       1.0
H factor:
4×4 SymTridiagonal{Float64,Array{Float64,1}}:
 0.207701   0.0808081    ⋅          ⋅       
 0.0808081  0.211438    0.510799    ⋅       
  ⋅         0.510799    1.40159   -0.86616  
  ⋅          ⋅         -0.86616    0.0661149

julia> H.Q
4×4 LinearAlgebra.HessenbergQ{Float64,Array{Float64,2},Array{Float64,1},true}:
  0.857333   0.327767  -0.396924  0.0
  0.135862  -0.88782   -0.439679  0.0
 -0.496509   0.323024  -0.805688  0.0
  0.0        0.0        0.0       1.0

stevengj added linear algebra Linear algebra needs news A NEWS entry is required for this change labels Apr 27, 2019

stevengj requested a review from andreasnoack April 27, 2019 03:46

stevengj mentioned this pull request Apr 27, 2019

more methods for Hessenberg factorizations #31738

Closed

6 tasks

stevengj changed the title ~~WIP: (H+μI) \ x solvers for Hessenberg factorizations~~ (H+μI) \ x solvers for Hessenberg factorizations Apr 27, 2019

stevengj removed the needs news A NEWS entry is required for this change label Apr 27, 2019

StefanKarpinski approved these changes Apr 29, 2019

View reviewed changes

andreasnoack approved these changes Apr 30, 2019

View reviewed changes

stdlib/LinearAlgebra/src/hessenberg.jl Outdated Show resolved Hide resolved

This was referenced Apr 30, 2019

remove SLICOT-based code JuliaSystems/LTISystems.jl#11

Merged

better Hessenberg solvers RalphAS/GenericSchur.jl#2

Closed

stevengj and others added 7 commits May 15, 2019 09:53

first draft of new Hessenberg operations

e1539f9

fixes

6ca092d

more fixes and tests

c914120

fixes and tests

02c4c02

fix ambiguity

7e1e452

fix complex shift, pretty-print

600abb4

ambiguity fix

c029fd9

stevengj and others added 15 commits May 15, 2019 09:53

hessenberg docs

53d870e

tweak

7ba75c4

fix doc reference for non-exported HessenbergQ

4b6c12b

make τ type more generic

330ca95

update NEWS

3d37fb2

put shifts into Hessenberg object, store factors separately in prepar…

8e2dbea

…ation for SymTridiagonal factorization of Hermitian A

correct comment

8d612b8

hessenberg for Hermitian matrices -> real-symmetric tridiagonal

9056d42

shifted solvers for symtridiag

9f5c32d

rm redundant methods

1ecd267

test workaround

ce05928

news for tridiagonal Hessenberg

01df020

grammar

e4afbc4

make sure Matrix(H.Q) is tested

2a91795

rm workaround now that 32001 is merged

ef3644f

stevengj force-pushed the sgj/hessenbetter branch from c06694b to ef3644f Compare May 15, 2019 13:55

StefanKarpinski reviewed May 15, 2019

View reviewed changes

stdlib/LinearAlgebra/docs/src/index.md Outdated Show resolved Hide resolved

stevengj added 2 commits May 15, 2019 10:54

clarifications

41ea262

fix typo

b3d33ed

StefanKarpinski merged commit a0d831c into master May 17, 2019

StefanKarpinski deleted the sgj/hessenbetter branch May 17, 2019 18:31

dkarrasch mentioned this pull request May 23, 2019

doc: add factorizations docstrings #31284

Merged

c42f mentioned this pull request May 28, 2019

Fix unsigned wrap around in lpad/rpad/string allocation #32161

Merged

andreasnoack mentioned this pull request Jun 21, 2019

givens does not work with integer vectors #32388

Closed

KristofferC mentioned this pull request Sep 6, 2019

Heads up about test error on 1.3 Jutho/KrylovKit.jl#21

Closed

jyjemily mentioned this pull request May 29, 2023

NEWS 수정 juliakorea/translate-doc#32

Open

dkarrasch mentioned this pull request Jul 24, 2023

Missing method in /(::Adjoint{ComplexF64}, Hessenberg{Float64}) #50617

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(H+μI) \ x solvers for Hessenberg factorizations #31853

(H+μI) \ x solvers for Hessenberg factorizations #31853

stevengj commented Apr 27, 2019 •

edited

StefanKarpinski left a comment

andreasnoack commented Apr 29, 2019

stevengj commented Apr 29, 2019 •

edited

StefanKarpinski commented Apr 30, 2019

stevengj commented Apr 30, 2019 •

edited

andreasnoack left a comment

stevengj commented Apr 30, 2019 •

edited

stevengj commented May 1, 2019 •

edited

stevengj commented May 1, 2019 •

edited

stevengj commented May 1, 2019 •

edited

stevengj commented May 6, 2019

stevengj commented May 11, 2019

stevengj commented May 11, 2019

stevengj commented May 12, 2019

stevengj commented May 15, 2019

stevengj commented May 17, 2019 •

edited

andreasnoack commented Aug 28, 2019

stevengj commented Aug 28, 2019

(H+­μI) \ x solvers for Hessenberg factorizations #31853

(H+­μI) \ x solvers for Hessenberg factorizations #31853

Conversation

stevengj commented Apr 27, 2019 • edited

StefanKarpinski left a comment

Choose a reason for hiding this comment

andreasnoack commented Apr 29, 2019

stevengj commented Apr 29, 2019 • edited

StefanKarpinski commented Apr 30, 2019

stevengj commented Apr 30, 2019 • edited

andreasnoack left a comment

Choose a reason for hiding this comment

stevengj commented Apr 30, 2019 • edited

stevengj commented May 1, 2019 • edited

stevengj commented May 1, 2019 • edited

stevengj commented May 1, 2019 • edited

stevengj commented May 6, 2019

stevengj commented May 11, 2019

stevengj commented May 11, 2019

stevengj commented May 12, 2019

stevengj commented May 15, 2019

stevengj commented May 17, 2019 • edited

andreasnoack commented Aug 28, 2019

stevengj commented Aug 28, 2019

(H+μI) \ x solvers for Hessenberg factorizations #31853

(H+μI) \ x solvers for Hessenberg factorizations #31853

stevengj commented Apr 27, 2019 •

edited

stevengj commented Apr 29, 2019 •

edited

stevengj commented Apr 30, 2019 •

edited

stevengj commented Apr 30, 2019 •

edited

stevengj commented May 1, 2019 •

edited

stevengj commented May 1, 2019 •

edited

stevengj commented May 1, 2019 •

edited

stevengj commented May 17, 2019 •

edited