Clarify definition of preconditioner for sparse linear system solvers #5818

anntzer · 2016-02-07T04:42:23Z

http://docs.scipy.org/doc/scipy-0.17.0/reference/generated/scipy.sparse.linalg.cg.html (and other solvers too) currently defines the preconditioner M as "The preconditioner should approximate the inverse of A."
Wikipedia and MATLAB both seem to use the inverse definition (i.e. they solve M^-1.A.x=M^-1.y).
Both definitions seem to exist in the literature(?) but (assuming that the note is correct) I think it would be helpful to clarify which M is meant there ("Note: the definition of the preconditioner matches the definition in reference XYZ, and is the inverse of the definition in reference XYZ'.")

The text was updated successfully, but these errors were encountered:

ev-br · 2016-03-21T21:05:40Z

Figuring this out would definitely be very welcome.

anntzer · 2016-03-21T21:20:22Z

I'm now pretty certain that the required M is indeed an approximation of the inverse of A (nothing else really makes sense).

pv · 2016-03-21T21:57:42Z

All of scipy routines afaik assume M ~ A^-1, so the docs seem correct.

mdhaber · 2021-05-12T00:56:40Z

Another good one for someone who is familiar with the math.

gdcs92 · 2022-07-11T03:20:28Z

Hi! I've done some reading on the preconditioned CG and browsing of the source code, and would like to take this as my first issue.

The sentence "The preconditioner should approximate the inverse of A." seems to imply that we should have M ~ A^-1. This ends up being correct, but a bit confusing.

The implemented CG algorithm solves the left-preconditioned system (*), which in relevant works of the literature such as [1] and [2] is written as M^-1 Ax = M^-1 b. In this form, the convergence of the algorithm depends on the properties of M^-1 A instead of A. So, ideally M should be close to A in the sense that the eigenvalues of M^-1 A are clustered near 1 and || M^-1 A - Id ||_2 is small (see Lecture 40 at [3]).

However, since the CG iterations only require M in order to compute z = M^-1 r, what the implementation expects as its argument M is actually the operator P = M^-1. In this sense, it is correct to say that M ~ A^-1.

A note on the assertion (*) above: looking at the source code, it is not obvious at first glance that this is the case. The CG and other iterative algorithms are implemented using the reverse communications interface (described in [4]), which obscures the flow of execution a bit. But one can conclude this by inspecting the code and comparing with reference [2] (which is even referred to in [4] and in the source code of CGREVCOM.f).

References

[1] Saad, Yousef. "Iterative methods for sparse linear systems". Society for Industrial and Applied Mathematics, 2003.
[2] Barrett, Richard, et al. "Templates for the solution of linear systems: building blocks for iterative methods". Society for Industrial and Applied Mathematics, 1994.
[3] Trefethen, Lloyd N., and David Bau III. "Numerical linear algebra". Vol. 50. Siam, 1997.
[4] Dongarra, Jack, Victor Eijkhout, and Ajay Kalhan. "Reverse communication interface for linear algebra templates for iterative methods." UT, CS-95-291, May (1995).

melissawm · 2022-12-11T13:57:41Z

@gdcs92 are you still interested in fixing the docs here? It's seems this is indeed the right explanation but I'd suggest making it a bit more concise for docs purposes and point to one (or 2) references instead.

rogue-nix · 2023-09-14T20:03:44Z

Hey @melissawm, is this issue still up to be resolved or has been taken up or assigned to someone. I would like to help out in case it remains to be resolved.

melissawm · 2023-10-18T13:04:03Z

@rogue-nix There is no pull request for it so feel free to work on it 😄

Afleloup · 2024-04-17T10:04:07Z

I investigated this issue a little bit. It seems like the explication given by @gdcs92 is correct. Usually, the preconditionned system is written M^-1 Ax = M^-1 b with M^-1 approximating A^-1 as it is meant in their previous comment. Because the algorithm only needs to perform operation with M^-1 it is passed as the M argument which is the inverse of the definition with other work in the litterature [1,2]. However, I feel like there is a bigger issue. In fact, the CG algorithm only works on hermitian positive definite matrix but M^-1A is not necessarily that. There is a workaround this issue if you force M^-1 to also be hermitian positive definite (see [2] for more info). This is not reflected in the precondition on M. I will make a PR soon trying to clarify that for the cg and other iterative solver.

References

[1] Trefethen, Lloyd N., and David Bau III. "Numerical linear algebra". Vol. 50. Siam, 1997
[2]Shewchuk, Jonathan R . An Introduction to the Conjugate Gradient Method Without the Agonizing Pain, 1994

linear solver The argument M in iterative linear solver 'bicg', 'bicgstab', 'cg', 'cgs' is not standard notation in comparison to the literature. It is the inverse of the preconditioner usually denoted M^-1. Their should be a precondition on M for the preconditioned conjugate gradient. It is now explicited in the documentation and linked to relevant references. Closes scipy#5818

linear solver The argument M in iterative linear solver 'bicg', 'bicgstab', 'cg', 'cgs' has different notations in the literature. Its usage has been clarified. Their should be a precondition on M for the preconditioned conjugate gradient. It is now explicited in the documentation and linked to relevant references. Closes scipy#5818

ev-br added Documentation Issues related to the SciPy documentation. Also check https://github.com/scipy/scipy.org scipy.sparse.linalg labels Mar 21, 2016

mdhaber added the good first issue Good topic for first contributor pull requests, with a relatively straightforward solution label May 12, 2021

Afleloup mentioned this issue Apr 17, 2024

DOC: updated preconditioner doc for iterative #20513

Closed

Afleloup linked a pull request Apr 18, 2024 that will close this issue

DOC: updated preconditioner doc for iterative linear solver #20517

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarify definition of preconditioner for sparse linear system solvers #5818

Clarify definition of preconditioner for sparse linear system solvers #5818

anntzer commented Feb 7, 2016

ev-br commented Mar 21, 2016

anntzer commented Mar 21, 2016

pv commented Mar 21, 2016

mdhaber commented May 12, 2021

gdcs92 commented Jul 11, 2022 •

edited

melissawm commented Dec 11, 2022

rogue-nix commented Sep 14, 2023

melissawm commented Oct 18, 2023

Afleloup commented Apr 17, 2024 •

edited

Clarify definition of preconditioner for sparse linear system solvers #5818

Clarify definition of preconditioner for sparse linear system solvers #5818

Comments

anntzer commented Feb 7, 2016

ev-br commented Mar 21, 2016

anntzer commented Mar 21, 2016

pv commented Mar 21, 2016

mdhaber commented May 12, 2021

gdcs92 commented Jul 11, 2022 • edited

References

melissawm commented Dec 11, 2022

rogue-nix commented Sep 14, 2023

melissawm commented Oct 18, 2023

Afleloup commented Apr 17, 2024 • edited

References

gdcs92 commented Jul 11, 2022 •

edited

Afleloup commented Apr 17, 2024 •

edited