
Fix coefficients in expm helper function #9705

Merged
merged 1 commit into scipy:master on Apr 19, 2019

Conversation

@SuperFluffy (Contributor) commented Jan 21, 2019

According to [Higham 2005], the leading term of the backward error function of the [m/m] Padé approximant of the exponential function is x^{2m+1}. Its corresponding coefficient is C_{2m+1} = (m!)^2 / ((2m)! * (2m+1)!), cf. Equations (2.2) and (2.6) in that reference.

So far, the code did not compute C_{2m+1} but instead substituted m <- 2m + 1 into the formula, which is probably due to the literature's somewhat surprising notation (cf. the index i = 2m + 1 in Equation (2.6)).

[Al-Mohy and Higham 2009] provide a [Matlab reference implementation] in which they hard-code these coefficients, reproduced verbatim below. Using m rather than the m <- 2m + 1 substitution that the code currently makes exactly reproduces them:

```matlab
% Coefficients of leading terms in the backward error functions h_{2m+1}.
Coeff = [1/100800, 1/10059033600, 1/4487938430976000,...
         1/5914384781877411840000, 1/113250775606021113483283660800000000];
```

[Higham 2005]: https://doi.org/10.1137/04061101X
[Al-Mohy and Higham 2009]: https://doi.org/10.1137/09074721X
[Matlab reference implementation]: http://eprints.ma.man.ac.uk/1442/03/expm_new.zip
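
A minimal sketch of this cross-check in Python, using exact rational arithmetic; the Padé orders m = 3, 5, 7, 9, 13 assumed below are the ones the scaling-and-squaring algorithm selects from, and the listed fractions are the hard-coded Matlab constants above:

```python
# Cross-check: the corrected formula C_{2m+1} = (m!)^2 / ((2m)! * (2m+1)!)
# reproduces the constants hard-coded in the Matlab reference implementation.
from fractions import Fraction
from math import factorial

def leading_coefficient(m):
    """Leading backward-error coefficient C_{2m+1} of the [m/m] Pade approximant."""
    return Fraction(factorial(m) ** 2, factorial(2 * m) * factorial(2 * m + 1))

# The five hard-coded Matlab constants, written as exact fractions.
matlab_coeffs = [
    Fraction(1, 100800),
    Fraction(1, 10059033600),
    Fraction(1, 4487938430976000),
    Fraction(1, 5914384781877411840000),
    Fraction(1, 113250775606021113483283660800000000),
]

for m, reference in zip([3, 5, 7, 9, 13], matlab_coeffs):
    assert leading_coefficient(m) == reference  # exact match for all five orders
```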
@SuperFluffy (Contributor, Author)

Any way to move this forward? I cannot assess how bad this incorrect factor is, but given that a probably non-trivial number of people are using matrix exponentials in scipy, I would love to get this fixed (or to have the severity of this issue assessed, if somebody has that ability).

@tylerjereddy (Contributor) left a comment

A unit test that fails before & succeeds after the fix is usually helpful.

Also, there may be concerns about the license of the reference code you are citing? Maybe clarify: can that code definitely be used to fix issues in our code base?

@SuperFluffy (Contributor, Author) commented Feb 17, 2019

Re the unit tests:

As I wrote, I cannot assess the severity of this error. I can only state that a) scipy's expm claims to be an implementation of Al-Mohy and Higham 2009, and b) it is an incorrect implementation according to their paper and the ones cited. Given the approximate nature of this algorithm, we would need to establish an error bound that is not fulfilled by the current implementation but should be fulfilled by the reference implementation. I do not have a sufficient understanding of what is going on here to provide this expertise myself.

Right now, the unit tests available for scipy's expm itself are only testing trivial cases. Furthermore, I do not see how unit tests are useful here. If you want, I can write unit tests with the mathematically correct values, but that seems to be circular.

Re licensing:

I do not quite understand your point. In what way is their code used to fix scipy's code? Scipy implements the actual equations in the cited papers, and my fix calculates the same numbers that the Matlab script hard-codes. There is no sharing of code, no reimplementation, not even a derivative work. I derived the correct terms in the error function of the Padé approximants a) by hand, because I was using scipy's implementation to learn the algorithm and could not obtain the same factors, and b) by implementing them in a script. I have cross-checked the correctness of my factors by comparing them with the Matlab code above. The Matlab code itself was published alongside their paper; see the preprint server of Manchester University: http://eprints.maths.manchester.ac.uk/1442/

Steps forward:

The code is mathematically incorrect. You can follow the derivation of the coefficients in the papers cited; a brief sketch of the relevant relation is given at the end of this comment. Gautschi 2012 is also useful to read here.

Scipy should either not mention that it implements Al-Mohy and Higham 2009, or alternatively state that the implementation is probably not correct and warn people against using it.
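
For the index bookkeeping, here is a sketch of the relation the coefficient comes from, as I read Higham 2005 (the sign of the leading term is suppressed, since only its magnitude enters the bound): write the [m/m] Padé approximant as r_m(x) = e^{x + h_{2m+1}(x)}, so that

```latex
h_{2m+1}(x) = \log\!\bigl(e^{-x}\, r_m(x)\bigr)
            = \pm\, C_{2m+1}\, x^{2m+1} + O\!\bigl(x^{2m+2}\bigr),
\qquad
C_{2m+1} = \frac{(m!)^2}{(2m)!\,(2m+1)!}.
```

The subscript 2m + 1 only labels the power of x at which the series starts; feeding 2m + 1 into the factorials instead of m is what produced the wrong constants.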

@rgommers added this to the 1.3.0 milestone on Feb 20, 2019
@rgommers (Member)

If you want, I can write unit tests with the mathematically correct values, but that seems to be circular.

agreed

There is no sharing of code, no reimplementation, not even a derivative work. [...] I have cross-checked the correctness of my factors by comparing them with the Matlab code above. The Matlab code itself was published alongside their paper

this is all fine, thanks for clarifying

The code is mathematically incorrect.

yes, this PR should be okay as is. While this is a bugfix and we can therefore change the code, we should put a note in the 1.3.0 release notes when this PR is merged. Other than that, this just needs review. Unfortunately the expm expert we had went MIA. This should not be too hard to review though, since it's only a couple of lines of code.

I'll mark it for the next release; if no one has reviewed in 1-2 weeks I'll get to it at some point (no time now, sorry).

@tylerjereddy (Contributor)

we would need to establish an error bound that is not fulfilled by the current implementation but should be fulfilled by the reference implementation

I don't get how this wouldn't be useful. Maybe it is hard to do, and maybe the PR is fine as is, but it is certainly not ideal to have no regression guard for something that is mathematically incorrect.

@SuperFluffy (Contributor, Author) commented Feb 21, 2019

we would need to establish an error bound that is not fulfilled by the current implementation but should be fulfilled by the reference implementation

I don't get how this wouldn't be useful. Maybe it is hard to do, and maybe the PR is fine as is, but it is certainly not ideal to have no regression guard for something that is mathematically incorrect.

Oh, better unit tests would definitely be useful! I just don't know how to approach this, because I would need a much deeper understanding of the methods involved.

Maybe you can open an issue separate from this PR and take the lead on this? It would definitely be nice to see a unit test on expm that is directly affected by the wrong calculation of the Padé error coefficients. I would be very interested in that! I am happy to share anything I have found out about this.
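
For illustration, a minimal sketch of the kind of end-to-end regression check being discussed, comparing scipy.sparse.linalg.expm against a matrix with a closed-form exponential; whether the coefficient fix is actually visible at such a tolerance is exactly the open question here:

```python
# Hypothetical regression-style check: expm against a closed-form exponential.
# For A = [[0, theta], [-theta, 0]] the exact result is a rotation by theta.
import numpy as np
from scipy.sparse.linalg import expm

theta = 1.3
A = np.array([[0.0, theta],
              [-theta, 0.0]])
expected = np.array([[np.cos(theta), np.sin(theta)],
                     [-np.sin(theta), np.cos(theta)]])

np.testing.assert_allclose(expm(A), expected, rtol=1e-13, atol=1e-15)
```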

@tylerjereddy (Contributor)

Makes sense -- I'll think about this a bit!

@tylerjereddy (Contributor) left a comment

Ok, I'll go with Ralf on this one:

yes, this PR should be okay as is

Unit tests would have been nice, but we need a resident expert!

@tylerjereddy merged commit 4a497cc into scipy:master on Apr 19, 2019

@SuperFluffy (Contributor, Author) commented Apr 19, 2019

@tylerjereddy Awesome that this got merged, makes me very happy!

It looks like I cannot assign labels/milestones to issues. I guess I don't have the rights. :) Can you do that for me? Here is the associated issue: #10074

I hope that's what you meant for me to do. You linked to the issue tracker filtered by milestone 1.3.0.

@rgommers added the maintenance label on Apr 19, 2019
@rgommers (Member)

It looks like I cannot assign labels/milestones to issues. I guess I don't have the rights. :) Can you do that for me? Here is the associated issue: #10074

done

I hope that that's what you meant me to do.

No, the link was wrong. Tyler's question was to add a note to https://github.com/scipy/scipy/wiki/Release-note-entries-for-SciPy-1.3.0#scipysparse-improvements. I'm not sure that this is relevant for users though, since it's not clear how this is relevant to an end user of expm in a practical sense. So I suggest to leave it.

@SuperFluffy (Contributor, Author)

I'm not sure that this is relevant for users though, since it's not clear how this is relevant to an end user of expm in a practical sense. So I suggest to leave it.

It would indeed be interesting to ask users who rely heavily on expm to check if they observe any differences in their calculations before and after the fix.

@rgommers (Member)

That's a good idea - more a question for the mailing list than the release notes, though. If you want to write a message to scipy-user, that would be helpful.
