Kahan and Klein summation #695

zerothi · 2023-02-21T11:53:49Z

Motivation

Precision summations are necessary for certain operations (also suggested for cumsum)

Would you accept a type(Kahan(dp)) and type(Klein(dp)), and precision variants for inclusion in the stdlib?

Prior Art

No response

Additional Information

No response

The text was updated successfully, but these errors were encountered:

zerothi · 2023-02-21T12:04:35Z

Instead of having the type in the name, I would suggest we name them:

type(Kahan_real(dp)) :: rsum
type(Kahan_cmplx(dp)) :: csum

to assign data-type?

milancurcic · 2023-02-22T12:31:11Z

Thanks for this proposal. I think it's in scope and needed. I know of Kahan sum but not Klein--can you provide some reference for each here?

Regarding the API, can you explain the need for using derived types here? Could we not implement this as generic functions, i.e function sum_kahan(x)?

zerothi · 2023-02-22T12:35:09Z

The Klein summation is a 2nd order improvement of the Kahan summation. In fact there is a simple extension of the Kahan summation that fixes cases where the input value is larger than the currently summed quantity (e.g. single values are much much larger than the rest) or similar. Perhaps we can start with the regular Kahan, and then decide on how to add more advanced schemes, see for instance https://en.wikipedia.org/wiki/Kahan_summation_algorithm, I just call it Klein to not have excessively long names
The point is that Kahan summation might be needed on non-array summations, consider complex arithmetic on various array combinations a(:) * b(:) * sqrt(c(:) + a2(:,1) * a2(:,2), in this case I see a great value in enabling a user-defined type that sums arrays.
I do plan (and have a code) that adds the functions for regular arrays, as your sum_kahan. But I think both are necessary.

arjenmarkus · 2023-02-22T12:50:44Z

A pitfall of this algorithm is that compilers may decide that the parentheses are superfluous and thereby destroy the effect. How do you guard against that? The WIkipedia article (which also describes the Klein version - it was a new one for me as well) suggests using a volatile variable. Have you considered other summation tecniques, such as pariwise summation (not as accurate perhaps but more accurate than the naïve algorithm and faster than Kahan)? There is also a rather nice article by Nicholas Higham about summation techniques. It could be an inspiration. Op wo 22 feb. 2023 om 13:35 schreef Nick Papior ***@***.***>:

…

1. The Klein summation is a 2nd order improvement of the Kahan summation. In fact there is a simple extension of the Kahan summation that fixes cases where the input value is larger than the currently summed quantity (e.g. single values are much much larger than the rest) or similar. Perhaps we can start with the regular Kahan, and then decide on how to add more advanced schemes 2. The point is that Kahan summation might be needed on non-array summations, consider complex arithmetic on various array combinations a(:) * b(:) * sqrt(c(:) + a2(:,1) * a2(:,2), in this case I see a great value in enabling a user-defined type that sums arrays. 3. I do plan (and have a code) that adds the functions for regular arrays, as your sum_kahan. But I think both are necessary. — Reply to this email directly, view it on GitHub <#695 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAN6YR3EHMKVIFJ43V6CRV3WYYBYTANCNFSM6AAAAAAVC6LQPY> . You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

zerothi · 2023-02-22T12:55:48Z

I agree that compilation problems are vital.
However, we could add a test when compiling it to see if optimizations turn them off. And/or change the default flags for GNU/Intel compilers to obey code layout for this code only. Again, since incremental summations might be useful I think the type is necessary. As for the array summation, agreed that could be split into pairwise, etc. algorithms, but that isn't the point here, and those should probably be put in another module (stdlib_reduce for instance which would also accommodate norms etc.) :)

arjenmarkus · 2023-02-22T14:11:51Z

I do not mean to expand the work :), the main issue is the possible interference of compilers thinking they know better. I was inspired by the Wikipedia article wrt the other possibiliities. Op wo 22 feb. 2023 om 13:56 schreef Nick Papior ***@***.***>:

…

I agree that compilation problems are vital. However, we could add a test when compiling it to see if optimizations turn them off. And/or change the default flags for GNU/Intel compilers to obey arithmetic for this code only. Again, since incremental summations might be useful I think the type is necessary. As for the array summation, agreed that could be split into pairwise, etc. algorithms, but that isn't the point here, and those should probably be put in another module ( stdlib_reduce for instance which would also accommodate norms etc.) :) — Reply to this email directly, view it on GitHub <#695 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAN6YR4NAWEZSFRBROCYTFTWYYEGBANCNFSM6AAAAAAVC6LQPY> . You are receiving this because you commented.Message ID: ***@***.***>

zerothi · 2023-02-22T14:13:48Z

I agree that the compiler problem might be a blocker, but at least having the algorithm in-place and warning users about possible problems might be the only way to have the possibility in stdlib.. hmm...

arjenmarkus · 2023-02-22T14:16:14Z

Well, Fortran does allow variables to be volatile. That might definitely be a solution. Op wo 22 feb. 2023 om 15:13 schreef Nick Papior ***@***.***>:

…

I agree that the compiler problem might be a blocker, but at least having the algorithm in-place and warning users about possible problems might be the *only* way to have the possibility in stdlib.. hmm... — Reply to this email directly, view it on GitHub <#695 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAN6YR33DBNDZ7P76DKLMVTWYYNKNANCNFSM6AAAAAAVC6LQPY> . You are receiving this because you commented.Message ID: ***@***.***>

zerothi · 2023-02-22T14:18:03Z

Ah yes, good point. So bottom line:

Implement kahan summation (original version)
use volatile attributes
use types for custom summation paths
an array summation with kahan for easy solution

Anything else?

ivan-pi · 2023-03-01T16:21:12Z

Is this issue similar (or a duplicate) to #402?

ivan-pi · 2023-03-01T16:25:22Z

A pitfall of this algorithm is that compilers may decide that the parentheses are superfluous and thereby destroy the effect. How do you guard against that?

In principle you could parse the contents of compiler_options() intrinsic function and issue a warning to the error_unit, say if -ffast-math was detected with gfortran.

But in general I'd just go with a warning in the documentation.

zerothi · 2023-03-02T07:57:20Z

Yeah, it is.
So is this something I should do, or?

ivan-pi · 2023-03-02T08:25:37Z

If your reply is aimed at the compiler_options() comment, personally I wouldn't do it. Extra complexity for little benefit.

zerothi · 2023-03-02T08:26:32Z

No, I am questioning whether I should make a PR, the details of how to fix things is something we could take in the draft (IMHO).

jalvesz · 2023-11-14T22:40:01Z

If of interest for this discussion, I have an implementation of the kahan summation on top of a chunks-based sum here: https://github.com/jalvesz/fast_math/blob/main/src/fast_sum.f90 (if aggressive optimizations are activated, the chunks approach would still give an accuracy that should be at least 1 order of magnitude better than intrinsic)

zerothi added the idea Proposition of an idea and opening an issue to discuss it label Feb 21, 2023

milancurcic added topic: algorithms searching and sorting, merging, ... topic: mathematics linear algebra, sparse matrices, special functions, FFT, random numbers, statistics, ... labels Feb 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kahan and Klein summation #695

Kahan and Klein summation #695

zerothi commented Feb 21, 2023

zerothi commented Feb 21, 2023

milancurcic commented Feb 22, 2023

zerothi commented Feb 22, 2023 •

edited

arjenmarkus commented Feb 22, 2023 via email

zerothi commented Feb 22, 2023 •

edited

arjenmarkus commented Feb 22, 2023 via email

zerothi commented Feb 22, 2023

arjenmarkus commented Feb 22, 2023 via email

zerothi commented Feb 22, 2023

ivan-pi commented Mar 1, 2023

ivan-pi commented Mar 1, 2023

zerothi commented Mar 2, 2023

ivan-pi commented Mar 2, 2023

zerothi commented Mar 2, 2023

jalvesz commented Nov 14, 2023

Kahan and Klein summation #695

Kahan and Klein summation #695

Comments

zerothi commented Feb 21, 2023

Motivation

Prior Art

Additional Information

zerothi commented Feb 21, 2023

milancurcic commented Feb 22, 2023

zerothi commented Feb 22, 2023 • edited

arjenmarkus commented Feb 22, 2023 via email

zerothi commented Feb 22, 2023 • edited

arjenmarkus commented Feb 22, 2023 via email

zerothi commented Feb 22, 2023

arjenmarkus commented Feb 22, 2023 via email

zerothi commented Feb 22, 2023

ivan-pi commented Mar 1, 2023

ivan-pi commented Mar 1, 2023

zerothi commented Mar 2, 2023

ivan-pi commented Mar 2, 2023

zerothi commented Mar 2, 2023

jalvesz commented Nov 14, 2023

zerothi commented Feb 22, 2023 •

edited

zerothi commented Feb 22, 2023 •

edited