[hist] Improve precision of TAxis::FindFixBin / FindBin. #19033

hageboeck · 2025-06-13T12:46:27Z

The bin computation in TAxis can suffer from floating-point uncertainties, returning bins such that this assertion breaks:

int bin = axis.FindBin(x);
assert(axis.GetBinLowEdge(bin) <= x && x < axis.GetBinUpEdge(bin));

This is of course surprising for users, and sometimes brings data into the overflow, although they fall into the valid range of the axis. Here, two terms are added, which fix these floating-point errors.

On a recent AMD CPU, there is no runtime difference, since the latency of the virtual call overshadows the additional computations for the correction. For 10M calls, running 100 times with perf, we arrive at:

	Uncorrected	Corrected
Instructions	250M	360M
cycles	216M	218M
Run time	0.0495 +- 0.0003 s	0.0493 +- 0.0002 s

Note:
Several attempts at reordering the equations for better precision failed. There were always a few test cases in the included tests that didn't pass, so applying the correction looks to be the best solution.

This PR fixes the issues brought up here:

ferdymercury · 2025-06-13T13:39:07Z

Thanks a lot Stephan!
Some questions from my side:

Does this supersede [hist] Improve calculation of TAxis::FindBin(x) when x is at the bin edges #14105 ? Maybe it can be closed.
Does this fix Bug in TAxis::FindBin (Double_t x) ? #14091 ? Maybe it can be linked.
Does this new implementation affect runtime performance when filling a histogram? If yes, maybe it's better to have a separate function FindBinHighRes or so. One could even think of using long-double.
Is it sure that the resulting bin is only off by one? I can imagine of (probably very weird cases) where the floating point limited precision might be two?

hageboeck · 2025-06-13T14:09:36Z

Thanks a lot Stephan! Some questions from my side:

Does this supersede [hist] Improve calculation of TAxis::FindBin(x) when x is at the bin edges #14105 ? Maybe it can be closed.

Does this fix Bug in TAxis::FindBin (Double_t x) ? #14091 ? Maybe it can be linked.

Does this new implementation affect runtime performance when filling a histogram? If yes, maybe it's better to have a separate function FindBinHighRes or so. One could even think of using long-double.

Is it sure that the resulting bin is only off by one? I can imagine of (probably very weird cases) where the floating point limited precision might be two?

Good questions, I'm checking! 🙂

github-actions · 2025-06-13T17:20:44Z

Test Results

19 files 19 suites 4d 3h 1m 0s ⏱️
3 016 tests 3 013 ✅ 0 💤 3 ❌
55 735 runs 55 708 ✅ 0 💤 27 ❌

For more details on these failures, see this check.

Results for commit 901d041.

♻️ This comment has been updated with latest results.

hageboeck · 2025-06-17T12:12:53Z

Does this supersede [hist] Improve calculation of TAxis::FindBin(x) when x is at the bin edges #14105 ? Maybe it can be closed.

It can be closed, indeed.

Does this fix Bug in TAxis::FindBin (Double_t x) ? #14091 ? Maybe it can be linked.

It fixes the problem described there. I linked the issue.

Does this new implementation affect runtime performance when filling a histogram? If yes, maybe it's better to have a separate function FindBinHighRes or so. One could even think of using long-double.

It does not affect run time on a modern CPU, even though more instructions are issued. When you run the bare code to find the bin (not as part of a virtual function), you have the following behaviour:

	Uncorrected	Corrected
Instructions	8	19
Cycles	33	31

It doesn't matter how many instructions are issued. What's interesting is how many cycles they need to complete. Although that depends on the CPU in use, you see that the two versions are very close.

The killer argument, however, is that all of this happens inside a virtual function, and that of course has its own latencies. When plugging the codes into a virtual function, and benchmarking 10M calls, we arrive at:

	Uncorrected	Corrected
Instructions	250M	360M
cycles	216M	218M
Run time	0.0495 +- 0.0003 s	0.0493 +- 0.0002 s

So you see that more instructions are issued, but due to the CPU being a superscalar CPU, these additional instructions have no noteworthy effect on the run time, because the correction terms can be evaluated in parallel to the original computation.

This means, that in TAxis, where it is part of a virtual function, we should not see a run time difference.

Is it sure that the resulting bin is only off by one? I can imagine of (probably very weird cases) where the floating point limited precision might be two?

I did not manage to provoke a more-than-one difference. I didn't set out to prove it mathematically, but I had lots of difficulties crafting a case that made an "off by -1" error.

ferdymercury · 2025-06-17T12:50:22Z

Awesome analysis, thanks a lot!!
Does this mean that I can close #17689 if you implement the same strategy in that part of the code ?
And not sure if you want to review this alleviation strategy: #17896

ferdymercury · 2025-06-17T13:35:16Z

Btw, this PR reminds me of this issue: https://its.cern.ch/jira/browse/ROOT-10179

hageboeck · 2025-06-17T14:28:48Z

Awesome analysis, thanks a lot!! Does this mean that I can close #17689 if you implement the same strategy in that part of the code ?

@ferdymercury I think for this PR, the x == max --> overflow problem remains. So you could probably just use:

- max = currentMax;
+ max = std::nextafter(currentMax, inf);

but that's to be tested.

Btw, this PR reminds me of this issue: https://its.cern.ch/jira/browse/ROOT-10179

Was this the correct link?

ferdymercury · 2025-06-17T14:34:58Z

Was this the correct link?

It was about the question/fact of whether performance was impacted. Maybe more the subissue https://its.cern.ch/jira/browse/ROOT-10185 which would have been easier for you if rootbench could be triggered from a PR.

 max = std::nextafter(currentMax, inf);

I had tried that and it didn't work. I really had to add
xmax = std::max(xmax + 1e-15*(xmax - xmin), std::nextafter(xmax,INFINITY));

For most cases, the 1e-15*max_width is what really counts.
Or do you mean that that should be fine with just nextafter, 'after' your PR gets merged?

Related: #17896

Due to the floating-point subtraction x - xMin, the bin index occasionally flows over to the next bin. This is particularly annoying when it goes into the overflow, although the coordinate is strictly smaller than the max of the axis. By adding a correction term that can detect this case, overflows such as the one discussed in https://root-forum.cern.ch/t/bug-or-feature-in-ttree-draw/62862 can be avoided. An error in the other direction is possible, too, and fixed in this commit: https://root-forum.cern.ch/t/floating-point-rounding-error-when-filling-the-histogram/35189 Microbenchmarking the changed lines showed them to be the same speed in gcc, and 40% slower in clang. Both changes are by far outweighted by the virtual call overhead, though. This allowed for removing the cautionary note on rounding errors, added in 1703c54, which is fixed now. Fix root-project#14091.

The tests are based on two issues brought up in the forum, a few cases designed by hand to break the old algorithm, github issue root-project#14091, the tests from PR root-project#14105, and a test sampling 100 random points.

hageboeck self-assigned this Jun 13, 2025

hageboeck requested a review from lmoneta as a code owner June 13, 2025 12:46

hageboeck force-pushed the TAxis_numericalPrecision branch 2 times, most recently from 3eb7398 to 50e7024 Compare June 13, 2025 12:59

hageboeck linked an issue Jun 17, 2025 that may be closed by this pull request

Bug in TAxis::FindBin (Double_t x) ? #14091

Open

1 task

hageboeck mentioned this pull request Jun 17, 2025

Bug in TAxis::FindBin (Double_t x) ? #14091

Open

1 task

hageboeck force-pushed the TAxis_numericalPrecision branch 2 times, most recently from eab0ca7 to 543f148 Compare June 17, 2025 12:44

hageboeck mentioned this pull request Jun 17, 2025

[hist] Improve calculation of TAxis::FindBin(x) when x is at the bin edges #14105

Closed

hageboeck force-pushed the TAxis_numericalPrecision branch from 543f148 to b9849ac Compare June 17, 2025 12:48

hageboeck force-pushed the TAxis_numericalPrecision branch from b9849ac to 8e6acc0 Compare June 17, 2025 14:06

hageboeck added 3 commits June 19, 2025 13:56

[NFC] Clean up an outdated comment.

7373739

[hist] Add a test for finding bins in TAxis.

901d041

The tests are based on two issues brought up in the forum, a few cases designed by hand to break the old algorithm, github issue root-project#14091, the tests from PR root-project#14105, and a test sampling 100 random points.

hageboeck force-pushed the TAxis_numericalPrecision branch from 8e6acc0 to 901d041 Compare June 19, 2025 11:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[hist] Improve precision of TAxis::FindFixBin / FindBin. #19033

[hist] Improve precision of TAxis::FindFixBin / FindBin. #19033

hageboeck commented Jun 13, 2025 •

edited

Loading

Uh oh!

ferdymercury commented Jun 13, 2025 •

edited

Loading

Uh oh!

hageboeck commented Jun 13, 2025

Uh oh!

github-actions bot commented Jun 13, 2025 •

edited

Loading

Uh oh!

hageboeck commented Jun 17, 2025

Uh oh!

ferdymercury commented Jun 17, 2025 •

edited

Loading

Uh oh!

ferdymercury commented Jun 17, 2025

Uh oh!

hageboeck commented Jun 17, 2025

Uh oh!

ferdymercury commented Jun 17, 2025 •

edited

Loading

Uh oh!

Uh oh!

[hist] Improve precision of TAxis::FindFixBin / FindBin. #19033

Are you sure you want to change the base?

[hist] Improve precision of TAxis::FindFixBin / FindBin. #19033

Conversation

hageboeck commented Jun 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ferdymercury commented Jun 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hageboeck commented Jun 13, 2025

Uh oh!

github-actions bot commented Jun 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test Results

Uh oh!

hageboeck commented Jun 17, 2025

Uh oh!

ferdymercury commented Jun 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ferdymercury commented Jun 17, 2025

Uh oh!

hageboeck commented Jun 17, 2025

Uh oh!

ferdymercury commented Jun 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

hageboeck commented Jun 13, 2025 •

edited

Loading

ferdymercury commented Jun 13, 2025 •

edited

Loading

github-actions bot commented Jun 13, 2025 •

edited

Loading

ferdymercury commented Jun 17, 2025 •

edited

Loading

ferdymercury commented Jun 17, 2025 •

edited

Loading