fix: make histogramQuantile handle case of zero samples #5419

wolffcm · 2023-05-26T00:39:31Z

When a histogram contains no observations (the count is zero in every bucket), produce a null value.

Checklist

Dear Author 👋, the following checks should be completed (or explicitly dismissed) before merging.

✏️ Write a PR description, regardless of triviality, to include the value of this PR
🔗 Reference related issues
🏃 Test cases are included to exercise the new code
🧪 If new packages are being introduced to stdlib, link to Working Group discussion notes and ensure it lands under experimental/
📖 If language features are changing, ensure docs/Spec.md has been updated

Dear Reviewer(s) 👋, you are responsible (among others) for ensuring the completeness and quality of the above before approval.

stdlib/universe/histogram_quantile.go

Co-authored-by: Gavin Cabbage <gavincabbage@users.noreply.github.com>

wolffcm · 2023-05-26T18:08:45Z

stdlib/universe/histogram_quantile.go

+	return true
+}
+
+func (t *histogramQuantileTransformation) computeQuantile(cdf []bucket) (quantileResult, error) {


The issue here was that the Flux stdlib function histogramQuantile returned a wrong answer for some input data.

This function accepts a cumulative distribution function (a cumulative histogram produced from the input table data) and produces the requested quantile.

When every count in the CDF was zero, this function returned the upper bound of the last histogram bucket, which is incorrect. The right thing to do in that case is to return a null value, since we can't compute a quantile if we never received any observations.
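To make the zero-observation case concrete, here is a minimal sketch of a quantile computed from a cumulative histogram by linear interpolation. It is not the code from this PR: the `bucket` field names, the `quantileOf` helper, its `minValue` parameter, and the handling of an infinite upper bound are all assumptions made for illustration.

```go
package main

import (
	"fmt"
	"math"
)

// bucket is a hypothetical stand-in for the PR's bucket type:
// count is the cumulative number of observations <= upperBound.
type bucket struct {
	count      float64
	upperBound float64
}

// quantileOf (hypothetical helper) returns the q-quantile of a cumulative
// histogram. ok is false when there are no observations, which corresponds
// to producing a null value instead of a misleading bucket bound.
func quantileOf(cdf []bucket, q, minValue float64) (value float64, ok bool) {
	if len(cdf) == 0 {
		return 0, false
	}
	total := cdf[len(cdf)-1].count
	if total == 0 {
		// No samples at all: a quantile is undefined.
		return 0, false
	}
	rank := q * total
	lowerCount, lowerBound := 0.0, minValue
	for _, b := range cdf {
		if b.count >= rank {
			// Assumption: do not interpolate into an unbounded or empty
			// bucket; fall back to its lower bound.
			if math.IsInf(b.upperBound, 1) || b.count == lowerCount {
				return lowerBound, true
			}
			frac := (rank - lowerCount) / (b.count - lowerCount)
			return lowerBound + (b.upperBound-lowerBound)*frac, true
		}
		lowerCount, lowerBound = b.count, b.upperBound
	}
	// Only reachable for q > 1 or non-monotonic input.
	return cdf[len(cdf)-1].upperBound, true
}

func main() {
	empty := []bucket{{0, 1}, {0, 2}}
	fmt.Println(quantileOf(empty, 0.9, 0)) // prints: 0 false

	filled := []bucket{{2, 10}, {4, 20}}
	fmt.Println(quantileOf(filled, 0.75, 0)) // prints: 15 true
}
```

Without the `total == 0` guard, the first bucket would satisfy `b.count >= rank` trivially (0 >= 0) and the function would return a bound that reflects no real data.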

wolffcm · 2023-05-26T18:23:31Z

stdlib/universe/histogram_quantile.go

+			// "force" is not possible because isMonotonic will fix the buckets
+			return quantileResult{}, errors.Newf(codes.Internal, "unknown or unexpected value for onNonmonotonic: %q", t.spec.OnNonmonotonic)
+		}
+	}


Sometimes the histogram buckets are not monotonic (which they should be if they are cumulative) due to late-arriving data on the edge. The OnNonmonotonic parameter describes what to do in this case.

Checking for monotonicity first (and fixing the buckets if needed and requested by the user) avoids a bug: when forcing monotonicity, the total observation count was pulled from the last bucket before that bucket had been "fixed".

This is not really related to the issue the user found, but I saw it here and fixed it. The test case histogramQuantileOnNonmonotonicForceLastBucket below verifies this fix.
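The ordering problem described in this comment can be sketched as follows. This is an illustration only: `fixMonotonic` is a hypothetical helper, and it assumes that "force" raises a count that is lower than its predecessor up to the predecessor's value. The actual isMonotonic in the PR may differ.

```go
package main

import "fmt"

// bucket is a hypothetical stand-in for the PR's bucket type.
type bucket struct {
	count      float64
	upperBound float64
}

// fixMonotonic (hypothetical) reports whether the cumulative counts were
// already non-decreasing. Any count lower than its predecessor is raised
// in place to the predecessor's value.
func fixMonotonic(cdf []bucket) bool {
	wasMonotonic := true
	for i := 1; i < len(cdf); i++ {
		if cdf[i].count < cdf[i-1].count {
			wasMonotonic = false
			cdf[i].count = cdf[i-1].count
		}
	}
	return wasMonotonic
}

func main() {
	// The last bucket's count dropped below its predecessor.
	cdf := []bucket{{5, 1}, {8, 2}, {6, 3}}

	before := cdf[len(cdf)-1].count // total read too early
	wasMonotonic := fixMonotonic(cdf)
	after := cdf[len(cdf)-1].count // total read after the fix

	fmt.Println(before, wasMonotonic, after) // prints: 6 false 8
}
```

Reading the total before the fix yields 6 here, while the repaired histogram holds 8 observations, so any rank derived from the early total would be computed against the wrong count.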

wolffcm · 2023-05-26T18:23:48Z

stdlib/universe/histogram_quantile.go

+	if totalCount == 0 {
+		// Produce a null value if there were no samples
+		return quantileResult{action: appendNil}, nil
+	}


Here is where we bail and produce a null value for the case of zero observations.

fix: make histogramQuantile handle case of zero samples

4a69973

wolffcm requested a review from a team as a code owner May 26, 2023 00:39

wolffcm mentioned this pull request May 26, 2023

The histogramQuantile function returns an incorrect value when there are no observations in the histogram #5415

Closed

gavincabbage reviewed May 26, 2023

stdlib/universe/histogram_quantile.go (outdated, resolved)

gavincabbage approved these changes May 26, 2023

chore: fix comment typo in stdlib/universe/histogram_quantile.go

8d1b455

Co-authored-by: Gavin Cabbage <gavincabbage@users.noreply.github.com>

wolffcm commented May 26, 2023

wolffcm merged commit d8995bb into master May 26, 2023
7 checks passed

wolffcm deleted the fix/histogram-quantile branch May 26, 2023 18:24
