bug: getQuantiles() returns values that exceed max #4744
Conversation
Force-pushed 486914d to 6dfaa74
Force-pushed bc9b11c to 812c492
Updated the PR to include a proposed fix. It may need discussion to ensure this is the right way to do things. I fed getQuantiles() more synthetic data sets to check that the numbers generated are sane. Since the algorithm is based on the BH/TT uniform() procedure, we know that […]
In addition to the one comment, @ginoledesma could you please fill out our CLA located at: http://druid.io/community/cla.html
Thanks for the contribution.
@@ -1581,7 +1581,7 @@ public double sum(final float b)
         z = (-b + Math.sqrt(b * b - 4 * a * c)) / (2 * a);
       }
       final double uj = this.positions[i - 1] + (this.positions[i] - this.positions[i - 1]) * z;
-      quantiles[j] = (float) uj;
+      quantiles[j] = ((float) uj < this.max()) ? (float) uj : this.max();
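The effect of the one-line change can be shown in isolation. The following is a hypothetical standalone sketch (the class and method names are illustrative, not Druid's actual ApproximateHistogram API): the interpolated quantile position uj is computed in double precision and then cast to float, and rounding in the quadratic solve or in the cast itself can push the result slightly past the histogram's tracked maximum, so the value is clamped.

```java
// Hypothetical sketch of the clamping fix, not Druid's actual code.
public class QuantileClampSketch {
    // Mirrors the patched line:
    //   quantiles[j] = ((float) uj < this.max()) ? (float) uj : this.max();
    static float clampToMax(double uj, float max) {
        return ((float) uj < max) ? (float) uj : max;
    }

    public static void main(String[] args) {
        float max = 190.0f;
        // An interpolated value that overshoots the known maximum
        // after the float cast gets clamped back to max:
        System.out.println(clampToMax(190.00002, max)); // prints 190.0
        // A value inside the range passes through unchanged:
        System.out.println(clampToMax(120.5, max));     // prints 120.5
    }
}
```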
It's nice how simple, yet effective this tweak is. Would it make sense to do a similar tweak for mins?
In addition, if you have a clearly worded explanation of why the uj values can be greater than the max, a comment would be useful, explaining why it's necessary to clamp them to the max.
Does this help with the issue reported in #3972 (with regard to the median, rather than high percentiles)? It seems like max-clamping might not help with the low/mid quantiles like the median.
Force-pushed 812c492 to 1db9b41
@gianm I've updated the PR to include a comment on why clamping to max needs to happen. It's mostly based on the BH/TT paper, but I hope it's clear and makes sense. The short version is that we don't expect any "points" (values) to exist to the left of the known minimum or to the right of the known maximum. Clamping to min was already handled as a special case (line 1571: https://github.com/druid-io/druid/pull/4744/files#diff-de22260542704e07a950d869cd124aefL1571).

As for #3972: yes, it handles that situation. I've tried to replicate the original poster's situation, though I took a guess in filling in the blanks for the full ingestion spec. Given the time series query:

{
  "queryType": "timeseries",
  "dataSource": "test_quantile_fix",
  "granularity": "all",
  "intervals": [
    "2017-09-01T00:00:00+08:00/2017-10-01T00:00:00+08:00"
  ],
  "aggregations": [
    {
      "type": "approxHistogramFold",
      "name": "hist_fold",
      "fieldName": "hist_fold_api_duration",
      "resolution": 10000,
      "numBuckets": 100
    }
  ],
  "postAggregations": [
    {
      "type": "quantile",
      "name": "myResult",
      "fieldName": "hist_fold",
      "probability": 0.5
    }
  ],
  "context": {}
}

Druid 0.10.1 result for q=0.5: 1111.9833
Druid 0.10.1 + patch result for q=0.5: 190.86847
Reference (computed using SciPy for q=0.5): 190.0

You'll notice the histogram breaks are completely different with and without the clamping to max, but I think that has more to do with the folding of multiple approximate histograms, which gets exacerbated when the original values weren't clamped to begin with.
I'll update the PR with the CLA tomorrow. I did use some personal/company time, so I'll check with the company to get that ironed out.
Force-pushed 1db9b41 to 420ae6a
Updated the unit tests to be more thorough, computing a range of quantiles (q = 0.10 through 0.90 in 0.10 increments) and including explicit tests for "outliers" (q = 0.05 and q = 0.95).
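The kind of range sweep described above can be sketched as a plain assertion loop. This is a hypothetical stand-in, not Druid's actual test code: the quantile function below is a simple nearest-rank computation over a sorted array rather than ApproximateHistogram.getQuantiles(), and the invariant checked is the one the patch guarantees, that every returned quantile lies within [min, max] of the ingested data, including the outlier probabilities.

```java
import java.util.Arrays;

// Hypothetical sketch of a quantile range-sweep test.
public class QuantileRangeSweep {
    // Nearest-rank quantile over a sorted array (a stand-in for the
    // approximate histogram, used only to illustrate the sweep).
    static float quantile(float[] sorted, float q) {
        int idx = (int) Math.ceil(q * sorted.length) - 1;
        return sorted[Math.max(0, Math.min(idx, sorted.length - 1))];
    }

    public static void main(String[] args) {
        float[] data = {12f, 45f, 88f, 120f, 150f, 163f, 175f, 182f, 188f, 190f};
        Arrays.sort(data);
        float min = data[0];
        float max = data[data.length - 1];
        // Sweep q = 0.05, 0.10, ..., 0.95, covering the outlier probes
        // (q = 0.05 and q = 0.95) as well as the mid-range quantiles.
        for (int i = 1; i <= 19; i++) {
            float q = i * 0.05f;
            float v = quantile(data, q);
            if (v < min || v > max) {
                throw new AssertionError("quantile out of range at q=" + q);
            }
        }
        System.out.println("all quantiles within [min, max]");
    }
}
```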
Force-pushed c1d1694 to 28de800
Not sure why the Travis builds are failing, but it seems to be affecting the other PRs as well, particularly those with the […]
@ginoledesma We've been having some unreliability with one of the tests recently; sorry about that. I re-ran it, and it may pass on the second run. The patch looks good to me; thanks for the additional tests as well. Have you had a chance to get the CLA sorted?
@ginoledesma If you merge master into your branch, it should help with the CI issues. We just made a commit there that tweaks some Travis settings and seems to be helping.
Force-pushed 28de800 to 773489f
LGTM.
@ginoledesma Hmm, this change does seem better, but I'm wondering how it helps with the test you describe in #4744 (comment). The patch only affects the […]
@ginoledesma, any thoughts on the question from #4744 (comment)?
Waiting for answers to @gianm's questions before merging.
I don't have a solid answer, unfortunately. I'm trying to come up with a smaller dataset that replicates the approximation breaks so the issue can be reproduced properly in the tests, but I haven't found any good leads to explain it.
@ginoledesma, thanks for the details. If you do find any new info, please raise a new issue. In the meantime, I'll merge this patch since it does look like an improvement. Thanks!
Fixes #3972