
Druid Broker Result Level Cache Not Working Properly When Different Query Intervals Cover the Same Set of Segments #7302

Closed
lxqfy opened this issue Mar 20, 2019 · 4 comments

Comments

@lxqfy
Contributor

lxqfy commented Mar 20, 2019


Affected Version

0.13.0-incubating

Description

We have a segment with a segment granularity of DAY and a query granularity of HOUR.
When we send two queries that are identical except for the query interval, the result-level cache returns the wrong result.

Example: QueryA and QueryB have the same query body.
QueryA interval: 2019-03-15T14:00:00/2019-03-16T14:00:00
QueryB interval: 2019-03-15T18:00:00/2019-03-16T18:00:00

Involved Segments:
QueryA: segment2019-03-15_2019-03-16
QueryB: segment2019-03-15_2019-03-16

Response:
QueryA gets the correct result.
QueryB incorrectly receives QueryA's cached result.

Some investigation:
The result-level cache uses the query with its interval stripped as the cache key. When the keys match, the ETag is checked to decide whether the cached result can be returned. However, the ETag is computed only from the identifiers of the involved segments.
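The collision described above can be sketched in a few lines. This is an illustrative model, not Druid's actual implementation: `etag` is a hypothetical helper that hashes segment identifiers the way the broker's ETag calculation does, with an optional parameter showing the proposed fix of mixing in the queried intervals.

```python
import hashlib

def etag(segment_ids, query_intervals=None):
    """Illustrative ETag: hash the involved segment identifiers,
    optionally mixing in the queried intervals (the proposed fix)."""
    h = hashlib.sha1()
    for seg_id in sorted(segment_ids):
        h.update(seg_id.encode())
    for iv in sorted(query_intervals or []):
        h.update(iv.encode())
    return h.hexdigest()

segments = ["segment2019-03-15_2019-03-16"]

# Current behavior: only segment identifiers are hashed, so QueryA and
# QueryB produce the same ETag and QueryB hits QueryA's cache entry.
assert etag(segments) == etag(segments)

# Proposed fix: mix the per-segment query interval into the hash,
# so the two queries get distinct ETags and no false cache hit occurs.
etag_a = etag(segments, ["2019-03-15T14:00:00/2019-03-16T14:00:00"])
etag_b = etag(segments, ["2019-03-15T18:00:00/2019-03-16T18:00:00"])
assert etag_a != etag_b
```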

@gianm
Contributor

gianm commented Mar 20, 2019

I looked into this a bit and believe the ETag calculation needs to be re-thought. It has the bug you mentioned: it is calculated from segment identifiers (which include the entire segment interval) and does not incorporate the queried sub-interval within a segment. This causes the behavior you are seeing, and could be fixed by mixing the SegmentDescriptor interval into the ETag.

But a potentially larger bug is that it leverages QueryToolChest.computeCacheKey, which is designed for segment-level caching and, importantly, includes only the query parameters that might affect the segment-level results. This is done to promote higher cache hit rates. For example, it doesn't include things like:

  • grandTotal or postAggregations for timeseries
  • any non-sorting postAggregations for topN
  • having, subtotalsSpec, postAggregations, or limitSpec for groupBy (although the lack of limitSpec is a bug, I think, since groupBy supports limit push down in some cases now)

I think this means we need a new method in the QueryToolChest for computing result-level cache keys.
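The distinction between the two kinds of cache keys can be sketched as follows. This is a hypothetical model under my own names (`segment_level_key`, `result_level_key`), not Druid's API: the segment-level key deliberately omits merge-time parameters for cache reuse, while a result-level key must fold them in, since parameters like postAggregations change the final merged result.

```python
import hashlib
import json

def segment_level_key(query):
    """Hypothetical segment-level cache key: includes only the
    parameters that affect per-segment results."""
    parts = {k: query.get(k) for k in ("queryType", "dataSource", "aggregations")}
    return hashlib.sha1(json.dumps(parts, sort_keys=True).encode()).hexdigest()

def result_level_key(query):
    """Hypothetical result-level key: the segment-level key plus the
    parameters applied during merging (postAggregations, having, limitSpec)."""
    extra = {k: query.get(k) for k in ("postAggregations", "having", "limitSpec")}
    blob = segment_level_key(query) + json.dumps(extra, sort_keys=True)
    return hashlib.sha1(blob.encode()).hexdigest()

q1 = {"queryType": "groupBy", "dataSource": "wiki",
      "aggregations": [{"type": "count", "name": "rows"}],
      "postAggregations": []}
q2 = dict(q1, postAggregations=[{"type": "arithmetic", "name": "x"}])

# Segment-level keys collide by design (higher hit rates)...
assert segment_level_key(q1) == segment_level_key(q2)
# ...but result-level keys must differ, since postAggregations
# change the final merged result.
assert result_level_key(q1) != result_level_key(q2)
```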

@gianm
Contributor

gianm commented Mar 20, 2019

the lack of limitSpec is a bug, I think, since groupBy supports limit push down in some cases now

Ah, maybe it's not. I don't remember if limit push down goes all the way to the per-segment results, or if it only affects the merge buffer. If it's the latter, it doesn't need to be in the per-segment cache key.

@surekhasaharan

@lxqfy thanks for reporting, I'm looking into it.

@jihoonson
Contributor

Fixed in #7325.
