Optimize request fetching for filters and filter aggs #136796

flash1293 · 2022-07-21T06:47:28Z

Filters specified in the filter or filters agg use a linear scan through all documents matching the top level query of the request - depending on how many documents are matching this top level query, this can be dramatically slower than sending separate requests to fetch those parts individually.

A common example, tested with ~3M documents:
Date histogram of a ratio of sum of some field (fetching the sum agg twice, with filter and without filter):

Date histogram without filter set on query level: 12s
Date histogram with filter set in top level query: depends on how many documents are matched by the query - can span from milliseconds to ~10s
Date histogram with two sum aggs, using a nested filter agg (no matter how much documents are matched by the filter): 21s

If the filter hits a lot of data, doing it in a single request is just as fast or slightly faster than doing two separate requests - however, if the filter is hitting little data, it can be much faster. The same would apply if the ratio is done with two separate non-overlapping filters hitting a small amount of documents each.

This problem gets worse if lots of different filter aggs are used in the same query - effectively every filtered metric is as expensive as a whole separate search requiring a linear scan (like doing a metric agg like sum or a date histogram without a top level query). It's affecting the filters agg and multiple filter aggs in the same way (the only performant way is to narrow down documents in the top level query).

In the best case doing a single request is roughly as fast as doing multiple requests (minus some static overhead per request which doesn’t matter too much for requests hitting large amounts of data), in the worst case it’s orders of magnitudes slower because the top level query is hitting optimizations that are not available to in-agg filters, making certain requests feasible in the first place (like just getting the top level count instead of doing aggs requiring a scan).

Due to Elasticsearch parallelizing a lot of work across multiple shards, on a healthy cluster this is often not resulting in widely longer response times for the inefficient query itself, but it unnecessarily increases the load on the cluster which can turn into problems for busy clusters (queued searches, CPU throttling).

Possible optimizations

On Lens level

in the to_expression function of the datasource, if filtered metrics or the filter agg is used, create separate esaggs calls, adding the same filter to the top level query - each of those has all the bucket aggs plus the metrics with the same filter - simplification for a POC: have one request per metric
Pass disabled filters agg so the datatable meta data is set up right
Define a merge_tables expression function which is merging all rows along the bucket columns (for filters agg there should be an argument for the filters column Id and the label to set it to)

Upside: leverage all top level query optimizations with dramatic performance gains in some special cases, with negligible to no impact in query runtime in other cases
Downside: Map and tabify the bucket structure multiple times and also merge it later on which is more taxing on the client

On AggConfig level

The same thing as above, but on aggconfig level, not operating on the table but merging responses similar to how timeshift is implemented before tabification.

Another option not leveraging as much potential but theoretically being easier to implement is to collect all filters for filters and filter aggs from the agg tree and adding all of them as OR clauses to the top level query. This will limit the amount of documents a linear scan has to be performed on in some cases without catching all possible optimizations - e.g. in the example above with a ratio of a filtered metric vs an unfiltered metric it wouldn't be possible to prevent a another linear scan for the filtered metric.

The text was updated successfully, but these errors were encountered:

elasticmachine · 2022-07-21T06:47:30Z

Pinging @elastic/kibana-app-services (Team:AppServicesSv)

elasticmachine · 2022-07-21T06:47:30Z

Pinging @elastic/kibana-vis-editors @elastic/kibana-vis-editors-external (Team:VisEditors)

nik9000 · 2022-07-27T20:14:34Z

Relates to elastic/elasticsearch#88660

elasticmachine · 2024-01-15T14:25:59Z

Pinging @elastic/kibana-data-discovery (Team:DataDiscovery)

lukasolson · 2024-03-18T16:54:49Z

Thanks for linking to that issue @nik9000... Would this sort of optimization make more sense at the ES level rather than Kibana?

drewdaemon · 2024-03-18T18:09:25Z

Side-note: we made related optimizations to aggs-building in Lens in #126941 and #135265. But those did not include merging data from multiple ES requests.

nik9000 · 2024-03-19T12:08:32Z

Hey! Since I filed that issue in ES we've mostly shifted work from aggs to ESQL - so if you desperately need aggs optimizations that's going to be harder to get to. But ESQL is picking up many of the optimizations from aggs as we go anyway.

Back to aggs - the issue that I linked to talks about optimizing the filters agg to execute quickly when it isn't possible to merge the top level query with the filters query - instead we'd walk the top level query and land the matches from the filters linearly. We never implemented this because, in general, you are better off writing other sorts of aggs - a date_histogram or terms or something. filters was never a fast agg, except in some very special cases. And this would just make it less slow. But if you are doing a sort of self-join like thing where you get the results of one agg and then rerun with a second set of aggs, maybe on a different time window or something, then you'll want filters - thus why I linked it.

ESQL plans to grow actual syntax for most of this. We're already talking about building blocks for it here: elastic/elasticsearch#106152 .

stratoula · 2024-03-20T07:06:58Z

Thanx Nik! ok it feels as something we should freeze for now as it is going to work better in ES|QL. There are plans to move to Lens to work with _query in the background but we want more feature parity to do so. I think is ok to wait till then though. cc @timductive

flash1293 added performance Team:Visualizations Visualization editors, elastic-charts and infrastructure Team:AppServicesSv Feature:Lens labels Jul 21, 2022

kibanamachine added this to Long-term goals in Lens Jul 21, 2022

ghudgins mentioned this issue Jul 25, 2022

[Meta][Lens] Performance #112931

Closed

exalate-issue-sync bot added the impact:medium Addressing this issue will have a medium level of impact on the quality/strength of our product. label Aug 17, 2022

vadimkibana added Team:DataDiscovery Discover App Team (Document Explorer, Saved Search, Surrounding documents, Graph) and removed Team:AppServicesSv labels Jan 15, 2024

kibanamachine added this to Inbox in Discover Jan 15, 2024

kertal assigned lukasolson Jan 18, 2024

lukasolson added impact:low Addressing this issue will have a low level of impact on the quality/strength of our product. loe:medium Medium Level of Effort and removed impact:medium Addressing this issue will have a medium level of impact on the quality/strength of our product. labels Mar 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize request fetching for filters and filter aggs #136796

Optimize request fetching for filters and filter aggs #136796

flash1293 commented Jul 21, 2022 •

edited

Loading

elasticmachine commented Jul 21, 2022

elasticmachine commented Jul 21, 2022

nik9000 commented Jul 27, 2022

elasticmachine commented Jan 15, 2024

lukasolson commented Mar 18, 2024

drewdaemon commented Mar 18, 2024

nik9000 commented Mar 19, 2024

stratoula commented Mar 20, 2024 •

edited

Loading

Optimize request fetching for filters and filter aggs #136796

Optimize request fetching for filters and filter aggs #136796

Comments

flash1293 commented Jul 21, 2022 • edited Loading

Possible optimizations

On Lens level

On AggConfig level

elasticmachine commented Jul 21, 2022

elasticmachine commented Jul 21, 2022

nik9000 commented Jul 27, 2022

elasticmachine commented Jan 15, 2024

lukasolson commented Mar 18, 2024

drewdaemon commented Mar 18, 2024

nik9000 commented Mar 19, 2024

stratoula commented Mar 20, 2024 • edited Loading

flash1293 commented Jul 21, 2022 •

edited

Loading

stratoula commented Mar 20, 2024 •

edited

Loading