
Deprecate search.max_buckets? #51731

Closed
jpountz opened this issue Jan 31, 2020 · 12 comments · Fixed by #57042
Assignees: imotov
Labels: :Analytics/Aggregations, Team:Analytics

Comments

@jpountz
Contributor

jpountz commented Jan 31, 2020

PR #46751 introduces circuit breaking for the reduce phase and raises the question of whether we can get rid of search.max_buckets, which served a similar purpose by enforcing a limit on the number of buckets rather than their memory usage.
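
For context, search.max_buckets is a dynamic cluster setting (it defaults to 10,000 in 7.x). A minimal sketch of raising it today, using the Kibana Dev Tools console syntax; the value 20,000 is purely illustrative:

PUT _cluster/settings
{
  "persistent": {
    "search.max_buckets": 20000
  }
}

When a request exceeds the limit, it fails with a too_many_buckets_exception instead of returning an oversized aggregation response.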

@elasticmachine
Collaborator

Pinging @elastic/es-analytics-geo (:Analytics/Aggregations)

@jimczi
Contributor

jimczi commented Jan 31, 2020

I agree that #46751 should change the perception of the search.max_buckets setting, but I wonder if we should rather increase the default limit or apply this setting only during the final reduction. IMO this setting is a good protection for users because it forces them to think of solutions that don't require returning thousands of buckets, even if doing so wouldn't exceed memory. This is similar in spirit to the max window size we have for top hits, which redirects users to scroll or search_after if they want to paginate deeply.
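
For readers unfamiliar with the analogy: index.max_result_window similarly caps from + size pagination and nudges users toward search_after. A minimal sketch, where my-index, timestamp, and serial_no are hypothetical names:

GET my-index/_search
{
  "size": 100,
  "sort": [
    { "timestamp": "asc" },
    { "serial_no": "asc" }
  ],
  "search_after": [1580428800000, 10231]
}

search_after takes the sort values of the last hit of the previous page, so clients walk the results page by page rather than requesting everything at once; max_buckets plays a similar steering role for aggregations.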

@polyfractal
Contributor

Been thinking about this a bit. I agree that the setting is still helpful from a "soft limit" standpoint, in that it helps prevent abusive aggs even if they would technically execute without breaking anything. It keeps users from adding size: MAX_INT to all their terms aggs by default because they just want all the results, and instead encourages them to reason about the best way to accomplish the goal (increase the limit? A different agg structure? Use composite? etc.)

It also offers a convenient place to actually limit agg response size, so that clients aren't expected to handle unreasonably large responses.

Now that the memory-safety implications of max_buckets are diminished, I wonder if we need to change the semantics a bit though. There are a few problem areas with how it works today:

  • Even if the soft limit is increased, it can be confusing to users when the setting trips despite the final bucket count being under the limit (Deprecate search.max_buckets? #51731). If a user sets a terms agg to size: 1000 they may be confused when the threshold trips because 20 shards each report back shard_size (≥ 1000) terms and we abort during an incremental reduction (a concrete sketch follows after this list).
  • Some aggs like rare_terms generate more buckets on the shard and prune them during reductions, so the final bucket count might be very small, yet the request fails due to the soft limit. I imagine future clustering aggs will be similar.
  • Some operations implicitly generate many, many buckets. Geospatial bucketing (tile grid, etc.), x/y scatter charts with histograms, 3d heatmaps, etc. necessarily need a lot of buckets, and even a high soft limit could be problematic. We could potentially solve these with specialized aggs (scatter_chart) that operate more like metrics than buckets, but the underlying issue remains.
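
To make the first bullet concrete, here is a sketch of a request that can trip the limit even though the final response is small (the index and field names are hypothetical):

GET logs-*/_search
{
  "size": 0,
  "aggs": {
    "hosts": {
      "terms": { "field": "host.name", "size": 1000 }
    }
  }
}

With the default shard_size of size * 1.5 + 10 = 1510, each of 20 shards can return up to 1,510 buckets, so an incremental reduction may see roughly 30,000 buckets and trip a 10,000 soft limit even though the merged response contains at most 1,000.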

I'm not sure what to do about these problem cases, or if we should do anything. I guess I see a few options:

  1. Deprecate search.max_buckets. Problem solved :) Although we do lose a potentially useful soft limit
  2. Raise max_buckets to a large threshold, which makes many of the common issues disappear. Anyone hitting the previously mentioned corner cases still has to work around them or increase the limit
  3. Allow max_buckets to be configured on a per-request basis, so a user can opt out of the limit for something like rare_terms or geospatial bucketing. We didn't want this before because the setting was tied to memory implications, but perhaps we can loosen that restriction now?
  4. Only count buckets at the end of the final reduction and rely on the recent breaker changes to catch memory issues. This would help cases like terms and rare_terms avoid false positives, but would not help the geospatial case.

Or some combination of the above :)

@polyfractal
Contributor

Ruminated on this a bit more, leaning towards the dual approach of:

  1. Increase the limit to something larger, 50,000?
  2. Only count buckets during final reduction, after all buckets have been merged. This means it is solely used to limit response size, and avoids all the confusing side-effects (like tripping because terms might have k buckets that must be merged before you can get to the requested n buckets)

@jpountz
Contributor Author

jpountz commented Feb 21, 2020

This sounds like a plan to me.

@jasontedor
Member

It seems that we have a plan here. Is it okay to remove the discuss label, or do we think that broader input would be helpful?

@jpountz jpountz removed the discuss label Feb 22, 2020
@jpountz
Contributor Author

jpountz commented Feb 22, 2020

I think it is, I just removed it.

@nickpeihl
Member

If we do set a new limit for search.max_buckets, I think it would be useful for the Elastic Maps application to set it to at least 65,536. We have a POC in Kibana for constructing vector tiles from documents and geo_tile grid aggregations. The vector tiles we generate are sized in multiples of 64 × 64 pixels, usually 256 × 256. A sufficiently fine geo_tile grid precision could generate a bucket for each pixel in the 256 × 256 tile. So if we are setting search.max_buckets to an arbitrary number, perhaps we can consider at least 65,536?
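
To illustrate the arithmetic: a 256 × 256 pixel tile has 256 * 256 = 65,536 pixels, so a geotile_grid aggregation at a precision fine enough to yield one cell per pixel can produce up to 65,536 buckets for a single tile. A minimal sketch, assuming a hypothetical index with a geo_point field named location and a bounding box roughly matching the tile:

GET maps-demo/_search
{
  "size": 0,
  "query": {
    "geo_bounding_box": {
      "location": {
        "top_left": [-10.0, 50.0],
        "bottom_right": [10.0, 40.0]
      }
    }
  },
  "aggs": {
    "grid": {
      "geotile_grid": { "field": "location", "precision": 16 }
    }
  }
}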

@thomasneirynck
Contributor

+1 on @nickpeihl's suggestion. If search.max_buckets is an arbitrary limit, a limit that is the square of a power of two makes a lot of sense in the context of mapping and tiling (especially wrt the use of geotile_grid).

@polyfractal
Contributor

👍 seems reasonable to me.

Related note, I'll bring this up in our team meeting because we probably want to talk through whether we can increase the limit in a 7.x minor, or whether that would count as a "break". E.g. if a client is relying on it to limit response sizes, raising it could break that client by delivering a much larger response than it expects.

If we decide it's a breaking change, a potential plan could be:

  1. Increase the limit to a large power of 2 in 8.0, leave at 10k for 7.x
  2. Only count buckets during final reduction
  3. Add a per-request setting that allows clients to override in 7.x.

@thomasneirynck
Contributor

wrt (3)

Being able to override the limit on a per-request basis would be useful for Maps in the 7.x time-frame.

There are two ongoing efforts on the Maps-side in 7.x that really could use this:

cc @alexfrancoeur @nreese @nickpeihl

@rjernst rjernst added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label May 4, 2020
@polyfractal polyfractal assigned imotov and unassigned andyb-elastic May 19, 2020
imotov added a commit to imotov/elasticsearch that referenced this issue May 21, 2020
Increases the default search.max_buckets limit to 65,535, and only counts
buckets during reduce phase.

Closes elastic#51731
imotov added a commit that referenced this issue Jun 3, 2020
Increases the default search.max_buckets limit to 65,535, and only counts
buckets during reduce phase.

Closes #51731
imotov added a commit to imotov/elasticsearch that referenced this issue Jun 3, 2020
Increases the default search.max_buckets limit to 65,535, and only counts
buckets during reduce phase.

Closes elastic#51731
imotov added a commit that referenced this issue Jun 3, 2020
Increases the default search.max_buckets limit to 65,535, and only counts
buckets during reduce phase.

Closes #51731
@gleventhal

How does someone (using e.g. v6.6.2) prevent unreasonably large aggregations from hobbling the cluster, without returning misleading results to clients due to the limits on buckets?
