Search Responses to Many Shards use Excessive Amounts of Memory for OriginalIndices instances 

Running a simple/fast search on a large number of indices via an index pattern can result in significant memory usage on the coordinating node for the `org.elasticsearch.action.OriginalIndices` instances in the search responses.

With a simple pattern such as `auditbeat-${long}` that matches more than 1k indices, this leads to hundreds of MB (and more under heavy load) of duplicate instances on the heap, referenced as follows:

<img width="802" alt="image" src="https://user-images.githubusercontent.com/6490959/134899892-43c5e197-e833-4680-8278-53db17affd26.png">

I think it should be possible to deduplicate these (or even the full search requests referenced in the responses?).
This somewhat relates to #78164 but doesn't just apply to can_match.
The problem with these instances is that the circuit breaker in the response collector does not account for them well, so it is trivial to OOM a coordinating node when a search targets a large number of indices at once and the responses come back rapidly.
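
To illustrate the kind of deduplication suggested above, here is a minimal sketch, not the actual Elasticsearch code. `IndicesHolder` and `Interner` are hypothetical names; `IndicesHolder` only stands in for `OriginalIndices` by holding the index names and ignores any other fields the real class carries. The idea is to map every value-equal instance to one canonical object, so that each shard response references the same instance instead of its own copy.

```java
import java.util.Collections;
import java.util.IdentityHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

final class OriginalIndicesDedupSketch {

    /** Hypothetical stand-in for OriginalIndices: holds only the index names. */
    public static final class IndicesHolder {
        private final List<String> indices;

        public IndicesHolder(List<String> indices) {
            this.indices = indices;
        }

        public List<String> indices() {
            return indices;
        }
    }

    /** Hypothetical interner: returns one canonical holder per distinct list of index names. */
    public static final class Interner {
        private final Map<List<String>, IndicesHolder> canonical = new ConcurrentHashMap<>();

        public IndicesHolder intern(String[] indices) {
            // List.of copies the array into an immutable list with value-based equals/hashCode.
            List<String> key = List.of(indices);
            return canonical.computeIfAbsent(key, IndicesHolder::new);
        }

        public int size() {
            return canonical.size();
        }
    }

    /** Simulates shard responses that each arrive with their own copy of the same index names. */
    public static int distinctHoldersAfterInterning(int shardResponses) {
        Interner interner = new Interner();
        Set<IndicesHolder> distinct = Collections.newSetFromMap(new IdentityHashMap<>());
        for (int i = 0; i < shardResponses; i++) {
            // A fresh array per response mimics per-response deserialization.
            String[] fromWire = new String[] {"auditbeat-1", "auditbeat-2", "auditbeat-3"};
            distinct.add(interner.intern(fromWire));
        }
        return distinct.size();
    }

    public static void main(String[] args) {
        System.out.println("distinct holders: " + distinctHoldersAfterInterning(1000));
    }
}
```

In this sketch the interner would be scoped to a single search request on the coordinating node, so the map is discarded together with the request and does not grow without bound. Whether the real response deserialization path has a suitable place to hook this in is an open question here.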

Search Responses to Many Shards use Excessive Amounts of Memory for OriginalIndices instances #78314
