
Add single request circuit breaker. #46962

Closed

Conversation

@howardhuanghua (Contributor) commented Sep 23, 2019

Currently we have a request circuit breaker, but all query requests share its memory limit pool, so requests can impact each other.

This PR introduces a single-request circuit breaker that limits the memory usage of an individual request. A data analysis cluster may serve many regular small search requests, but a single query that consumes a lot of memory can cause other queries to fail. In that case, the single-request circuit breaker can prevent one memory-hungry search request from impacting the regular search traffic.

By default, the new setting indices.breaker.single_request.limit is 30%, half of the default indices.breaker.request.limit.
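As a rough illustration of the intended behavior (not the actual implementation in this PR or the Elasticsearch CircuitBreaker API — the class and method names below are assumptions), a per-request limit check might look like:

```java
// Illustrative sketch only: a breaker scoped to a single request, tripping
// when that one request's accounted memory exceeds its own limit.
public class SingleRequestBreaker {
    private final long limitBytes; // e.g. the indices.breaker.single_request.limit share of heap
    private long usedBytes;        // bytes charged to this one request so far

    public SingleRequestBreaker(long limitBytes) {
        this.limitBytes = limitBytes;
    }

    // Charge bytes to this request; throw if the per-request limit is exceeded.
    public void addEstimateAndMaybeBreak(long bytes) {
        usedBytes += bytes;
        if (usedBytes > limitBytes) {
            throw new IllegalStateException(
                "[single_request] would use " + usedBytes + "b, limit " + limitBytes + "b");
        }
    }

    public long getUsed() {
        return usedBytes;
    }
}
```

Unlike the shared request breaker, each instance here tracks only one request, so a single oversized query trips its own breaker without consuming the pool shared with other requests.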

@original-brownbear added the :Core/Infra/Circuit Breakers (Track estimates of memory consumption to prevent overload) label Sep 23, 2019
@elasticmachine (Collaborator)

Pinging @elastic/es-core-infra

@original-brownbear (Member)

@howardhuanghua thanks for this suggestion.

I have a question: this change does not really add a per-request circuit breaker, does it?
As far as I can see, it simply limits the maximum size of any single allocation (at least as seen by the circuit breaker). I wonder what others think, but I'm not sure such a limit will have much practical impact: we rarely make these kinds of single, massive allocations, and per-request cost is mostly the sum of many smaller allocations that would have to be tracked for the lifetime of the request.

@howardhuanghua what kind of massively expensive request is this allocation limit catching for you in practice?

@howardhuanghua (Contributor, Author)

Hi @original-brownbear, sorry for the delay. Yes, you are right; this PR only accounts for a small piece of the memory used by a search request. The problem we have encountered is mainly with the coordinating node's breaker, and I also opened #46751 and #47806. Sometimes a single big range aggregation request consumes a lot of memory on the coordinating node and can impact other regular requests, so we are trying to add a per-request limit.

It is hard to collect every piece of memory used by a single search request, but I think we could account for the major consumers. I have some ideas for the coordinating node:

  1. Collect each shard result response in InboundHandler.java#handleResponse for a single search request before deserialization, as described in #47806.
  2. Estimate the response object size from the received message length of each shard result.
  3. Sum the estimated sizes at the request level (each search request has a unique AbstractSearchAsyncAction object) and compare the total against the single-request limit to decide whether to trip the breaker.
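The three steps above could be sketched roughly as follows. This is not actual Elasticsearch code: the class, the method names, and the INFLATION_FACTOR are assumptions made for illustration.

```java
import java.util.concurrent.atomic.AtomicLong;

// Sketch of the proposed pre-deserialization accounting on the
// coordinating node, one instance per in-flight search request.
public class ShardResponseAccounting {
    // Assumed: a deserialized response object is somewhat larger than its wire bytes.
    static final double INFLATION_FACTOR = 1.5;

    private final long singleRequestLimitBytes;
    // One counter per search request (conceptually, per AbstractSearchAsyncAction).
    private final AtomicLong estimatedBytes = new AtomicLong();

    public ShardResponseAccounting(long singleRequestLimitBytes) {
        this.singleRequestLimitBytes = singleRequestLimitBytes;
    }

    // Step 2: estimate the response object size from the received message length.
    static long estimateFromMessageLength(long wireBytes) {
        return (long) (wireBytes * INFLATION_FACTOR);
    }

    // Steps 1 and 3: account one shard result before deserialization; returns
    // false when the accumulated estimate exceeds the single-request limit.
    public boolean tryAccountShardResult(long wireBytes) {
        long total = estimatedBytes.addAndGet(estimateFromMessageLength(wireBytes));
        return total <= singleRequestLimitBytes;
    }
}
```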

Alternatively, if we have already calculated each shard result's size in InboundHandler.java#handleResponse, we could sum the shard result sizes for the current search request as the results are consumed:

void consumeResult(Result result) {
    assert results.get(result.getShardIndex()) == null : "shardIndex: " + result.getShardIndex() + " is already set";
    results.set(result.getShardIndex(), result);
}

In this case, since all received shard result responses have already been deserialized, the calculated size reflects real memory usage rather than a pre-deserialization estimate.
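This consume-time accounting could look roughly like the sketch below, mirroring the consumeResult snippet above. The Long stand-in for a shard result and the ramBytesUsed parameter are illustrative assumptions, not actual Elasticsearch types.

```java
import java.util.concurrent.atomic.AtomicReferenceArray;

// Sketch of the alternative: sum already-deserialized shard result sizes at
// consume time, giving real memory usage rather than a wire-size estimate.
public class ConsumingAccountant {
    private final AtomicReferenceArray<Long> results; // stand-in for the shard results array
    private long realBytesUsed; // actual memory of deserialized results

    public ConsumingAccountant(int numShards) {
        this.results = new AtomicReferenceArray<>(numShards);
    }

    // Mirrors consumeResult, additionally summing each result's real size.
    public void consumeResult(int shardIndex, long ramBytesUsed) {
        assert results.get(shardIndex) == null : "shardIndex: " + shardIndex + " is already set";
        results.set(shardIndex, Long.valueOf(ramBytesUsed));
        realBytesUsed += ramBytesUsed;
    }

    public long totalBytes() {
        return realBytesUsed;
    }
}
```

The running total could then be compared against the single-request limit each time a shard result is consumed.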

@rjernst added the Team:Core/Infra (Meta label for core/infra team) label May 4, 2020
@rjernst (Member) commented Feb 17, 2021

After reviewing this PR, we do not believe the change is worth the added complexity. With a setting value of 30%, it would not prevent many (if any) conceivable out-of-memory situations that are not already covered by the existing circuit breakers. Also, as pointed out above, there is not necessarily a correlation between message size and the amount of memory a request will actually consume, which limits the use cases for this approach.

For now we have decided to pursue limiting request size only at the REST layer (see #67804), which will also filter out large transport messages from indexing on the transport layer. Since we do not plan to move forward with the approach in this PR, I hope you don't mind that I close it.

@rjernst closed this Feb 17, 2021
Labels: :Core/Infra/Circuit Breakers, feedback_needed, Team:Core/Infra