Query prioritisation support #1017

Bukhtawar · 2021-07-27T18:35:37Z

Problem

We lack a prioritisation mechanism for queries. For instance:

At the shard level, fetch phase requests could have a higher priority than query phase requests, so that requests are more likely to complete quickly
Async search queries running across multiple clusters should have a lower priority than usual search requests
Resource-intensive queries could similarly have a lower priority

AmiStrn · 2021-07-27T18:41:41Z

I was having a discussion about this literally just today. Thanks for this @Bukhtawar

getsaurabh02 · 2021-08-24T06:57:19Z

Adding some thoughts here and copying over the details from #1140, which I am closing since this issue was created first.

As part of #1042 we are planning to do resource mapping of queries, and selective rejection when under duress. We want to extend the solution to also allow query prioritisation, which provides a mechanism to selectively execute queries when multiple queries with different priorities are contending for the same resources.

Not all queries in a workload are of equal importance to the customer. Often the performance of one request or set of queries is more important than that of others. With query prioritisation, customers can define the relative importance of queries in a workload by setting a priority value. The priority is specified for a dynamic queue, for example one of (CRITICAL, HIGHEST, HIGH, NORMAL, LOW, LOWEST).

OpenSearch will use the priority when accepting queries for execution, and to determine the amount of resources to allocate to a query. By default, queries run with their priority set to NORMAL. The query priority is also used under duress, to selectively cancel resource-guzzling queries and recover the system.
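To make the idea concrete, here is a minimal sketch of priority-ordered admission with NORMAL as the default. It is not OpenSearch code: the class `PriorityAdmissionQueue`, its method names, and the numeric values behind each level are all assumptions for illustration, and resource usage per query is not modelled.

```python
import heapq
import itertools
from enum import IntEnum


class Priority(IntEnum):
    # Level names come from the comment above; the numeric values are assumed.
    CRITICAL = 0
    HIGHEST = 1
    HIGH = 2
    NORMAL = 3
    LOW = 4
    LOWEST = 5


class PriorityAdmissionQueue:
    """Hypothetical queue that releases the most important pending query first."""

    def __init__(self):
        self._heap = []
        # Monotonic counter used as a tie-breaker, so queries of equal
        # priority are released in submission (FIFO) order.
        self._counter = itertools.count()

    def submit(self, query_id, priority=Priority.NORMAL):
        heapq.heappush(self._heap, (priority, next(self._counter), query_id))

    def next_query(self):
        priority, _, query_id = heapq.heappop(self._heap)
        return query_id, priority

    def shed_lowest(self):
        """Under duress, drop the least important, most recently queued query."""
        victim = max(self._heap)
        self._heap.remove(victim)
        heapq.heapify(self._heap)  # restore the heap invariant after removal
        return victim[2], victim[0]


queue = PriorityAdmissionQueue()
queue.submit("batch-report", Priority.LOW)
queue.submit("dashboard")  # no priority given, so it defaults to NORMAL
queue.submit("checkout-search", Priority.CRITICAL)
first_id, first_priority = queue.next_query()  # the CRITICAL query is released first
```

A real implementation would also have to weigh per-query resource usage when choosing what to cancel, as described above; this sketch orders by priority alone.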

jhinch-at-atlassian-com · 2023-11-03T22:21:32Z

I wanted to capture a practical use case for this feature in the company I work at. We have three main sources of load to OpenSearch:

Live user queries (essentially powering the search results of a search box) which have strict reliability, availability and latency SLOs. These queries need to run immediately, as there is an end user waiting for search results. Queries of this type show daily peaks during business hours and lows outside business hours. An outage of these use cases would have material customer impact.
Background batch jobs whose queries can be slower to execute, can be retried and can be run at any point in the day.

Ideally, if OpenSearch were under stress (particularly on specific data nodes), it would first shed load from the background jobs before it starts to shed live user queries. This could be done outside of OpenSearch, but there are two problems with that. Either the backpressure would be too crude, failing requests which would have executed just fine because their target data node/shard was not under stress even though others were, or it would require complex logic to track which shards and data nodes a request maps to, plus external monitoring to determine whether those shards/data nodes are under stress. The OpenSearch cluster itself is much better placed to determine which shards are overloaded, provided it knows which requests are more or less important.
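As a rough illustration of the behaviour I have in mind, the sketch below sheds background work only when it targets a node that is actually under pressure, and sheds live queries only at a higher pressure level. Everything in it is hypothetical (the `SearchRequest` type, the `should_reject` function, and the two-tier split into "stressed" and "overloaded" nodes); it is not an existing OpenSearch API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SearchRequest:
    request_id: str
    target_nodes: frozenset  # data nodes the request would touch
    is_background: bool = False  # True for retryable batch jobs


def should_reject(request, stressed_nodes, overloaded_nodes):
    """Decide whether to shed a request.

    stressed_nodes: nodes under enough pressure to shed background work.
    overloaded_nodes: nodes under enough pressure to shed live queries too.
    Both thresholds are assumptions made for this sketch.
    """
    # Any request touching an overloaded node is shed, live or not.
    if request.target_nodes & overloaded_nodes:
        return True
    # Background work is shed earlier, but only if it hits a stressed node.
    if request.is_background and request.target_nodes & stressed_nodes:
        return True
    return False


live = SearchRequest("live-search", frozenset({"node-1"}))
batch_hot = SearchRequest("batch-hot", frozenset({"node-1", "node-2"}), is_background=True)
batch_cold = SearchRequest("batch-cold", frozenset({"node-3"}), is_background=True)

stressed = {"node-1"}
decisions = {r.request_id: should_reject(r, stressed, set())
             for r in (live, batch_hot, batch_cold)}
```

With only `node-1` stressed, the batch job that touches it is rejected, while the live query and the batch job on `node-3` still run. That per-node decision is hard to reproduce from outside the cluster.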

Bukhtawar added the enhancement label Jul 27, 2021

Bukhtawar mentioned this issue Jul 27, 2021

Search Memory Tracking - track memory used during a shard search #1009

Open

6 tasks

malpani mentioned this issue Aug 11, 2021

Part 1: Support for cancel_after_timeinterval parameter in search and msearch request #986

Merged

2 tasks

Bukhtawar mentioned this issue Aug 24, 2021

Query Prioritization #1140

Closed

getsaurabh02 mentioned this issue Aug 31, 2021

[Meta] BackPressure in the OpenSearch Query (Search) path #1042

Open

anasalkouz added the distributed framework and Indexing & Search labels Nov 17, 2021

Bukhtawar mentioned this issue Jul 26, 2023

[RFC] High Level Vision for Core Search in OpenSearch #8879

Open

anasalkouz removed the distributed framework label Sep 19, 2023

andrross mentioned this issue Feb 21, 2024

[RFC] Search Query Sandboxing: User Experience #12342

Open

andrross added the Search and Search:Performance labels and removed the Indexing & Search label Feb 29, 2024
