Change compound index to improve query performance #5568

khushboobhatia01 · 2022-02-08T05:44:01Z

We've seen an increase in MongoDB memory usage whenever we fetch executions with filters.

Query:
db.action_execution_d_b.find({"action.ref": "sre.fleet_execution", "start_timestamp": {"$gt" : NumberLong("1638316000000000"), "$lt": NumberLong("1644192000000000")},"status" : "succeeded"}, {"context" : 1,"parameters" : 1,"action" : 1,"start_timestamp" : 1,"status" : 1,"runner.runner_parameters" : 1,"_id" : 1,"end_timestamp" : 1}).sort({"start_timestamp" : -1,"action.ref" : 1})
With the above query, we expected the compound index start_timestamp_-1_action.ref_1_status_1 to be used, but the query plan shows that the action.ref_1 index is used instead.
old-query-plan.txt
Execution stats from the old query plan

"executionSuccess" : true,
"nReturned" : 472,
"executionTimeMillis" : 51,
"totalKeysExamined" : 5074,
"totalDocsExamined" : 5074,

totalDocsExamined is the number of documents that were examined, and we want this number to be low. More importantly, we want to look at the ratio of nReturned to totalDocsExamined. Together, these numbers help answer “how much work is MongoDB doing to return me useful data?”. Here our “hit ratio” is 472 / 5074, or ~9.3%. If our cluster examines many more docs than it returns, we're likely to see a few things happen:

longer query times overall
higher utilisation of the cluster's CPU & memory resources
choppier cache residency and eviction
locked & blocking queries under load
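
As a minimal sketch of the hit-ratio arithmetic above: the helper below takes a dict shaped like the execution stats shown earlier and returns nReturned / totalDocsExamined. The function name `hit_ratio` and the handling of a zero denominator are assumptions made for illustration, not part of MongoDB or StackStorm.

```python
def hit_ratio(execution_stats: dict) -> float:
    """Return nReturned / totalDocsExamined for an executionStats-style dict."""
    examined = execution_stats["totalDocsExamined"]
    if examined == 0:
        # Assumption: the ratio is undefined when no documents were examined;
        # this sketch reports 0.0 instead of raising.
        return 0.0
    return execution_stats["nReturned"] / examined


# Numbers copied from the old query plan above.
old_stats = {"nReturned": 472, "totalKeysExamined": 5074, "totalDocsExamined": 5074}
print(f"{hit_ratio(old_stats):.1%}")
```

A value close to 1.0 means nearly every examined document was returned; a low value means most of the fetched documents were discarded by the filter.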

Why didn't MongoDB use the compound index? The query plan shows that the candidate plan using the compound index was rejected because it was very slow and had not returned any docs by the time the winning plan finished.
MongoDB recommends following the ESR (Equality, Sort, Range) rule when creating a compound index (Ref: https://www.mongodb.com/blog/post/performance-best-practices-indexing).
For compound indexes, this rule of thumb is helpful in deciding the order of fields in the index:
1. First, add those fields against which Equality queries are run.
2. The next fields to be indexed should reflect the Sort order of the query.
3. The last fields represent the Range of data to be accessed.
Given the above rule, a much more efficient compound index is {"action.ref": 1, "status": 1, "start_timestamp": -1}. We want the first field to have high cardinality (so action.ref is preferred over status), and start_timestamp is mostly used to access a range of data.
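
The ESR ordering can be sketched in a few lines of Python. The helper name `esr_index_keys` is hypothetical; it concatenates the equality, sort, and range fields of the query in that order and skips fields that were already placed. The result is a list of (field, direction) pairs, which is the shape PyMongo's `Collection.create_index` accepts for compound keys.

```python
def esr_index_keys(equality, sort, range_fields):
    """Order index keys as Equality, then Sort, then Range, skipping repeats."""
    keys = []
    seen = set()
    for field, direction in [*equality, *sort, *range_fields]:
        if field not in seen:
            seen.add(field)
            keys.append((field, direction))
    return keys


# Fields taken from the query in this PR:
#   equality on action.ref and status,
#   sort on start_timestamp (desc) then action.ref (asc),
#   range ($gt / $lt) on start_timestamp.
index_keys = esr_index_keys(
    equality=[("action.ref", 1), ("status", 1)],
    sort=[("start_timestamp", -1), ("action.ref", 1)],
    range_fields=[("start_timestamp", -1)],
)
print(index_keys)
```

Because action.ref is pinned by an equality match, its appearance in the sort adds nothing new, and start_timestamp serves both the sort and the range, so the index ends up with three keys.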
After creating the above index and rerunning the same query, we see that the new compound index is used, the document hit ratio is now 100%, and the query execution time drops by about 50% (51 ms to 25 ms).
new-query-plan.txt

"executionSuccess" : true,
"nReturned" : 472,
"executionTimeMillis" : 25,
"totalKeysExamined" : 472,
"totalDocsExamined" : 472,

This issue may be less noticeable where the number of executions is low or the execution documents are small, because not much data has to be fetched.

cognifloyd · 2022-02-08T20:16:40Z

Can you add a changelog entry for this as well?

cognifloyd · 2022-02-09T03:55:55Z

Will this need any kind of migration when upgrading between ST2 versions? (eg to recreate the index?)

khushboobhatia01 · 2022-02-09T14:00:06Z

> Will this need any kind of migration when upgrading between ST2 versions? (eg to recreate the index?)

No, that won't be needed.

khushboobhatia01 added 2 commits February 8, 2022 10:46

Change compound index to improve query performance

6af737e

Add Changelog

0114670

pull-request-size bot added the size/XS PR that changes 0-9 lines. Quick fix/merge. label Feb 8, 2022

Add CHANGELOG

94fd3a3

pull-request-size bot added size/S PR that changes 10-29 lines. Very easy to review. and removed size/XS PR that changes 0-9 lines. Quick fix/merge. labels Feb 9, 2022

cognifloyd approved these changes Feb 9, 2022


cognifloyd added this to the 3.7.0 milestone Feb 9, 2022

cognifloyd merged commit 4aac99b into StackStorm:master Feb 9, 2022

arm4b mentioned this pull request Apr 4, 2022

Promote Khushboo Bhatia (@khushboobhatia01) from Contributor to TSC Maintainer #5613

Merged

arm4b mentioned this pull request Nov 23, 2022

Mongodb slow query #5805

Closed
