Allow adjustment of index resource constraints in ILM phase transitions #44070

Closed
DaveCTurner opened this issue Jul 8, 2019 · 17 comments · Fixed by #76134, #76732, #76775, #76780 or #76794
Labels: :Data Management/ILM+SLM (Index and Snapshot lifecycle management) · good first issue (low hanging fruit) · help wanted (adoptme) · Team:Data Management (Meta label for data/management team)

Comments

@DaveCTurner
Contributor

The shards allocator mostly does a good job of spreading the shards of each index out across the available nodes, but there are some corner cases where it might temporarily concentrate too many hot shards on a single node, leading to poor performance and possible resource exhaustion (#17213). It is occasionally useful to add the index.routing.allocation.total_shards_per_node constraint when indices are under heavy load, to prevent too many hot shards from being allocated to the same node. We are contemplating generalising this to a more nuanced set of per-index resource constraints too (#17213 (comment)).

If an index is managed by ILM then it is likely its resource requirements will change along with its lifecycle. In particular when the index leaves the hot phase it will no longer see such heavy indexing, and so any total_shards_per_node constraint is likely no longer necessary. It's important to keep the use of this sort of setting to a minimum to avoid over-constraining the allocator since this can lead to unassigned shards (#12273). I therefore think it'd be a good idea to allow this kind of setting to be changed in the appropriate ILM phase transitions.
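For illustration only, this is the kind of manual per-index adjustment being discussed; the index name below is hypothetical, and setting the value to null simply restores the default (unlimited):

```
# Constrain a hot index to at most two of its shards per node (index name is illustrative)
PUT logs-hot-000001/_settings
{
  "index.routing.allocation.total_shards_per_node": 2
}

# Relax the constraint again once the index is no longer under heavy indexing load
PUT logs-hot-000001/_settings
{
  "index.routing.allocation.total_shards_per_node": null
}
```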

@DaveCTurner DaveCTurner added :Distributed/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) :Data Management/ILM+SLM Index and Snapshot lifecycle management team-discuss labels Jul 8, 2019
@elasticmachine
Collaborator

Pinging @elastic/es-core-features

@elasticmachine
Collaborator

Pinging @elastic/es-distributed

@DaveCTurner
Contributor Author

I raised this question with the distrib team today and although we're not super-keen on recommending the widespread use of the total_shards_per_node constraint, we think it's a good idea to allow it to be relaxed on ILM phase transitions, particularly the hot-to-warm one.

@vigyasharma
Contributor

Are per-index resource constraints being planned as prescriptive, i.e. the allocator makes a best effort to comply with the constraints but allocates the shards anyway if no node can fulfill the requirements, or will they be a hard limit that leaves shards unallocated when it cannot be satisfied?

Resource requirements are often quite dynamic. Even in the hot stage, an index may have off-peak hours when the reserved resources could have been shared by other indices.

Requirement prediction itself is hard, with users almost always under- or over-provisioning in the first pass; it takes iterative fine-tuning to get the limits right. I would vote for a prescriptive, best-effort approach to allow for such flexibility.

Separately, +1 on revising constraints through ILM transitions.

@dakrone
Member

dakrone commented Jul 18, 2019

We discussed this and agreed on putting an option for setting index.routing.allocation.total_shards_per_node into the allocate action in ILM.
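As a sketch of what this could look like once the option exists (the policy name and min_age below are illustrative, not part of the agreed design), the hot-to-warm transition would lift the constraint via the allocate action, with -1 meaning unlimited:

```
PUT _ilm/policy/hot-warm-example
{
  "policy": {
    "phases": {
      "warm": {
        "min_age": "1d",
        "actions": {
          "allocate": {
            "total_shards_per_node": -1
          }
        }
      }
    }
  }
}
```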

@DaveCTurner DaveCTurner removed the :Distributed/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) label Jul 22, 2019
@dakrone dakrone added help wanted adoptme good first issue low hanging fruit labels Jul 23, 2019
@avneesh91
Contributor

@DaveCTurner would it be ok if I started working on this?

@dakrone
Member

dakrone commented Aug 5, 2019

@avneesh91 sure if you'd like to contribute a PR for this that would be great.

@shoaib4330

@avneesh91 I'd like to pick this issue, if you're not actively working on it?

@epicvinny

It is impractical to use ILM at scale :(

@Aloshi

Aloshi commented Dec 9, 2020

Another +1 to this issue. We've been having problems when we scale up our larger clusters by 2-3 nodes: ES decides to allocate all of the shards for the next index on the new nodes, which causes abysmal performance and slows down ingestion. We've been guarding against it by setting index.routing.allocation.total_shards_per_node to 1 or 2 on our big indices until the new nodes are mostly full. We'd like to just leave this set all the time, but when these indices are moved to warm nodes there aren't enough warm nodes to satisfy the constraint (since having fewer warm nodes than hot nodes is kind of the point).

Is there any update on this? It sounds like the proposed solution would work great for us.

@ppf2
Member

ppf2 commented Dec 20, 2020

+1. Shard imbalance necessitating the use of total_shards_per_node does not only apply to the heavy-indexing use case; we have seen it affect other cluster operations such as force merge, when a majority of the shards of an index being force merged ended up concentrated on a single warm node.

@JohnLyman

> We discussed this and agreed on putting an option for setting index.routing.allocation.total_shards_per_node into the allocate action in ILM.

Resetting total_shards_per_node is also required when using the shrink action, which requires all primary shards to be moved to the same node.

@dakrone - if this option is put in the allocate action, would that allow for the value to be reset prior to shrink? I'm not sure of the order of operations here (I don't think it's documented). In other words, does allocate come before shrink in the warm phase?

@dakrone
Member

dakrone commented Jan 6, 2021

> if this option is put in the allocate action, would that allow for the value to be reset prior to shrink? I'm not sure of the order of operations here (I don't think it's documented). In other words, does allocate come before shrink in the warm phase?

allocate does come before shrink in the warm phase, though regardless of the ordering I think we should endeavor to make shrink run whatever the setting is: unsetting the setting while the shards are being allocated for the shrink, then resetting it afterwards if necessary.
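To make the conflict concrete, here is a minimal sketch (policy name, min_age, and shard counts are illustrative) of a warm phase that sets total_shards_per_node to 1 via allocate and then shrinks, which needs every primary on one node; the later change referenced in the commits below (#76732) resolves this by unsetting the property during the shrink so it falls back to unlimited:

```
PUT _ilm/policy/warm-shrink-example
{
  "policy": {
    "phases": {
      "warm": {
        "min_age": "7d",
        "actions": {
          "allocate": {
            "total_shards_per_node": 1
          },
          "shrink": {
            "number_of_shards": 1
          }
        }
      }
    }
  }
}
```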

@JohnLyman

That seems reasonable @dakrone (and thanks for the quick reply). Should I open a new issue for your suggestion?

@dakrone
Member

dakrone commented Jan 6, 2021

@JohnLyman I think it can be subsumed into this issue for now, thanks for bringing it up!

@JohnLyman

I was thinking for shrink in general, whether it's initiated by ILM or not.

@dakrone
Member

dakrone commented Jan 6, 2021

I think that is one of the main goals of #63519

masseyke added a commit that referenced this issue Aug 11, 2021
Allow for setting the total shards per node in the Allocate ILM action (#76134)

This adds a new optional field to the allocate ILM action called "total_shards_per_node". If present, the value of this field is set as the value of "index.routing.allocation.total_shards_per_node" before the allocation takes place.
Relates to #44070
masseyke added a commit that referenced this issue Aug 20, 2021
… is too low (#76732)

We added configuration to AllocateAction to set the total shards per node property on the index. This makes it possible that a user could set this to a value lower than the total number of shards in the index that is about to be shrunk, meaning that all of the shards could not be moved to a single node in the ShrinkAction. This commit unsets the total shards per node property so that we fall back to the default value (-1, unlimited) in the ShrinkAction to avoid this.
Relates to #44070
masseyke added a commit to masseyke/elasticsearch that referenced this issue Aug 20, 2021
Allow for setting the total shards per node in the Allocate ILM action (elastic#76134)

This adds a new optional field to the allocate ILM action called "total_shards_per_node". If present, the value of this field is set as the value of "index.routing.allocation.total_shards_per_node" before the allocation takes place.
Relates to elastic#44070
masseyke added a commit to masseyke/elasticsearch that referenced this issue Aug 20, 2021
… is too low (elastic#76732)

We added configuration to AllocateAction to set the total shards per node property on the index. This makes it possible that a user could set this to a value lower than the total number of shards in the index that is about to be shrunk, meaning that all of the shards could not be moved to a single node in the ShrinkAction. This commit unsets the total shards per node property so that we fall back to the default value (-1, unlimited) in the ShrinkAction to avoid this.
Relates to elastic#44070
masseyke added a commit that referenced this issue Aug 20, 2021
… is too low (#76732) (#76780)

This is a backport of #76732. We added configuration to AllocateAction to set the total shards per node property on the index. This makes it possible that a user could set this to a value lower than the total number of shards in the index that is about to be shrunk, meaning that all of the shards could not be moved to a single node in the ShrinkAction. This commit unsets the total shards per node property so that we fall back to the default value (-1, unlimited) in the ShrinkAction to avoid this.
Relates to #44070
masseyke added a commit that referenced this issue Aug 23, 2021
Allow for setting the total shards per node in the Allocate ILM action (#76134) (#76775)

This is a backport of #76134. It adds a new optional field to the allocate ILM action called "total_shards_per_node". If present, the value of this field is set as the value of "index.routing.allocation.total_shards_per_node" before the allocation takes place.
Relates to #44070
masseyke added a commit that referenced this issue Aug 23, 2021
Updating the version where the total_shards_per_node parameter is supported, after backporting the feature to 7.16.
Relates to #76775 #44070