
index.routing.allocation.initial_recovery._id split after shrink #43955

Closed
laf-rge opened this issue Jul 4, 2019 · 6 comments · Fixed by #44053
Labels
>bug :Distributed/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes)

laf-rge commented Jul 4, 2019

Elasticsearch version (bin/elasticsearch --version): 7.1.1

Plugins installed: []

JVM version (java -version):

OS version (uname -a if on a Unix-like system):

Description of the problem including expected versus actual behavior:

Steps to reproduce:

  1. Create an index
  2. Shrink the index
  3. Remove the node the shrink occurred on from the cluster
  4. Wait for the index to recover
  5. Attempt to split the index
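The steps above can be sketched roughly as follows against a test cluster (index and node names are hypothetical; assumes Elasticsearch on localhost:9200 and the usual shrink preconditions of a write block and all shard copies on one node):

```shell
# 1. Create an index with several primary shards
curl -X PUT "localhost:9200/logs?pretty" -H 'Content-Type: application/json' \
  -d '{"settings": {"index.number_of_shards": 4, "index.number_of_replicas": 0}}'

# 2. Relocate all copies to one node, block writes, then shrink
curl -X PUT "localhost:9200/logs/_settings?pretty" -H 'Content-Type: application/json' \
  -d '{"settings": {"index.routing.allocation.require._name": "node-1", "index.blocks.write": true}}'
curl -X POST "localhost:9200/logs/_shrink/logs-shrunk?pretty" -H 'Content-Type: application/json' \
  -d '{"settings": {"index.number_of_shards": 2}}'

# 3./4. Stop node-1 and wait for logs-shrunk to recover on the remaining nodes

# 5. Attempt the split -- in 7.1.1 this fails, because the copied
#    index.routing.allocation.initial_recovery._id still names the departed node
curl -X POST "localhost:9200/logs-shrunk/_split/logs-split?pretty" -H 'Content-Type: application/json' \
  -d '{"settings": {"index.number_of_shards": 4}}'
```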

Provide logs (if relevant):
The split fails at allocation: the decider insists all primary shards must be on the dead node, and the index.routing.allocation.initial_recovery._id setting cannot be removed from the index.


laf-rge commented Jul 4, 2019

Similar to #31787


laf-rge commented Jul 4, 2019

You might be asking yourself: why would you split an index after a shrink? Short answer: bad ILM policy.

@henningandersen henningandersen added the :Distributed/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) label Jul 4, 2019
@elasticmachine

Pinging @elastic/es-distributed

@DaveCTurner DaveCTurner self-assigned this Jul 6, 2019
DaveCTurner added a commit to DaveCTurner/elasticsearch that referenced this issue Jul 8, 2019
If an index is the result of a shrink then it will have a value set for
`index.routing.allocation.initial_recovery._id`. If this index is subsequently
split then this value will be copied over, forcing the initial allocation of
the split shards to occur on the node on which the shrink took place. Moreover
if this node no longer exists then the split will fail.  This commit suppresses
the copying of this setting when splitting an index.

Fixes elastic#43955
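The mechanism described in the commit message can be observed by inspecting the setting on a shrunken index (index name hypothetical):

```shell
# Show the initial-recovery setting left behind by the shrink
curl -X GET "localhost:9200/logs-shrunk/_settings/index.routing.allocation.initial_recovery._id?pretty"
# Before this fix, a split target inherited this value from its source;
# with the fix, the setting is no longer copied when splitting.
```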
@DaveCTurner

Thanks for the report @laf-rge. Splitting an index after a shrink is a legitimate thing to do so this does look wrong. I opened #44053 to address this.

pull bot pushed a commit to Pandinosaurus/elasticsearch that referenced this issue Jul 8, 2019
DaveCTurner added a commit that referenced this issue Jul 8, 2019

laf-rge commented Jul 8, 2019

This is great. Is there a fix for the current version? Will the updated version also fix this?
(Edit: I meant workaround.)

@DaveCTurner

One possible workaround is to use reindex instead of simply splitting the index. This will take longer, unfortunately, but seems likely to work. Another idea, assuming you haven't already shrunk this index down to a single shard, is to shrink it again (which will overwrite the index.routing.allocation.initial_recovery._id setting) and then split it from there.
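Both workarounds can be sketched as follows (index names hypothetical; assumes the same cluster as above):

```shell
# Workaround 1: reindex into a fresh index with the desired shard count
curl -X PUT "localhost:9200/logs-new?pretty" -H 'Content-Type: application/json' \
  -d '{"settings": {"index.number_of_shards": 4}}'
curl -X POST "localhost:9200/_reindex?pretty" -H 'Content-Type: application/json' \
  -d '{"source": {"index": "logs-shrunk"}, "dest": {"index": "logs-new"}}'

# Workaround 2 (only if logs-shrunk still has more than one shard):
# shrink again, which overwrites initial_recovery._id, then split the result.
# Shrink again requires a write block and all shard copies on one live node.
curl -X PUT "localhost:9200/logs-shrunk/_settings?pretty" -H 'Content-Type: application/json' \
  -d '{"settings": {"index.routing.allocation.require._name": "node-2", "index.blocks.write": true}}'
curl -X POST "localhost:9200/logs-shrunk/_shrink/logs-shrunk-2?pretty"
curl -X POST "localhost:9200/logs-shrunk-2/_split/logs-split?pretty" -H 'Content-Type: application/json' \
  -d '{"settings": {"index.number_of_shards": 4}}'
```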

DaveCTurner added a commit that referenced this issue Jul 9, 2019
DaveCTurner added a commit that referenced this issue Jul 9, 2019