Node shut down during the last phase of recovery needlessly fails shard #9496

bleskes · 2015-01-30T08:06:32Z

During the final stage of recovery, the target shard is moved to POST_RECOVERY and the master is sent a request to activate the shard. At that point the master reports the cluster as green, i.e., it is safe to shut down a node without losing data (potentially going to yellow). However, if the source node is shut down quickly enough, before the recovery code has cleanly finished, we may fail the new copy, resulting in a red index.
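To make the race concrete, here is a toy model of the decision taken when the source node disappears. It is only a sketch and not Elasticsearch code: `ShardState`, `should_fail_naive`, and `should_fail_tolerant` are hypothetical names invented for illustration, and the tolerant variant is an assumption about what a fix could look like, not a description of any actual change.

```python
from enum import Enum


class ShardState(Enum):
    # Simplified, hypothetical subset of shard states for illustration.
    RECOVERING = 1      # data is still being copied from the source
    POST_RECOVERY = 2   # copy is complete, waiting for the master to activate
    STARTED = 3         # master has activated the shard


def should_fail_naive(state: ShardState, recovery_finished: bool) -> bool:
    # Fails the new copy whenever the source goes away before the recovery
    # code has finished cleaning up, no matter how far the shard got.
    return not recovery_finished


def should_fail_tolerant(state: ShardState, recovery_finished: bool) -> bool:
    # Assumed alternative: only fail while data is still being copied.
    # Once the shard has reached POST_RECOVERY it already holds a full
    # copy, so losing the source should not fail it.
    return not recovery_finished and state is ShardState.RECOVERING


# Source node shuts down after the target reached POST_RECOVERY but before
# the recovery code finished cleanly.
print(should_fail_naive(ShardState.POST_RECOVERY, False))
print(should_fail_tolerant(ShardState.POST_RECOVERY, False))
```

In this model the naive check fails a shard that already holds a complete copy, which corresponds to the red index described above, while the tolerant check keeps it.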

This is an issue with a non-released refactoring done on 1.x and master.

See:
http://build-us-00.elasticsearch.org/job/es_g1gc_1x_metal/3366/testReport/junit/org.elasticsearch.recovery/FullRollingRestartTests/testFullRollingRestart/

brwe · 2015-01-30T08:15:19Z

I think this failure of RelocationTests.testMoveShardsWhileRelocation could be caused by the same effect: http://build-us-00.elasticsearch.org/job/es_core_1x_small/1474/

s1monw · 2015-03-17T22:45:56Z

@bleskes I moved this out to 1.6

bleskes · 2015-03-20T11:11:35Z

This should be fixed with #9902

bleskes added the v2.0.0-beta1, v1.5.0, and :Distributed/Recovery labels Jan 30, 2015

s1monw added v1.6.0 and removed v1.5.0 labels Mar 17, 2015

bleskes added v1.5.0 and removed v1.6.0 labels Mar 20, 2015

bleskes closed this as completed Mar 20, 2015

clintongormley added the >enhancement label Jun 8, 2015

clintongormley changed the title ~~Recovery: node shut down during the last phase of recovery needlessly fails shard~~ Node shut down during the last phase of recovery needlessly fails shard Jun 8, 2015
