Recovery should wipe the shard state file before starting recovery #10053

s1monw · 2015-03-10T23:46:15Z

When we start recovery of a shard we should wipe the state file of the copy if it's present otherwise gateway allocating can get confused interpreting a shard that is not fully recovered ie. due to a recovery failure as a valid copy since we only write the state when the shard is started.

s1monw · 2015-03-10T23:47:47Z

@brwe can you take care of this?

bleskes · 2015-03-10T23:52:33Z

I wonder if the correct time to wipe any _state file is before the temp file rename. Until then, the recovery doesn’t mess with any non-temp files. If the recover is cancelled, we leave the target shard intact.

On 10 Mar 2015, at 16:48, Simon Willnauer notifications@github.com wrote:

@brwe can you take care of this?

—
Reply to this email directly or view it on GitHub.

s1monw · 2015-03-15T21:26:43Z

@bleskes agreed.. we should remove it before we rename the first file.

brwe · 2015-03-17T20:58:57Z

Just for reference, here is the relevant test failure: http://build-us-00.elasticsearch.org/job/es_core_1x_small/1800/

Today we leave the shard state behind even if a recovery is half finished this causes in rare conditions shards to be recovered and promoted as primaries that have never been fully recovered. Closes elastic#10053

s1monw added >bug v2.0.0-beta1 v1.5.0 v1.4.4 labels Mar 10, 2015

s1monw assigned bleskes and brwe and unassigned bleskes Mar 10, 2015

s1monw added v1.6.0 and removed v1.6.0 v1.5.0 labels Mar 17, 2015

brwe assigned s1monw and unassigned brwe Mar 19, 2015

s1monw removed v1.4.4 v1.6.0 labels Mar 20, 2015

s1monw mentioned this issue Mar 20, 2015

Wipe shard state before switching recovered files live #10179

Merged

s1monw closed this as completed in #10179 Mar 20, 2015

bleskes mentioned this issue Feb 26, 2016

Write shard state metadata as soon as shard is created / initializing #16625

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Recovery should wipe the shard state file before starting recovery #10053

Recovery should wipe the shard state file before starting recovery #10053

s1monw commented Mar 10, 2015

s1monw commented Mar 10, 2015

bleskes commented Mar 10, 2015

s1monw commented Mar 15, 2015

brwe commented Mar 17, 2015

Recovery should wipe the shard state file before starting recovery #10053

Recovery should wipe the shard state file before starting recovery #10053

Comments

s1monw commented Mar 10, 2015

s1monw commented Mar 10, 2015

bleskes commented Mar 10, 2015

s1monw commented Mar 15, 2015

brwe commented Mar 17, 2015