Resiliency: Cancelling a recovery may leave temporary files behind #7893

bleskes · 2014-09-26T10:07:27Z

We currently cancel recoveries when the shard is no longer assigned to the target node, or the primary shard (source of copying) is moved to another node (and there are more scenarios). That cancel logic doesn't clean up any temporary files created during the recovery.

Normally that's not a problem as the files will be cleaned up once the shard is safely recovered somewhere else (or locally). However, if one runs into continuous failure cycles we can fill up disk space, causing bigger problems like corrupting other shards on the node.

At the moment, we leave around temporary files if a peer (replica) recovery is canceled. Those files will normally be cleaned up once the shard is started else but in case of errors this can lead to trouble. If recovery are started and canceled often, we may cause nodes to run out of disk space. Closes elastic#7893

bleskes · 2014-11-03T12:01:03Z

fixed with #8092

bleskes added v1.4.0.Beta1 v2.0.0-beta1 >bug resiliency labels Sep 26, 2014

clintongormley changed the title ~~Recovery: cancelling a recovery may leave temporary files behind~~ Resiliency: Cancelling a recovery may leave temporary files behind Sep 26, 2014

s1monw added v1.4.0 and removed v1.4.0.Beta1 labels Sep 30, 2014

bleskes added v1.5.0 and removed v1.4.0 labels Nov 3, 2014

bleskes closed this as completed Nov 3, 2014

bleskes mentioned this issue Mar 19, 2015

Refactor RecoveryTarget state management #8092

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Resiliency: Cancelling a recovery may leave temporary files behind #7893

Resiliency: Cancelling a recovery may leave temporary files behind #7893

bleskes commented Sep 26, 2014

bleskes commented Nov 3, 2014

Resiliency: Cancelling a recovery may leave temporary files behind #7893

Resiliency: Cancelling a recovery may leave temporary files behind #7893

Comments

bleskes commented Sep 26, 2014

bleskes commented Nov 3, 2014