[CORE] Wait for pending shard removal on IndexService close #8608

s1monw · 2014-11-22T19:43:55Z

When an index service gets closed due to a delete index event it's possible
that shards are already removed from the service that are currently in
recovery. The recovery source applies the delete index first and causes
the target to fail its shard. This shard gets removed from the service
but it's content doesn't get deleted if the actual delete index is faster
applied. This commit makes sure we wait for the pending removals that
are in-flight before we close the index an delete it's content too once
the lock is released.

When an index service gets closed due to a delete index event it's possible that shards are already removed from the service that are currently in recovery. The recovery source applies the delete index first and causes the target to fail its shard. This shard gets removed from the service but it's content doesn't get deleted if the actual delete index is faster applied. This commit makes sure we wait for the pending removals that are in-flight before we close the index an delete it's content too once the lock is released.

s1monw · 2014-11-24T15:11:50Z

@bleskes if you have time I'd love you to review this - assinging you

bleskes · 2014-11-27T14:55:35Z

I looked at it and my main concern is complexity. @s1monw had some ideas about simplifying it and is working on it.

elastic#8436 has introduced shard level locks in order to prevent directory reuse during fast deletion & creation of indices. As part for the change, close listeners were introduced to delete the folders once all out standing references were released. The new change has created race conditions causing shard folders not to be deleted (causing test failures due to left over corruption markers). This commit removes the listeners in favor of a simple timeout based solution to be use until a better listener based solution is ready ( elastic#8608 ).

elastic#8436 has introduced shard level locks in order to prevent directory reuse during fast deletion & creation of indices. As part for the change, close listeners were introduced to delete the folders once all out standing references were released. The new change has created race conditions causing shard folders not to be deleted (causing test failures due to left over corruption markers). This commit removes the listeners in favour of a simple timeout based solution to be use until a better listener based solution is ready ( elastic#8608 ). Closes elastic#9009

s1monw · 2014-12-29T15:17:53Z

closing in favor of #9083

s1monw added >enhancement v1.5.0 v2.0.0-beta1 review labels Nov 22, 2014

s1monw assigned bleskes Nov 24, 2014

bleskes mentioned this pull request Dec 19, 2014

Remove IndexCloseListener & Store.OnCloseListener #9009

Closed

s1monw mentioned this pull request Dec 29, 2014

Delete shard content under lock #9083

Merged

s1monw closed this Dec 29, 2014

s1monw removed review >enhancement v1.5.0 v2.0.0-beta1 labels Dec 29, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CORE] Wait for pending shard removal on IndexService close #8608

[CORE] Wait for pending shard removal on IndexService close #8608

Uh oh!

s1monw commented Nov 22, 2014

Uh oh!

s1monw commented Nov 24, 2014

Uh oh!

bleskes commented Nov 27, 2014

Uh oh!

s1monw commented Dec 29, 2014

Uh oh!

Uh oh!

[CORE] Wait for pending shard removal on IndexService close #8608

[CORE] Wait for pending shard removal on IndexService close #8608

Uh oh!

Conversation

s1monw commented Nov 22, 2014

Uh oh!

s1monw commented Nov 24, 2014

Uh oh!

bleskes commented Nov 27, 2014

Uh oh!

s1monw commented Dec 29, 2014

Uh oh!

Uh oh!