Inc store reference before refresh #28656

jimczi · 2018-02-13T13:17:55Z

If a tragic even happens while we are refreshing a searcher/reader the engine can open new files on a store that is already closed.
For instance the following CI job failed because a refresh was concurrently called on a failing shard:
https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+oracle-java10-periodic/84
This change increments the ref count of the store during a refresh in order to postpone the closing after a tragic event.

If a tragic even happens while we are refreshing a searcher/reader the engine can open new files on a store that is already closed For instance the following CI job failed because a merge was concurrently called on a failing shard: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+oracle-java10-periodic/84 This change increments the ref count of the store during a refresh in order to postpone the closing after a tragic event.

s1monw

LGTM good catch

If a tragic even happens while we are refreshing a searcher/reader the engine can open new files on a store that is already closed For instance the following CI job failed because a merge was concurrently called on a failing shard: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+oracle-java10-periodic/84 This change increments the ref count of the store during a refresh in order to postpone the closing after a tragic event.

bleskes · 2018-02-13T14:44:58Z

great catch. I think it's trappy that acquiring a lock isn't enough to keep things alive. I think this kind of failure can happen in many other ways. I'm wondering if should increment the store count everytime we issue a lock and decrement it when the lock is freed. Just a thought. This change is great.

jimczi added >bug :Engine v7.0.0 v6.3.0 labels Feb 13, 2018

jimczi requested review from jpountz and s1monw February 13, 2018 13:17

s1monw approved these changes Feb 13, 2018

View reviewed changes

jpountz approved these changes Feb 13, 2018

View reviewed changes

jimczi merged commit 3b9f530 into elastic:master Feb 13, 2018

jimczi deleted the refresh_inc_store branch February 13, 2018 14:38

bleskes mentioned this pull request Feb 15, 2018

ES Crash: 'Failed to close SearcherManager' #28687

Closed

colings86 added v7.0.0-beta1 and removed v7.0.0 labels Feb 7, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inc store reference before refresh #28656

Inc store reference before refresh #28656

jimczi commented Feb 13, 2018 •

edited

Loading

s1monw left a comment

bleskes commented Feb 13, 2018

Inc store reference before refresh #28656

Inc store reference before refresh #28656

Conversation

jimczi commented Feb 13, 2018 • edited Loading

s1monw left a comment

Choose a reason for hiding this comment

bleskes commented Feb 13, 2018

jimczi commented Feb 13, 2018 •

edited

Loading