Snapshot/Restore: snapshot during rolling restart of a 2 node cluster might get stuck #9924

imotov · 2015-02-27T19:39:26Z

The issue was originally reported in #7980 (comment) If a current master node that contains all primary shards is restarted in the middle of snapshot operation, it might leave the snapshot hanging in ABORTED state.

The text was updated successfully, but these errors were encountered:

cxxr · 2015-03-10T23:28:59Z

👍 ran into this a few times.

… nodes that no longer exist Related to elastic#9924

… nodes that no longer exist Related to #9924

srgclr · 2015-06-08T08:41:08Z

Ran into a similar issue and tried the snapshot cleanup utility. It didn't work as all shards were ignored:

Ignoring shard [[dev1_10_event.2015-03-15][4]] with state [ABORTED] on node [kyU3N9lpTIuTbdeUGp5ThQ] - node exists : [true]

What's the reason for ignoring shards when the node exists?

imotov · 2015-06-10T13:41:26Z

@srgclr if a node exists and a shard is in ABORTED state it can mean one of the two things - we hit #11314 or the shard is stuck in the I/O operation and we need to wait until the I/O operation is over or we need to restart the node. It's impossible for the cleanup utility to determine which state we are in. Because of this, it takes a safer route - assume that we are stuck in I/O operation and skip such shards.

… nodes that no longer exist Related to elastic#9924

imotov · 2015-08-20T22:20:08Z

This should be solved by #11450. Closing.

imotov added >bug :Distributed/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs labels Feb 27, 2015

imotov self-assigned this Feb 27, 2015

imotov mentioned this issue Feb 27, 2015

Snapshot/Restore: snapshot with missing metadata file cannot be deleted #7980

Closed

imotov mentioned this issue Mar 4, 2015

Delete operation should ignore finalizing shards on nodes that no longer exist #9981

Merged

imotov added a commit to imotov/elasticsearch that referenced this issue Mar 12, 2015

Snapshot/Restore: delete operation should ignore finalizing shards on…

55f2a54

… nodes that no longer exist Related to elastic#9924

imotov added a commit that referenced this issue Mar 12, 2015

Snapshot/Restore: delete operation should ignore finalizing shards on…

81c5160

… nodes that no longer exist Related to #9924

imotov added a commit that referenced this issue Mar 12, 2015

Snapshot/Restore: delete operation should ignore finalizing shards on…

bdd297f

… nodes that no longer exist Related to #9924

imotov mentioned this issue Apr 13, 2015

Snapshot operation stuck, delete command doesn't work #10564

Closed

imotov mentioned this issue May 23, 2015

Snapshot/Restore: restart of a master node during snapshot can lead to hanging snapshots #11314

Closed

mute pushed a commit to mute/elasticsearch that referenced this issue Jul 29, 2015

Snapshot/Restore: delete operation should ignore finalizing shards on…

3de81ac

… nodes that no longer exist Related to elastic#9924

imotov closed this as completed Aug 20, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Snapshot/Restore: snapshot during rolling restart of a 2 node cluster might get stuck #9924

Snapshot/Restore: snapshot during rolling restart of a 2 node cluster might get stuck #9924

imotov commented Feb 27, 2015

cxxr commented Mar 10, 2015

srgclr commented Jun 8, 2015

imotov commented Jun 10, 2015

imotov commented Aug 20, 2015

Snapshot/Restore: snapshot during rolling restart of a 2 node cluster might get stuck #9924

Snapshot/Restore: snapshot during rolling restart of a 2 node cluster might get stuck #9924

Comments

imotov commented Feb 27, 2015

cxxr commented Mar 10, 2015

srgclr commented Jun 8, 2015

imotov commented Jun 10, 2015

imotov commented Aug 20, 2015