
Fail replica shards locally upon failures #5847

Closed
wants to merge 2 commits

Conversation

@bleskes (Contributor) commented Apr 17, 2014

When a replication operation (index/delete/update) fails to be executed properly, we fail the replica and allow the master to allocate a new copy of it. At the moment, the node hosting the primary shard is responsible for notifying the master of a failed replica. However, if the replica shard is initializing (POST_RECOVERY state), we have a race condition between the failed shard message and moving the shard into the STARTED state. If the latter happens first, the master will fail to resolve the failed shard message.

This PR builds on #5800 and fails the engine of the replica shard if a replication operation fails. This protects us against the above, as the shard will reject the STARTED command from the master. It also makes us more resilient to other race conditions in this area.

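For illustration only, here is a minimal, self-contained sketch of the idea using simplified stand-in types (ShardEngine, ReplicaOperationHandler) rather than the actual Elasticsearch classes touched by this PR; the real diff hunk under review follows below.

    interface ShardEngine {
        // Failing the engine closes the shard locally, so a later STARTED command
        // from the master is rejected instead of racing the failed-shard message
        // sent by the node hosting the primary.
        void fail(String reason, Throwable cause);
    }

    final class ReplicaOperationHandler {
        private final ShardEngine engine;
        private final String transportAction;

        ReplicaOperationHandler(ShardEngine engine, String transportAction) {
            this.engine = engine;
            this.transportAction = transportAction;
        }

        // Exceptions that do not indicate a broken shard (e.g. the shard is already
        // closed) should not fail the replica; this check is a placeholder for the
        // sketch, not the real ignoreReplicaException logic.
        private boolean ignoreReplicaException(Throwable t) {
            return t instanceof IllegalStateException;
        }

        void failReplicaIfNeeded(String index, int shardId, Throwable t) {
            if (!ignoreReplicaException(t)) {
                engine.fail("failed to perform [" + transportAction + "] on replica ["
                        + index + "][" + shardId + "]", t);
            }
        }
    }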

private void failReplicaIfNeeded(String index, int shardId, Throwable t) {
    if (!ignoreReplicaException(t)) {
        logger.warn("Failed to perform " + transportAction + " on replica [" + index + "][" + shardId + "]. failing shard.", t);
Review comment from a Member:

we end up double logging warnings, no? The first here, and the second when failing the engine. I think it's enough to log a warning when failing the engine later.

Reply from a Contributor:

I tend to agree but I think we should log that we executed this as debug?

@s1monw (Contributor) commented Apr 17, 2014

one small comment but otherwise LGTM

@bleskes (Contributor, Author) commented Apr 18, 2014

I pushed another commit with the log message removed. I adapted the reason (which is logged by the shard failure) to include the information that was missing. I decided in the end not to add debug logging, as there is no logic and hardly any code between here and where we log it. If anyone feels strongly about it, I'll happily add it.
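As a purely hypothetical illustration (the exact wording of the committed reason string is not shown in this thread), folding that information into the failure reason could look like:

    // Hypothetical sketch only: the context that previously went into a separate
    // logger.warn(...) call is carried in the reason string, which the
    // shard-failure path logs exactly once.
    final class ReplicaFailureReason {
        static String of(String transportAction, String index, int shardId, Throwable t) {
            return "failed to perform [" + transportAction + "] on replica ["
                    + index + "][" + shardId + "], message [" + t.getMessage() + "]";
        }
    }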

@s1monw (Contributor) commented Apr 18, 2014

LGTM

bleskes closed this in 12bbe28 on Apr 18, 2014
bleskes added a commit that referenced this pull request Apr 18, 2014

Closes #5847
@bleskes (Contributor, Author) commented Apr 18, 2014

thx @s1monw @kimchy. pushed.

bleskes deleted the enhance/local_fail_shard branch on April 18, 2014 17:01
clintongormley added the :Distributed/Recovery label on Jun 7, 2015
Labels
:Distributed/Recovery (Anything around constructing a new shard, either from a local or a remote source.), >enhancement, v1.2.0, v2.0.0-beta1

4 participants