Only run retention lease actions on active primary #40386

jasontedor · 2019-03-22T21:45:18Z

In some cases, a request to perform a retention lease action can arrive on a primary shard before it is active. In this case, the primary shard would not yet be in primary mode, tripping an assertion in the replication tracker. Instead, we should not attempt to perform such actions on an initializing shard. This commit addresses this by not returning the primary shard in the single shard iterator if the primary shard is not yet active.

Closes #40089
Closes #40373

In some cases, a request to perform a retention lease action can arrive on a primary shard before it is active. In this case, the primary shard would not yet be in primary mode, tripping an assertion in the replication tracker. Instead, we should not attempt to perform such actions on an initializing shard. This commit addresses this by not returning the primary shard in the single shard iterator if the primary shard is not yet active.

elasticmachine · 2019-03-22T21:45:20Z

Pinging @elastic/es-distributed

jasontedor · 2019-03-22T21:46:27Z

Another option that I considered is rejecting this when acquiring the permit on a shard that is not yet active, but this approach seems preferable to me (note that we manually handle inactive shards elsewhere, such as in the reroute phase of a replication action).

dnhatn

LGTM.

jasontedor · 2019-03-23T10:04:03Z

@elasticmachine test this please

jasontedor · 2019-03-23T11:23:20Z

@elasticmachine run elasticsearch-ci/1

In some cases, a request to perform a retention lease action can arrive on a primary shard before it is active. In this case, the primary shard would not yet be in primary mode, tripping an assertion in the replication tracker. Instead, we should not attempt to perform such actions on an initializing shard. This commit addresses this by not returning the primary shard in the single shard iterator if the primary shard is not yet active.

bleskes · 2019-03-25T13:58:10Z

server/src/main/java/org/elasticsearch/index/seqno/RetentionLeaseActions.java

-                    .shardRoutingTable(request.concreteIndex(), request.request().getShardId().id())
-                    .primaryShardIt();
+                    .shardRoutingTable(request.concreteIndex(), request.request().getShardId().id());
+            if (shardRoutingTable.primaryShard().active()) {


TransportSingleShardAction doesn't do our usual "chase the shard" pattern where we re-resolve shards on each node until we find a place where the shard is locally available. This means that if the coordinating node thinks the shard is active but the node with the shard didn't yet process the shard activation cluster state, I think this still goes wrong (i.e., the primary would not be in primary mode, which is activated when the cluster state is processed). I hope I'm wrong and please let me know what I'm missing.

Good catch. In this case, I think that we should use the other approach that I considered. I can not think of any situations where we would want to acquire a permit on a non-active primary?

That was actually why I went looking - my instinct was that this should be done under permit and that the permit shouldn't be given under a non-initialized primary. We currently don't do that so that requires a much bigger change/vision. We can also have a targeted-check in asyncShardOperation that the shard is active before performing the operation. @ywelsch any thoughts?

RetentionLeaseActions and TransportForgetFollowerAction can also possibly violate the assertion in acquirePrimaryOperationPermit that the shard is actually a primary (by the time the request arrives, the primary could have failed and a replica allocated instead). Ensuring this was previously left to the caller of this method.

We can explore changing acquirePrimaryOperationPermit and acquireAllPrimaryOperationsPermits to throw appropriate exceptions if the shard is not in primary mode (e.g. replica or initializing/relocated primary). TRA can then react to an IndexShardRelocatedException to delegate to the relocation target.

That takes it one level higher - ensuring that the primary is actually an active primary (rather than just an active shard). SGTM.

jasontedor · 2019-03-25T17:11:37Z

I thought about the targeted check there but I realized earlier there is another operation prone to this problem: forget follower which is not a single shard action. That’s why I now lean towards a global approach.

dnhatn · 2019-05-20T03:23:24Z

A failure relating to this: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+6.8+bwc-tests/60/console.

[2019-05-20T02:35:56,265][ERROR][o.e.b.ElasticsearchUncaughtExceptionHandler] [upgraded-node-leader-0] fatal error in thread [Thread-4], exiting
|    java.lang.AssertionError: shard [leader_index3][0], node[K_dcaDtcTlSLJ76pP0Wz8g], [P], recovery_source[existing store recovery; bootstrap_history_uuid=false], s[INITIALIZING], a[id=Ebf14YQsQkaXCWdL38NFKQ], unassigned_info[[reason=NODE_LEFT], at[2019-05-20T02:35:36.628Z], delayed=true, details[node_left [K_dcaDtcTlSLJ76pP0Wz8g]], allocation_status[fetching_shard_data]] is not a primary shard in primary mode
|        at org.elasticsearch.index.shard.IndexShard.assertPrimaryMode(IndexShard.java:1632) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]
|        at org.elasticsearch.index.shard.IndexShard.renewRetentionLease(IndexShard.java:2030) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]
|        at org.elasticsearch.index.seqno.RetentionLeaseActions$Renew$TransportAction.doRetentionLeaseAction(RetentionLeaseActions.java:248) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]
|        at org.elasticsearch.index.seqno.RetentionLeaseActions$Renew$TransportAction.doRetentionLeaseAction(RetentionLeaseActions.java:222) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]
|        at org.elasticsearch.index.seqno.RetentionLeaseActions$TransportRetentionLeaseAction$1.onResponse(RetentionLeaseActions.java:115) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]
|        at org.elasticsearch.index.seqno.RetentionLeaseActions$TransportRetentionLeaseAction$1.onResponse(RetentionLeaseActions.java:110) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]
|        at org.elasticsearch.index.shard.IndexShardOperationPermits.acquire(IndexShardOperationPermits.java:273) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]
|        at org.elasticsearch.index.shard.IndexShardOperationPermits.acquire(IndexShardOperationPermits.java:240) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]
|        at org.elasticsearch.index.shard.IndexShard.acquirePrimaryOperationPermit(IndexShard.java:2561) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]
|        at org.elasticsearch.index.seqno.RetentionLeaseActions$TransportRetentionLeaseAction.asyncShardOperation(RetentionLeaseActions.java:109) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]
|        at org.elasticsearch.index.seqno.RetentionLeaseActions$TransportRetentionLeaseAction.asyncShardOperation(RetentionLeaseActions.java:65) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]
|        at org.elasticsearch.action.support.single.shard.TransportSingleShardAction$ShardTransportHandler.messageReceived(TransportSingleShardAction.java:296) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]
|        at org.elasticsearch.action.support.single.shard.TransportSingleShardAction$ShardTransportHandler.messageReceived(TransportSingleShardAction.java:289) ~[elasticsearch-6.8.0-SNAPSHOT.jar:6.8.0-SNAPSHOT]

@jasontedor @ywelsch I think we need to act on this.

jasontedor added >test Issues or PRs that are addressing/adding tests v7.0.0 :Distributed/CCR Issues around the Cross Cluster State Replication features v8.0.0 v7.2.0 v6.7.1 labels Mar 22, 2019

dnhatn approved these changes Mar 22, 2019

View reviewed changes

jasontedor added >bug and removed >test Issues or PRs that are addressing/adding tests labels Mar 23, 2019

jasontedor merged commit d193f29 into elastic:master Mar 23, 2019

jasontedor deleted the acquire-primary-permit-initializing-shard branch March 23, 2019 13:41

jasontedor mentioned this pull request Mar 23, 2019

[CI] Multiple failures in CcrRetentionLeaseIT #40089

Closed

michaelbaamonde added v7.0.0-rc1 and removed v7.0.0 labels Mar 25, 2019

bleskes reviewed Mar 25, 2019

View reviewed changes

dnhatn mentioned this pull request Apr 3, 2019

[CI] testRecoveryWithConcurrentIndexing timed out waiting for green state for index #40731

Closed

jasontedor mentioned this pull request May 20, 2019

Execute actions under permit in primary mode only #42241

Merged

dnhatn mentioned this pull request Jun 17, 2019

Create peer-recovery retention leases #43190

Merged

jakelandis removed the v8.0.0 label Jul 26, 2021

jakelandis added the v8.0.0-alpha1 label Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Only run retention lease actions on active primary #40386

Only run retention lease actions on active primary #40386

jasontedor commented Mar 22, 2019 •

edited

Loading

elasticmachine commented Mar 22, 2019

jasontedor commented Mar 22, 2019

dnhatn left a comment

jasontedor commented Mar 23, 2019

jasontedor commented Mar 23, 2019

bleskes Mar 25, 2019

jasontedor Mar 25, 2019

bleskes Mar 25, 2019

ywelsch Mar 25, 2019

bleskes Mar 27, 2019

jasontedor commented Mar 25, 2019

dnhatn commented May 20, 2019

Only run retention lease actions on active primary #40386

Only run retention lease actions on active primary #40386

Conversation

jasontedor commented Mar 22, 2019 • edited Loading

elasticmachine commented Mar 22, 2019

jasontedor commented Mar 22, 2019

dnhatn left a comment

Choose a reason for hiding this comment

jasontedor commented Mar 23, 2019

jasontedor commented Mar 23, 2019

bleskes Mar 25, 2019

Choose a reason for hiding this comment

jasontedor Mar 25, 2019

Choose a reason for hiding this comment

bleskes Mar 25, 2019

Choose a reason for hiding this comment

ywelsch Mar 25, 2019

Choose a reason for hiding this comment

bleskes Mar 27, 2019

Choose a reason for hiding this comment

jasontedor commented Mar 25, 2019

dnhatn commented May 20, 2019

jasontedor commented Mar 22, 2019 •

edited

Loading