Primary shard allocator observes limits in forcing allocation #19811

abeyad · 2016-08-04T19:03:25Z

Previously, when allocating primary shards that have
prior allocation IDs, if all nodes returned a
NO decision for allocation (e.g. the settings blocked
allocation on that node), we would choose one of those
nodes and force the primary shard to be allocated to it.

However, this meant that primary shard allocation
would not adhere to the decision of the MaxRetryAllocationDecider,
which would lead to attempting to allocate a shard
that has already failed N times (presumably
due to some configuration issue).

This commit solves this issue by introducing the
notion of force allocating a primary shard to a node;
each decider implementation must decide whether
this is allowed or not. In the case of MaxRetryAllocationDecider,
it just forwards the request to canAllocate.
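
The idea can be illustrated with a minimal, self-contained sketch. It does not use the actual Elasticsearch classes: `Decision`, `Decider`, `FilterDecider`, `MaxRetryDecider`, and `canForce` are simplified, hypothetical stand-ins, and a plain failed-attempt counter replaces the real shard routing and allocation arguments.

```java
import java.util.List;

class ForceAllocationSketch {
    // Simplified stand-in for a decision type; not the real Elasticsearch class.
    enum Decision { YES, THROTTLE, NO }

    // Hypothetical, stripped-down decider interface.
    interface Decider {
        Decision canAllocate(int failedAttempts);

        // Assumed default: a decider does not object to force-allocating a primary.
        default Decision canForceAllocatePrimary(int failedAttempts) {
            return Decision.YES;
        }
    }

    // Stand-in for a settings-based filter that blocks normal allocation.
    static class FilterDecider implements Decider {
        public Decision canAllocate(int failedAttempts) {
            return Decision.NO;
        }
    }

    // Stand-in for a retry-limit decider: forcing must still respect the limit,
    // so the force check simply forwards to canAllocate.
    static class MaxRetryDecider implements Decider {
        private final int maxRetries;

        MaxRetryDecider(int maxRetries) {
            this.maxRetries = maxRetries;
        }

        public Decision canAllocate(int failedAttempts) {
            return failedAttempts >= maxRetries ? Decision.NO : Decision.YES;
        }

        @Override
        public Decision canForceAllocatePrimary(int failedAttempts) {
            return canAllocate(failedAttempts);
        }
    }

    // Forcing is allowed only if no decider answers NO to the force check.
    static boolean canForce(List<Decider> deciders, int failedAttempts) {
        for (Decider d : deciders) {
            if (d.canForceAllocatePrimary(failedAttempts) == Decision.NO) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        List<Decider> deciders = List.of(new FilterDecider(), new MaxRetryDecider(5));
        System.out.println(canForce(deciders, 2)); // filter is overridden, retries remain
        System.out.println(canForce(deciders, 5)); // retry limit reached, forcing is refused
    }
}
```

In this sketch the filter's NO is overridden when forcing, while the retry limit is not.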

Closes #19446

abeyad · 2016-08-04T19:07:36Z

@ywelsch FYI, initial pass on this, would welcome your feedback.

ywelsch · 2016-08-08T13:02:55Z

core/src/main/java/org/elasticsearch/cluster/routing/allocation/decider/AllocationDecider.java

+    public Decision canForceAllocatePrimary(ShardRouting shardRouting, RoutingNode node, RoutingAllocation allocation) {
+        assert shardRouting.primary() : "must not call canForceAllocatePrimary on a non-primary shard routing [" +
+                                            shardRouting.shardId() + "]";
+        return Decision.YES;


This means that throttling is just ignored.
Assume a scenario where FilterAllocationDecider says NO and ThrottlingAllocationDecider says THROTTLE. The implementation here would just forcefully allocate the shard, ignoring the throttling.
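
A rough sketch of that scenario, assuming the most restrictive decision wins when decisions are combined (NO over THROTTLE over YES). The `Decision` enum and `mostRestrictive` helper are hypothetical stand-ins, not the Elasticsearch classes.

```java
import java.util.List;

class ThrottleIgnoredSketch {
    // Declared from least to most restrictive so ordinals can be compared.
    enum Decision { YES, THROTTLE, NO }

    // Assumed combination rule: the most restrictive decision wins.
    static Decision mostRestrictive(List<Decision> decisions) {
        Decision result = Decision.YES;
        for (Decision d : decisions) {
            if (d.ordinal() > result.ordinal()) {
                result = d;
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // canAllocate results: the filter says NO, throttling says THROTTLE.
        List<Decision> canAllocate = List.of(Decision.NO, Decision.THROTTLE);
        // canForceAllocatePrimary results when every decider returns the default YES.
        List<Decision> canForce = List.of(Decision.YES, Decision.YES);

        System.out.println(mostRestrictive(canAllocate)); // NO
        System.out.println(mostRestrictive(canForce));    // YES: the THROTTLE is lost
    }
}
```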

abeyad · 2016-08-08T16:49:47Z

@ywelsch I pushed 397e374 to address your feedback

ywelsch · 2016-08-09T11:16:32Z

core/src/main/java/org/elasticsearch/cluster/routing/allocation/decider/AllocationDecider.java

+    /**
+     * Returns a {@link Decision} whether the given primary shard can be
+     * forcibly allocated on the given node. This method should only be called
+     * on nodes for which previous allocations exist for the primary shard.


"This method should only be called on nodes for which previous allocations exist for the primary shard." should be more like "This method should only be called for unassigned primary shards where the node has a shard copy on disk."
Can you also assert shardRouting.unassigned() ?

ywelsch · 2016-08-09T11:55:16Z

@abeyad I've left some comments. Can you also check if there are tests that FilterAllocationDecider and other deciders are indeed allowing force-allocating primaries?

I would also like to see some docs where we can explain the implemented behavior. This could be useful to understand why primary shards are / are not allocated.

abeyad · 2016-08-12T04:48:54Z

@ywelsch I pushed 9e47cd1, which addresses your review comments and adds a PrimaryAllocationIT test that covers the filter allocation decider force-allocating primaries.

As we discussed, I augmented the PrimaryShardAllocator with some more javadocs to explain the behavior. Docs explaining shard allocation in general (including primary shard allocation) will come in a separate PR.

ywelsch · 2016-08-12T15:23:08Z

core/src/main/java/org/elasticsearch/cluster/routing/allocation/decider/AllocationDecider.java

+    public Decision canForceAllocatePrimary(ShardRouting shardRouting, RoutingNode node, RoutingAllocation allocation) {
+        assert shardRouting.primary() : "must not call canForceAllocatePrimary on a non-primary shard " + shardRouting;
+        assert shardRouting.unassigned() : "must not call canForceAllocatePrimary on an assigned shard " + shardRouting;
+        return Decision.YES; // by default, a decider will allow force allocation of the primary


I think I prefer the way it was before with the default being

    decision = canAllocate(...)
    if (decision == Decision.NO) {
        decision = Decision.single(Type.YES, "force override of " + decision.label, ...)
    }

This keeps the override logic out of AllocationDeciders.
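
A runnable approximation of that default, using simplified stand-in types (`Type`, `Decision`, and `defaultCanForceAllocatePrimary` are hypothetical names for illustration, and the elided arguments above are not reproduced):

```java
class DefaultForceOverrideSketch {
    enum Type { YES, THROTTLE, NO }

    // Minimal stand-in for a decision carrying a type and a label.
    static final class Decision {
        final Type type;
        final String label;

        Decision(Type type, String label) {
            this.type = type;
            this.label = label;
        }
    }

    // Takes the result of canAllocate and turns only a NO into a forced YES.
    // THROTTLE and YES pass through unchanged, so throttling is still respected.
    static Decision defaultCanForceAllocatePrimary(Decision canAllocateResult) {
        if (canAllocateResult.type == Type.NO) {
            return new Decision(Type.YES, "force override of " + canAllocateResult.label);
        }
        return canAllocateResult;
    }

    public static void main(String[] args) {
        Decision filtered = new Decision(Type.NO, "filter");
        Decision throttled = new Decision(Type.THROTTLE, "throttling");

        System.out.println(defaultCanForceAllocatePrimary(filtered).type);  // YES
        System.out.println(defaultCanForceAllocatePrimary(throttled).type); // THROTTLE
    }
}
```

A decider that must never be overridden would replace this default and return its canAllocate result directly.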

My concern with your suggestion (which was the initial approach) is that any decider that overrides canForceAllocatePrimary can easily forget to take the decision of canAllocate into account. That's why I figured it would be better to put the invocation of canAllocate in AllocationDeciders and, only if the decision is NO (as opposed to THROTTLE, for example), call the decider's canForceAllocatePrimary implementation. The only downside to this that I can think of is that it's rigid, in that no canForceAllocatePrimary method would be able to override a non-NO canAllocate decision, but I don't see any reason why we would want that?

I pushed 5c25b15 to address this

abeyad · 2016-08-12T17:00:06Z

@ywelsch I pushed 9446267 to remove ForcePrimaryDecider in favor of anonymous class creation that extends TestAllocateDecision

abeyad · 2016-08-15T19:46:56Z

@elasticmachine retest this please

ywelsch · 2016-08-16T11:09:47Z

...ain/java/org/elasticsearch/cluster/routing/allocation/decider/MaxRetryAllocationDecider.java

+
+    @Override
+    public Decision canForceAllocatePrimary(ShardRouting shardRouting, RoutingNode node, RoutingAllocation allocation) {
+        assert shardRouting.primary() : "must not call canForceAllocatePrimary on a non-primary shard [" + shardRouting.shardId() + "]";


print shardRouting, not only shardRouting.shardId().

ywelsch · 2016-08-16T11:38:38Z

@abeyad Left minor comments about docs/tests; the change looks good otherwise.

abeyad · 2016-08-16T15:17:17Z

@ywelsch I pushed 5b536be

ywelsch · 2016-08-16T15:22:05Z

LGTM. Thanks @abeyad!

abeyad · 2016-08-16T15:23:02Z

Thanks for the review @ywelsch !

abeyad added >enhancement review v5.0.0-beta1 labels Aug 4, 2016

abeyad changed the title ~~Primary shard allocation observes limits in forcing allocation~~ Primary shard allocator observes limits in forcing allocation Aug 4, 2016

ywelsch reviewed Aug 8, 2016

abeyad force-pushed the improve-primary-allocation-retries branch from 7324b43 to 397e374 Compare August 8, 2016 16:48

ywelsch reviewed Aug 9, 2016

Ali Beyad added 3 commits August 12, 2016 00:04

Force allocation takes throttling decisions into account

3b3d868

Addresses code review comments

9e47cd1

abeyad force-pushed the improve-primary-allocation-retries branch from 397e374 to 9e47cd1 Compare August 12, 2016 04:48

ywelsch reviewed Aug 12, 2016

Remove ForcePrimaryDecider in favor of anonymous classes that extend TestAllocateDecision that provide the desired behavior.

9446267

canForceAllocatePrimary's initial canAllocate check moved out of AllocationDeciders

5c25b15

ywelsch reviewed Aug 16, 2016

Addresses code review comments

5b536be

Remove empty line

a65441d

abeyad merged commit 88aff40 into elastic:master Aug 16, 2016

abeyad deleted the improve-primary-allocation-retries branch August 16, 2016 15:25

lcawl added :Distributed/Distributed and removed :Allocation labels Feb 13, 2018