Allocate primary shards based on allocation IDs #15281

ywelsch · 2015-12-07T14:31:59Z

Add allocation IDs to TransportNodesListGatewayStartedShards action.
Use the above to assign a primary shard on recovery.
Also add allocation id to indices shard store response (/some_index/_shard_stores)

Relates to #14739

bleskes · 2015-12-08T11:17:38Z

core/src/main/java/org/elasticsearch/gateway/PrimaryShardAllocator.java


-            NodesAndVersions nodesAndVersions = buildNodesAndVersions(shard, recoverOnAnyNode(indexSettings), allocation.getIgnoreNodes(shard.shardId()), shardState);
-            logger.debug("[{}][{}] found {} allocations of {}, highest version: [{}]", shard.index(), shard.id(), nodesAndVersions.allocationsFound, shard, nodesAndVersions.highestVersion);
+            Set<String> allocationIds = indexMetaData.activeAllocationIds(shard.id());


call this lastActiveAllocationIds?

ywelsch · 2015-12-09T14:50:10Z

Pushed changes related to our discussion:

Use allocation ids and index creation version to determine allocatedPostIndexCreate
Added integration test for Index creation context lost #15241 and Do not allow stale replicas to automatically be promoted to primary #14671

bleskes · 2015-12-14T09:35:40Z

.../src/main/java/org/elasticsearch/action/admin/indices/shards/IndicesShardStoresResponse.java

@@ -115,9 +116,10 @@ private void writeTo(StreamOutput out) throws IOException {
        private StoreStatus() {
        }

-        public StoreStatus(DiscoveryNode node, long version, Allocation allocation, Throwable storeException) {
+        public StoreStatus(DiscoveryNode node, long version, String allocationId, Allocation allocation, Throwable storeException) {


That Allocation enum is really confusing in this context (I stumbled on this 3 times now). This is unrelated to this change, so I'm not suggesting we change it here - but do you have suggestions for a better name? maybe AllocatedAs ?

What about AllocationStatus? The JSON field should also be adapted.

I liked allocatedAs better :) (note that the name of the java JSON field is ALLOCATED :))

bleskes · 2015-12-14T11:42:41Z

Thanks @ywelsch . I think this looks great. I left some comments and also want to ping @dakrone to discuss the recover on any node option. I hope we can get this simpler...

ywelsch · 2015-12-15T11:31:16Z

@bleskes I pushed a new set of changes that address your comments.

bleskes · 2015-12-15T12:38:44Z

core/src/main/java/org/elasticsearch/gateway/PrimaryShardAllocator.java

+                continue;
+            }
+
+            if (recoverOnAnyNode(indexSettings)) {


can we only skip allocation if we don't find any copy?

bleskes · 2015-12-15T13:30:32Z

This looks great. Left some minor comment and one important one about the recover on any node settings. Also let's have another discussion with @dakrone about the failing test.

ywelsch · 2015-12-16T16:26:59Z

Pushed another set of changes, dealing with recover_on_any_node.

bleskes · 2015-12-17T11:46:24Z

core/src/main/java/org/elasticsearch/gateway/PrimaryShardAllocator.java

+                // Note that once the shard has been active, lastActiveAllocationIds will be non-empty
+                nodesAndVersions = buildNodesAndVersions(shard, snapshotRestore || recoverOnAnyNode, allocation.getIgnoreNodes(shard.shardId()), shardState);
+                if (snapshotRestore || recoverOnAnyNode) {
+                    enoughAllocationsFound = nodesAndVersions.nodes.isEmpty() == false;


nit: can we be consistent and use allocationsFound > 0?

bleskes · 2015-12-17T11:53:49Z

LGTM. Left some extremely minor comments. No need for another review. Just merge after they are addressed. Thanks @ywelsch ! I can't tell you how happy I am for having this.

Closes elastic#15281

Allocate primary shards based on allocation IDs

#14252 , #7572 , #15900, #12573, #14671, #15281 and #9126 have all been closed/merged and will be part of 5.0.0.

ywelsch added >enhancement :Allocation v5.0.0-alpha1 labels Dec 7, 2015

ywelsch assigned bleskes Dec 7, 2015

ywelsch mentioned this pull request Dec 7, 2015

Allocate primary shard based on allocation IDs #14739

Closed

7 tasks

bleskes reviewed Dec 8, 2015
View reviewed changes

bleskes mentioned this pull request Dec 8, 2015

CreateIndexIT.testCreateAndDeleteIndexConcurrently fails #15312

Closed

bleskes reviewed Dec 14, 2015
View reviewed changes

bleskes reviewed Dec 15, 2015
View reviewed changes

bleskes reviewed Dec 17, 2015
View reviewed changes

ywelsch force-pushed the feature/alloc-ids-primary branch from dc3479a to ae5e9e3 Compare December 17, 2015 14:32

Allocate primary shards based on allocation ids

3a442db

Closes elastic#15281

ywelsch force-pushed the feature/alloc-ids-primary branch from ae5e9e3 to 3a442db Compare December 17, 2015 14:57

ywelsch pushed a commit that referenced this pull request Dec 17, 2015

Merge pull request #15281 from ywelsch/feature/alloc-ids-primary

8f14b10

Allocate primary shards based on allocation IDs

ywelsch merged commit 8f14b10 into elastic:master Dec 17, 2015

ywelsch mentioned this pull request Dec 30, 2015

Index creation context lost #15241

Closed

clintongormley mentioned this pull request Feb 14, 2016

Do not allow stale replicas to automatically be promoted to primary #14671

Closed

bleskes added a commit that referenced this pull request Apr 7, 2016

Update resliency page

557a3d1

#14252 , #7572 , #15900, #12573, #14671, #15281 and #9126 have all been closed/merged and will be part of 5.0.0.

bleskes mentioned this pull request Apr 7, 2016

Update resliency page #17586

Merged

bleskes added a commit that referenced this pull request Apr 7, 2016

Update resiliency page (#17586)

8eee28e

#14252 , #7572 , #15900, #12573, #14671, #15281 and #9126 have all been closed/merged and will be part of 5.0.0.

lcawl added :Distributed/Distributed A catch all label for anything in the Distributed Area. If you aren't sure, use this one. and removed :Allocation labels Feb 13, 2018

clintongormley added :Distributed/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) and removed :Distributed/Distributed A catch all label for anything in the Distributed Area. If you aren't sure, use this one. labels Feb 14, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allocate primary shards based on allocation IDs #15281

Allocate primary shards based on allocation IDs #15281

ywelsch commented Dec 7, 2015

bleskes Dec 8, 2015

ywelsch commented Dec 9, 2015

bleskes Dec 14, 2015

ywelsch Dec 15, 2015

bleskes Dec 15, 2015

bleskes commented Dec 14, 2015

ywelsch commented Dec 15, 2015

bleskes Dec 15, 2015

bleskes commented Dec 15, 2015

ywelsch commented Dec 16, 2015

bleskes Dec 17, 2015

bleskes commented Dec 17, 2015

Allocate primary shards based on allocation IDs #15281

Allocate primary shards based on allocation IDs #15281

Conversation

ywelsch commented Dec 7, 2015

bleskes Dec 8, 2015

Choose a reason for hiding this comment

ywelsch commented Dec 9, 2015

bleskes Dec 14, 2015

Choose a reason for hiding this comment

ywelsch Dec 15, 2015

Choose a reason for hiding this comment

bleskes Dec 15, 2015

Choose a reason for hiding this comment

bleskes commented Dec 14, 2015

ywelsch commented Dec 15, 2015

bleskes Dec 15, 2015

Choose a reason for hiding this comment

bleskes commented Dec 15, 2015

ywelsch commented Dec 16, 2015

bleskes Dec 17, 2015

Choose a reason for hiding this comment

bleskes commented Dec 17, 2015