Simplify delayed shard allocation #18351

ywelsch · 2016-05-14T15:05:21Z

This PR simplifies the delayed shard allocation implementation by assigning clear responsibilities to the various components that are affected by delayed shard allocation:

UnassignedInfo gets a boolean flag delayed which determines whether assignment of the shard should be delayed. The flag gets persisted in the cluster state and is thus available across nodes, i.e. each node knows whether a shard was delayed-unassigned in a specific cluster state. Before, nodes other than the current master were unaware of that information.
This flag is initially set to true if the shard becomes unassigned due to a node leaving and the index setting index.unassigned.node_left.delayed_timeout is strictly positive. From then on, unassigned shards can only transition from delayed to non-delayed, never in the other direction.
The reroute step is in charge of removing the delay marker (by comparing the timestamp at which the node left with the current timestamp).
A dedicated service DelayedAllocationService, reacting to cluster change events, has the responsibility to schedule reroutes to remove the delay marker.
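
The flag lifecycle described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the class DelayMarkerSketch and its methods onNodeLeft, remainingDelayNanos and reroute are hypothetical stand-ins, and the timeout is passed in as a plain nanosecond value instead of being read from index settings.

```java
/** Hypothetical, simplified stand-in for UnassignedInfo; not the real Elasticsearch class. */
final class DelayMarkerSketch {
    final long unassignedTimeNanos; // timestamp at which the node left
    final boolean delayed;          // the delay marker carried in the cluster state

    private DelayMarkerSketch(long unassignedTimeNanos, boolean delayed) {
        this.unassignedTimeNanos = unassignedTimeNanos;
        this.delayed = delayed;
    }

    /** Initial state: delayed only if the configured timeout is strictly positive. */
    static DelayMarkerSketch onNodeLeft(long nowNanos, long delayTimeoutNanos) {
        return new DelayMarkerSketch(nowNanos, delayTimeoutNanos > 0);
    }

    /** Remaining delay; 0 if the shard is not delayed or the delay has expired. */
    long remainingDelayNanos(long nowNanos, long delayTimeoutNanos) {
        if (!delayed) {
            return 0L;
        }
        long elapsed = nowNanos - unassignedTimeNanos;
        return Math.max(0L, delayTimeoutNanos - elapsed);
    }

    /** Reroute step: clears the marker once the delay has expired, never sets it again. */
    DelayMarkerSketch reroute(long nowNanos, long delayTimeoutNanos) {
        if (delayed && remainingDelayNanos(nowNanos, delayTimeoutNanos) == 0L) {
            return new DelayMarkerSketch(unassignedTimeNanos, false);
        }
        return this;
    }
}
```

In this sketch the only transition is delayed to non-delayed, and a scheduling service would use remainingDelayNanos to decide when the next reroute is due.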

Relates to #18293

ywelsch · 2016-05-14T15:05:44Z

@bleskes can you have a look?

bleskes · 2016-05-20T12:44:31Z

core/src/main/java/org/elasticsearch/cluster/routing/RoutingService.java

@@ -59,18 +54,13 @@
    private final AllocationService allocationService;

    private AtomicBoolean rerouting = new AtomicBoolean();
-    private volatile long minDelaySettingAtLastSchedulingNanos = Long.MAX_VALUE;
-    private volatile ScheduledFuture registeredNextDelayFuture;

    @Inject
    public RoutingService(Settings settings, ThreadPool threadPool, ClusterService clusterService, AllocationService allocationService) {
        super(settings);
        this.threadPool = threadPool;


we don't need the thread pool anymore

ywelsch · 2016-05-23T13:53:38Z

@bleskes I've updated the PR. Please have another look.

bleskes · 2016-05-25T12:29:17Z

core/src/main/java/org/elasticsearch/cluster/routing/DelayedAllocationService.java

+            if (earlierRerouteNeeded) {
+                logger.info("scheduling reroute for delayed shards in [{}] ({} delayed shards)", nextDelay,
+                    UnassignedInfo.getNumberOfDelayedUnassigned(state));
+                newTask.schedule();


I'm thinking some more about scheduling first and making it visible second, and I'm having second thoughts on that one. Looking at the code again, why do we need to schedule first? What's the downside of doing it after setting delayedRerouteTask, so we know removeTaskAndCancel/removeIfSameTask work?

ywelsch · 2016-05-25

Yes, we can do it the other way around. We know that close will be run after scheduleIfNeeded, and never the other way around.
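
The ordering agreed on here (make the task visible first, schedule it second) might look roughly like the sketch below. It is an illustration under stated assumptions, not the code in this PR: the class DelayedRerouteSketch, the nested Task, the cancelled flag and the reroutes counter are hypothetical, a plain ScheduledExecutorService stands in for the thread pool, and delays are compared directly instead of comparing remaining time.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.ScheduledFuture;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.concurrent.atomic.AtomicReference;

/** Hypothetical sketch; names mirror the discussion but this is not the Elasticsearch code. */
final class DelayedRerouteSketch {
    private final ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
    private final AtomicReference<Task> delayedRerouteTask = new AtomicReference<>();
    final AtomicInteger reroutes = new AtomicInteger(); // stands in for submitting a reroute

    final class Task implements Runnable {
        final long delayNanos;
        final AtomicBoolean cancelled = new AtomicBoolean();
        volatile ScheduledFuture<?> future;

        Task(long delayNanos) {
            this.delayNanos = delayNanos;
        }

        void schedule() {
            future = scheduler.schedule(this, delayNanos, TimeUnit.NANOSECONDS);
        }

        void cancel() {
            cancelled.set(true); // also covers a cancel that arrives before schedule()
            ScheduledFuture<?> f = future;
            if (f != null) {
                f.cancel(false);
            }
        }

        @Override
        public void run() {
            // Only a task that is still the visible one may trigger the reroute.
            if (!cancelled.get() && removeIfSameTask(this)) {
                reroutes.incrementAndGet();
            }
        }
    }

    void scheduleIfNeeded(long nextDelayNanos) {
        Task existing = delayedRerouteTask.get();
        boolean earlierRerouteNeeded = existing == null || nextDelayNanos < existing.delayNanos;
        if (earlierRerouteNeeded) {
            Task newTask = new Task(nextDelayNanos);
            Task old = delayedRerouteTask.getAndSet(newTask); // make visible first
            if (old != null) {
                old.cancel();
            }
            newTask.schedule(); // schedule second
        }
    }

    boolean removeIfSameTask(Task task) {
        return delayedRerouteTask.compareAndSet(task, null);
    }

    void removeTaskAndCancel() {
        Task task = delayedRerouteTask.getAndSet(null);
        if (task != null) {
            task.cancel();
        }
    }

    Task current() {
        return delayedRerouteTask.get();
    }

    void close() {
        removeTaskAndCancel();
        scheduler.shutdownNow();
    }
}
```

Because the task is published before it is scheduled, removeTaskAndCancel and removeIfSameTask always see the task that is about to run; the cancelled flag handles the window in which a cancel arrives before the future has been assigned.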

bleskes · 2016-05-25T14:30:07Z

Looking good! left comments here and there. Almost all very minor.

ywelsch · 2016-05-25T16:43:02Z

@bleskes thanks for reviewing. I've updated the PR with your suggestions. Please have another look.

bleskes · 2016-05-26T09:13:49Z

LGTM. @ywelsch thanks for the extra iterations

ywelsch added >enhancement :Allocation v5.0.0-alpha3 labels May 14, 2016

bleskes reviewed May 20, 2016

clintongormley added v5.0.0-alpha4 and removed v5.0.0-alpha3 labels May 24, 2016

bleskes reviewed May 25, 2016

ywelsch force-pushed the fix/delayed-shard-allocation branch from c7eeb27 to 2369748 Compare May 26, 2016 10:42

Simplify delayed shard allocation

45e8798

ywelsch force-pushed the fix/delayed-shard-allocation branch from 2369748 to 45e8798 Compare May 26, 2016 10:49

ywelsch merged commit 31b0777 into elastic:master May 26, 2016

lcawl added :Distributed/Distributed and removed :Allocation labels Feb 13, 2018
