Add unthrottled path for replicas in ThrottlingAllocationDecider #138545

DiannaHohensee · 2025-11-24T21:34:54Z

Adds a new cluster setting to allow the ThrottlingAllocationDecider
to bypass replica shard throttling during balancer simulation. Primary
shards already always bypass throttling during simulation so that new
index shards are assigned (and made available) as quickly as possible.
Replicas need the same quick availability in some environments.
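
The intended decision flow can be sketched roughly as follows. This is a simplified, self-contained model and not the actual ThrottlingAllocationDecider code: the `Decision` enum, the `decide` method, and the `unthrottleReplicas` flag are hypothetical stand-ins (the description above does not name the new cluster setting), and the real throttling checks are reduced here to a single boolean.

```java
// Simplified model of the throttling decision. All names are illustrative
// assumptions, not Elasticsearch APIs.
final class ThrottleSketch {
    enum Decision { YES, THROTTLE }

    /**
     * @param simulating          true while the balancer is simulating, false during reconciliation
     * @param primary             true for a primary shard, false for a replica
     * @param unthrottleReplicas  hypothetical stand-in for the new cluster setting
     * @param nodeAtRecoveryLimit true if the target node has no spare recovery capacity
     */
    static Decision decide(boolean simulating, boolean primary, boolean unthrottleReplicas, boolean nodeAtRecoveryLimit) {
        if (simulating) {
            if (primary) {
                // Primaries already bypass throttling during simulation.
                return Decision.YES;
            }
            if (unthrottleReplicas) {
                // New path: replicas bypass throttling too, but only when the setting is enabled.
                return Decision.YES;
            }
        }
        // Reconciliation (and replicas without the setting) still obey throttling.
        return nodeAtRecoveryLimit ? Decision.THROTTLE : Decision.YES;
    }

    public static void main(String[] args) {
        System.out.println(decide(true, false, true, true));  // simulation, replica, setting on
        System.out.println(decide(false, false, true, true)); // reconciliation, replica, setting on
    }
}
```

In this model the setting has no effect outside simulation, which mirrors the statement that reconciliation continues to obey throttling.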

Relates ES-12942

I'm splitting the work for ES-12942 into pieces. Next I'll need to make sure that BalancedShardsAllocator#allocate() can assign all the replicas in one call, and then change the DesiredBalanceComputer's early return logic to pick up new assignments of unassigned replicas.

elasticsearchmachine · 2025-11-24T23:10:39Z

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

nicktindall · 2025-11-25T01:47:47Z

...n/java/org/elasticsearch/cluster/routing/allocation/decider/ThrottlingAllocationDecider.java

+            // During simulation, this supports early publishing DesiredBalance, with all unassigned shards assigned.
+            // Notably, this bypass is only in simulation decisions. Reconciliation will continue to obey throttling, in particular the
+            // requirement to assign a primary before allowing its replicas to begin initializing.
+            return allocation.decision(Decision.YES, NAME, "replica allocation is not throttled when simulating");


~~Is this change possibly redundant since we implemented #134786? If I understand correctly, the ThrottlingAllocationDecider won't ever kick in now that we do a single move per balancing round?~~

~~I thought that ES-12942 was more similar to the change in #115511.~~

Never mind, I see now that we'll need this with the subsequent changes.

nicktindall · 2025-11-25T05:46:45Z

...test/java/org/elasticsearch/cluster/routing/allocation/ThrottlingAllocationDeciderTests.java

+        ShardRouting shardRouting1Primary,
+        ShardRouting shardRouting1Replica,
+        ShardRouting shardRouting2Primary,
+        ShardRouting shardRouting2Replica


Should these have "unassigned" in the name (e.g. unassignedShard1Primary) or similar?

Yep, that'd be clearer, done 👍

nicktindall · 2025-11-25T05:56:20Z

...rk/src/main/java/org/elasticsearch/action/support/replication/ClusterStateCreationUtils.java

+        String[] indices,
+        int numberOfShards,
+        List<ShardRouting.Role> replicaRoles
+    ) {


Should we validate that numberOfShards == replicaRoles.size() ?

I don't think that has to be true, AFAICT. The number of replicaRoles is equal to the number of replicas, which is not limited to the number of shards.
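
To make the distinction concrete, here is a small sketch of how the two parameters might relate under that reading, assuming each shard gets one primary plus one replica per entry in `replicaRoles`. The `Role` enum and both helper methods are hypothetical and only model the arithmetic; they are not part of ClusterStateCreationUtils.

```java
import java.util.List;

// Illustrative arithmetic only; names are assumptions, not Elasticsearch APIs.
final class ReplicaRolesSketch {
    // Hypothetical stand-in for ShardRouting.Role.
    enum Role { DEFAULT, SEARCH_ONLY }

    // One primary plus one replica for each entry in replicaRoles.
    static int copiesPerShard(List<Role> replicaRoles) {
        return 1 + replicaRoles.size();
    }

    // numberOfShards and replicaRoles.size() vary independently.
    static int totalShardCopies(int numberOfShards, List<Role> replicaRoles) {
        return numberOfShards * copiesPerShard(replicaRoles);
    }

    public static void main(String[] args) {
        // Two shards with three replicas each: the sizes need not match.
        List<Role> roles = List.of(Role.DEFAULT, Role.DEFAULT, Role.SEARCH_ONLY);
        System.out.println(totalShardCopies(2, roles));
    }
}
```

Under this assumption, a check such as `numberOfShards == replicaRoles.size()` would reject valid inputs like two shards with three replicas each.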

nicktindall · 2025-11-25T05:58:01Z

...rk/src/main/java/org/elasticsearch/action/support/replication/ClusterStateCreationUtils.java

+            } else if (i == numberOfDataNodes) {
+                discoBuilder.masterNodeId(node.getId());
+            }
+        }


If I'm reading it right, the actual number of data nodes ends up being numberOfDataNodes + 1, which seems counter-intuitive. Could we change numberOfDataNodes to be the actual number of data nodes?

Yeah, this is pretty convoluted. I have never found a reason for it in the other helpers. Perhaps it served some historical testing need that no longer exists.

Fixing 👍
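
The off-by-one is easy to reproduce in isolation. The sketch below assumes the helper's loop ran from 0 through numberOfDataNodes inclusive, which is what the `i == numberOfDataNodes` branch in the quoted diff suggests. The loop shape, the method names, and the node ID format are all assumptions for illustration, not the actual helper or the fix that was applied.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: models the suspected loop shape, not the real ClusterStateCreationUtils code.
final class NodeCountSketch {
    // Assumed original shape: the inclusive upper bound creates numberOfDataNodes + 1 nodes.
    static List<String> buildNodeIdsInclusive(int numberOfDataNodes) {
        List<String> ids = new ArrayList<>();
        for (int i = 0; i <= numberOfDataNodes; i++) {
            ids.add("node_" + i);
        }
        return ids;
    }

    // One possible correction: an exclusive bound creates exactly numberOfDataNodes nodes.
    static List<String> buildNodeIdsExact(int numberOfDataNodes) {
        List<String> ids = new ArrayList<>();
        for (int i = 0; i < numberOfDataNodes; i++) {
            ids.add("node_" + i);
        }
        return ids;
    }

    public static void main(String[] args) {
        System.out.println(buildNodeIdsInclusive(3).size()); // one more than requested
        System.out.println(buildNodeIdsExact(3).size());     // exactly as requested
    }
}
```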

nicktindall · 2025-11-25T06:21:23Z

...test/java/org/elasticsearch/cluster/routing/allocation/ThrottlingAllocationDeciderTests.java

+        ShardRouting shardRouting1Primary = TestShardRouting.newShardRouting(testShardId1, null, null, true, ShardRoutingState.UNASSIGNED);
+        ShardRouting shardRouting2Primary = TestShardRouting.newShardRouting(testShardId2, null, null, true, ShardRoutingState.UNASSIGNED);
+        ShardRouting shardRouting1Replica = TestShardRouting.newShardRouting(testShardId1, null, null, false, ShardRoutingState.UNASSIGNED);
+        ShardRouting shardRouting2Replica = TestShardRouting.newShardRouting(testShardId2, null, null, false, ShardRoutingState.UNASSIGNED);


Maybe it's not important, but I think we should be able to pull these out of the routing table with

    RoutingTable routingTable = clusterState.routingTable(ProjectId.DEFAULT);
    routingTable.shardRoutingTable(shardId).primaryShard();
    routingTable.shardRoutingTable(shardId).replicaShards().get(0);

Then perhaps we can avoid the change to TestShardRouting?

Yeah, routing table is probably better. I just copy-pasted.

The TestShardRouting change is just label tidying; other callers provide null.

nicktindall

LGTM with some minor queries

DiannaHohensee requested a review from a team as a code owner November 24, 2025 21:34

elasticsearchmachine added the v9.3.0 and needs:triage (Requires assignment of a team area label) labels Nov 24, 2025

DiannaHohensee force-pushed the 2025/11/21/throttle-decider-unthrottles-replica branch from aef7911 to bd3d442 Compare November 24, 2025 21:35

DiannaHohensee removed the request for review from a team November 24, 2025 21:39

DiannaHohensee force-pushed the 2025/11/21/throttle-decider-unthrottles-replica branch from bd3d442 to 4ade9fa Compare November 24, 2025 21:57

Add unthrottled path for replicas in ThrottlingAllocationDecider

4b5ef05

DiannaHohensee force-pushed the 2025/11/21/throttle-decider-unthrottles-replica branch from 4ade9fa to 4b5ef05 Compare November 24, 2025 21:58

DiannaHohensee requested a review from nicktindall November 24, 2025 23:13

nicktindall reviewed Nov 25, 2025

nicktindall approved these changes Nov 25, 2025

DiannaHohensee added 3 commits November 25, 2025 10:20

Merge branch 'main' into 2025/11/21/throttle-decider-unthrottles-replica

405d4d9

ShardRouting: update variable names; fetch from routing table, instead of create

5548322

improve test cluster state util method

15fdf50

DiannaHohensee added the auto-merge-without-approval label (Automatically merge pull request when CI checks pass; NB doesn't wait for reviews!) Nov 25, 2025

Merge branch 'main' into 2025/11/21/throttle-decider-unthrottles-replica

98fdf85

elasticsearchmachine merged commit ee72be0 into elastic:main Dec 1, 2025
34 checks passed

DiannaHohensee deleted the 2025/11/21/throttle-decider-unthrottles-replica branch December 1, 2025 19:18
