Cluster sharding state warmup with health check #27168

chbatey · 2019-06-18T10:39:58Z

When a new node is added to a cluster that uses cluster sharding and user requests are routed to it right away, each of them incurs a latency penalty while the ShardRegion gets the Shard location from the ShardCoordinator. This applies to all Shards, not just the ones that will be moved.

The ShardCoordinator could instead send the current regions per shardId to the ShardRegion when it registers and a health check could be exposed (https://doc.akka.io/docs/akka-management/current/healthchecks.html) to prevent user traffic being routed to the node in environments like Kubernetes until this has happened.
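A minimal sketch of the idea above, in plain Java with no Akka dependency. `WarmShardLocations` and `ShardingWarmupCheck` are hypothetical names used for illustration, not Akka classes. The check takes the shape `Supplier<CompletionStage<Boolean>>`, which is how Akka Management health checks are written for the Java API, and it stays false until the assumed location snapshot from the coordinator has been applied.

```java
import java.util.Map;
import java.util.Optional;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CompletionStage;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.function.Supplier;

// Hypothetical stand-in for the shard-location state a region keeps (not Akka's real class).
final class WarmShardLocations {
    private final Map<String, String> regionByShardId = new ConcurrentHashMap<>();
    private final AtomicBoolean warmedUp = new AtomicBoolean(false);

    // Assumption: the coordinator pushes shardId -> region after the region registers.
    void onSnapshot(Map<String, String> snapshot) {
        regionByShardId.putAll(snapshot);
        warmedUp.set(true);
    }

    // Local lookup; empty means the coordinator would still have to be asked.
    Optional<String> regionFor(String shardId) {
        return Optional.ofNullable(regionByShardId.get(shardId));
    }

    boolean isWarmedUp() {
        return warmedUp.get();
    }
}

// Readiness check that reports false until the snapshot has been applied.
final class ShardingWarmupCheck implements Supplier<CompletionStage<Boolean>> {
    private final WarmShardLocations locations;

    ShardingWarmupCheck(WarmShardLocations locations) {
        this.locations = locations;
    }

    @Override
    public CompletionStage<Boolean> get() {
        return CompletableFuture.completedFuture(locations.isWarmedUp());
    }
}
```

With a check like this wired into the readiness probe, Kubernetes would keep the pod out of the service endpoints until the locations are known locally.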

This might help #30315.

/cc @jroper


jroper · 2019-06-19T00:47:10Z

I think this would be really helpful in elastic environments, thanks for raising!

milanvdm · 2019-10-14T19:20:02Z

Is there any progress on this? Since we deploy on ECS (which shuts down the oldest containers first), we have a lot of movement of our ShardCoordinator during a redeploy.
We would like to find a way to know when to route requests to the new nodes.

patriknw · 2020-09-11T11:13:42Z

One way would be to reply with all shard locations when the region registers with the coordinator. I think it's better to add a separate message for this. Then it could maybe be used in other situations too, like #29589.

There could be many shards, so the locations might not fit in a single message, but we can split them into several smaller messages.
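The splitting could look roughly like the sketch below, in plain Java with no Akka dependency. `ShardLocationBatcher` and the `maxPerMessage` limit are hypothetical names for illustration; how a real implementation sizes or serializes the messages is not specified in this thread.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper: splits shardId -> region locations into bounded batches,
// so that each batch can travel as its own message.
final class ShardLocationBatcher {
    private ShardLocationBatcher() {}

    static List<Map<String, String>> split(Map<String, String> regionByShardId, int maxPerMessage) {
        if (maxPerMessage <= 0) {
            throw new IllegalArgumentException("maxPerMessage must be positive");
        }
        List<Map<String, String>> batches = new ArrayList<>();
        Map<String, String> current = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : regionByShardId.entrySet()) {
            current.put(entry.getKey(), entry.getValue());
            // Close the batch once it reaches the limit and start a new one.
            if (current.size() == maxPerMessage) {
                batches.add(current);
                current = new LinkedHashMap<>();
            }
        }
        // Keep the final, partially filled batch if there is one.
        if (!current.isEmpty()) {
            batches.add(current);
        }
        return batches;
    }
}
```

The receiving region can apply each batch independently, since every entry is a self-contained shardId-to-region mapping.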

patriknw · 2020-09-25T11:00:25Z

The health check was added in #29638


chbatey added the t:cluster and discuss labels Jun 18, 2019

patriknw added t:cluster-sharding and removed t:cluster-sharding labels Jul 9, 2019

patriknw mentioned this issue Sep 11, 2020

Relax shard hand-off for region proxies in other DC #29589

Open

patriknw added the 1 - triaged label and removed the discuss label Sep 11, 2020

johanandren self-assigned this Nov 29, 2021

johanandren added the 3 - in progress label and removed the 1 - triaged label Nov 29, 2021

johanandren added a commit to johanandren/akka that referenced this issue Nov 29, 2021

Quick-push all allocated shards to region/proxy when registering akka…

a404334

…#27168 This should shorten latencies per new shard that is addressed from a newly joined region/proxy

johanandren mentioned this issue Nov 29, 2021

Quick-push all allocated shards to region/proxy when registering #30950

Merged

johanandren added a commit that referenced this issue Dec 3, 2021

Quick-push all allocated shards to region/proxy when registering #27168

9c0e624

This should shorten latencies per new shard that is addressed from a newly joined region/proxy

johanandren added this to the 2.6.18 milestone Dec 3, 2021

johanandren closed this as completed Dec 3, 2021
