Update Hash ring only with the in-ready-status replicas of Statefulset #70

spaparaju · 2021-03-22T04:49:03Z

Currently, the hash ring is updated based on the .spec of the StatefulSet being watched. There are a few edge cases where the hash ring is updated prematurely (e.g. replicas take some time to come up) or with incorrect replicas when scaling the StatefulSet does not succeed (e.g. not enough resources on the cluster). The downside of this behaviour is that requests to StatefulSets like Thanos-default-receive result in HTTP 500s: temporary when the scale-up eventually succeeds, permanent when it does not.

This fix updates the hash ring only with the replicas of the StatefulSet that are in 'Ready' status.

spaparaju · 2021-03-22T07:23:20Z

As this PR updates the hash ring based on replicas in the 'Ready' state, the tests are understandably failing, since the current tests run against a 'fake cluster'.

kakkoyun

I think we should retrofit the tests to support this case.

Also, what happens during a rollout? The number of ready replicas changes constantly; do we react to that, or do we only react to the configmap changes?

bwplotka · 2021-03-23T14:22:34Z

And let's think about the typical scenario of a single node going down for a while (restart, crash). We might not want to retrigger a whole hashring update and cause cascading errors 🤗

Maybe there is something in between we could do?

bwplotka · 2021-03-23T14:22:41Z

cc @brancz

metalmatze · 2021-03-23T15:52:56Z

As part of an effort to make auto scaling more reliable I happened to work on the same stuff. PR incoming.

metalmatze · 2021-03-23T15:54:19Z

I've commented on that topic on the CNCF Slack in #thanos-dev. To avoid repeating the findings, please allow me to simply link you there: https://cloud-native.slack.com/archives/CL25937SP/p1616512341031300

metalmatze · 2021-03-23T16:09:19Z

I've opened my attempt at solving the problem in #71. Seems like we've accidentally worked on this at the same time. Sorry.

spaparaju · 2021-04-21T09:42:50Z

Closing this PR in favor of #75, which addresses the scenario where the hashring should contain the endpoints of replicas in ready status even if scaling up the StatefulSet does not reach the intended number of replicas.

Update Hashring only with the in-ready-status replicas of statefulset

077bad1

spaparaju mentioned this pull request Mar 23, 2021

receive + receive controller: Eliminate downtime when scaling up/down hashring replicas. #69

Open

kakkoyun requested changes Mar 23, 2021

Merge remote-tracking branch 'up/master' into reconcile-error

b119332

spaparaju closed this Apr 21, 2021
