Auto-restart scheduler in shoot maintenance #2756

rfranzke · 2020-08-19T13:48:32Z

How to categorize this PR?

/area ops-productivity robustness
/kind enhancement
/priority normal

What this PR does / why we need it:
The kube-scheduler is now auto-restarted in the shoot maintenance time window, similar to other controllers.

Which issue(s) this PR fixes:
Fixes #2722

Special notes for your reviewer:
As #2731 is still in discussion, I'm making this change now separately and rebase the PR later if necessary.

Release note:

The `kube-scheduler` is now auto-restarted in the shoot maintenance time window, similar to other controllers.

rfranzke · 2020-08-19T13:48:40Z

/invite @timuthy

timuthy

/lgtm

ialidzhikov · 2020-08-19T20:40:16Z

We initially stared with restart for cloud-controller-manager, then we did the same for kcm and mcm, now we do it also the kube-scheduler in this PR. This sounds like we bury/workaround issues? Are there occurrences of kube-scheduler Pod which "hangs" for some reason? If yes, shouldn't we rather try to understand the root cause?

rfranzke · 2020-08-20T04:11:38Z

@ialidzhikov I don't think there are clear steps to reproduce the problems that we see (luckily very rarely). If you can help here it's highly appreciated to tackle the root cause in the first place, sure, but until then this one-time auto-restart in the maintenance time window is a simply thing to improve the ops experience.

Auto-restart scheduler in shoot maintenance

9a0265a

rfranzke requested a review from a team as a code owner August 19, 2020 13:48

gardener-robot added area/ops-productivity Operator productivity related (how to improve operations) area/robustness Robustness, reliability, resilience related kind/enhancement Enhancement, improvement, extension priority/normal labels Aug 19, 2020

gardener-robot-ci-1 added the reviewed/ok-to-test label Aug 19, 2020

gardener-robot requested a review from timuthy August 19, 2020 13:48

gardener-robot added the needs/review label Aug 19, 2020

timuthy approved these changes Aug 19, 2020

View reviewed changes

gardener-robot added reviewed/lgtm and removed needs/review labels Aug 19, 2020

gardener-robot-ci-2 added needs/ok-to-test and removed reviewed/ok-to-test labels Aug 19, 2020

danielfoehrKn approved these changes Aug 19, 2020

View reviewed changes

rfranzke merged commit 89610c1 into gardener:master Aug 20, 2020

rfranzke deleted the feature/auto-restart-scheduler branch August 20, 2020 04:11

gardener-robot added priority/3 Priority (lower number equals higher priority) and removed priority/3 Priority (lower number equals higher priority) labels Mar 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auto-restart scheduler in shoot maintenance #2756

Auto-restart scheduler in shoot maintenance #2756

rfranzke commented Aug 19, 2020

rfranzke commented Aug 19, 2020

timuthy left a comment

ialidzhikov commented Aug 19, 2020

rfranzke commented Aug 20, 2020

Auto-restart scheduler in shoot maintenance #2756

Auto-restart scheduler in shoot maintenance #2756

Conversation

rfranzke commented Aug 19, 2020

rfranzke commented Aug 19, 2020

timuthy left a comment

Choose a reason for hiding this comment

ialidzhikov commented Aug 19, 2020

rfranzke commented Aug 20, 2020