Make upgrade wait time between batches configurable #217

alex-dabija · 2020-10-16T10:45:06Z

User Story

As a customer, I want to configure the wait time between rolling nodes batches during a tenant cluster upgrade.

Background

The current wait time between rolling nodes batches is hard-coded to 15 minutes. In some situations, nodes with lots of pods or workloads which take a long time to start, this mount of time is not enough to restore the system to a stable state before starting the next batch.

Requirements

must be optional feature defined per tenant cluster in first release (ex: configured via annotation Cluster CR).

calvix · 2020-10-28T09:45:51Z

PauseTime - annotation aws.giantswarm.io/update-pause-time

the value should be ISO 8601 duration format http://en.wikipedia.org/wiki/ISO_8601#Durations , which is the same AWS CF wants

I want to have validation in the admission controller to inform/block users about using the wrong value but I would like to have a safeguard validation in aws-operator as well in case something fails, rather than have crashlooping operator. If the aws-operator sees an invalid value then it would simply use the default one.

annotation would be either on cluster CR or machine deployment CR, machine deployment value would override any cluster value

@giantswarm/team-firecracker-engineers @giantswarm/sig-ux please raise any concerns/suggestion

paurosello · 2020-10-29T07:51:23Z

do we want to use the alpha.... annotation for this features?

njuettner · 2020-10-29T08:40:49Z

do we want to use the alpha.... annotation for this features?

Plus one, we should be good citizen and start using alpha/beta/stable more often indicating this is a feature which evolves over time and to clarify the expectation of such features.

alex-dabija · 2020-10-29T09:16:32Z

I'm fine if we version the annotations or if we don't, because I do see them more as a temporary mechanism to enable a new feature.

If they are versioned we would have to make sure that newer versions of the aws-operator consider all previous versions of the annotation when it tries to get the required information. This might complicate the implementation a bit.

paurosello · 2020-10-29T09:26:00Z

As long as we do the same with all annotations we can manage it easily I think.

I like having the "alpha" in there so customers know this is a new feature and that they should be careful with it

calvix · 2020-11-05T10:18:47Z

functionality merged into AWS operator and validation in admission-controller, only docs are remaining

calvix · 2020-11-19T10:51:25Z

released as part of giantswarm/releases#512

alex-dabija added team/firecracker area/kaas Mission: Cloud Native Platform - Self-driving Kubernetes as a Service kind/story labels Oct 16, 2020

alex-dabija added target-release/12.6.0 provider/aws Related to cloud provider Amazon AWS labels Oct 26, 2020

alex-dabija assigned calvix Oct 26, 2020

This was referenced Oct 27, 2020

aws-annotations-for-update-control giantswarm/apiextensions#599

Merged

vaclav-configure-aws-cf-update-batch-params giantswarm/aws-operator#2828

Merged

pipo02mix mentioned this issue Oct 30, 2020

Upgrades (more graceful, more flexible, less aggressive, more robust) #15

Closed

12 tasks

This was referenced Nov 2, 2020

add-annotation-validation-for-awsmd giantswarm/aws-admission-controller#112

Merged

add-annotation-validation-for-awcluster giantswarm/aws-admission-controller#113

Merged

pipo02mix added this to In Progress ( 1-3 months ) in Giant Swarm Roadmap (Deprecated) Nov 4, 2020

pipo02mix moved this from In Progress ( 1-3 months ) to Ready Soon ( <4 weeks ) in Giant Swarm Roadmap (Deprecated) Nov 4, 2020

alex-dabija added this to the 12.x.x AWS milestone Nov 5, 2020

alex-dabija added target-release/12.7.0 and removed target-release/12.6.0 labels Nov 5, 2020

calvix mentioned this issue Nov 5, 2020

add-fine-tuning-upgrade-distruption-on-aws-guide giantswarm/docs#615

Merged

calvix closed this as completed Nov 19, 2020

Giant Swarm Roadmap (Deprecated) automation moved this from Ready Soon ( <4 weeks ) to Released Nov 19, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make upgrade wait time between batches configurable #217

Make upgrade wait time between batches configurable #217

alex-dabija commented Oct 16, 2020 •

edited

calvix commented Oct 28, 2020

paurosello commented Oct 29, 2020

njuettner commented Oct 29, 2020

alex-dabija commented Oct 29, 2020

paurosello commented Oct 29, 2020

calvix commented Nov 5, 2020

calvix commented Nov 19, 2020

Make upgrade wait time between batches configurable #217

Make upgrade wait time between batches configurable #217

Comments

alex-dabija commented Oct 16, 2020 • edited

User Story

Background

Requirements

calvix commented Oct 28, 2020

paurosello commented Oct 29, 2020

njuettner commented Oct 29, 2020

alex-dabija commented Oct 29, 2020

paurosello commented Oct 29, 2020

calvix commented Nov 5, 2020

calvix commented Nov 19, 2020

alex-dabija commented Oct 16, 2020 •

edited