
clusterctl rollout #3439

Open
6 of 9 tasks
Arvinderpal opened this issue Aug 3, 2020 · 27 comments
Labels
area/clusterctl Issues or PRs related to clusterctl
kind/feature Categorizes issue or PR as related to a new feature.
needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one.
priority/backlog Higher priority than priority/awaiting-more-evidence.

Comments

@Arvinderpal
Contributor

Arvinderpal commented Aug 3, 2020

As an operator I would like a convenient and consistent mechanism through which I can rollout updates to my control-plane and worker nodes.

As an operator I would like to inspect a rollout as it occurs, roll back changes if needed, and view the rollout history.

Detailed Description

Motivated by kubectl rollout.

The idea is to create a new clusterctl sub-command: clusterctl rollout.

Issue/PR Tracker:

Related:
Issue #3401
Issue #3203

/kind feature

@k8s-ci-robot k8s-ci-robot added the kind/feature Categorizes issue or PR as related to a new feature. label Aug 3, 2020
@vincepri
Member

vincepri commented Aug 3, 2020

+1 this feature makes sense, we might need a small RFE/proposal

@Arvinderpal
Contributor Author

Common usage patterns may include:

1. Immediate Rollouts:
clusterctl rollout machinedeployment/my-cluster-md-0
clusterctl rollout kubeadmcontrolplane/my-cluster-control-plane

2. Rollout based on a specific infra machine template. For example, modify the existing MachineDeployment to reference the new infra (e.g. docker) machine template resource. It's assumed that the user has created my-cluster-md-0-rev-1 beforehand (see the sketch after this list):
clusterctl rollout machinedeployment/my-cluster-md-0 --template dockermachinetemplate/my-cluster-md-0-rev-1

3. Monitor status:
clusterctl rollout status machinedeployment/my-cluster-md-0
clusterctl rollout status kubeadmcontrolplane/my-cluster-control-plane

4. Rollback to the previous deployment or a specific revision:
clusterctl rollout undo machinedeployment/my-cluster-md-0
clusterctl rollout undo machinedeployment/my-cluster-md-0 --to-revision=2

5. History:
clusterctl rollout history machinedeployment/my-cluster-md-0
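
For pattern 2, the proposed --template flag would presumably boil down to re-pointing the MachineDeployment's infrastructureRef at the new template. A minimal sketch of the equivalent manual step with plain kubectl, assuming my-cluster-md-0-rev-1 already exists:

# Point the MachineDeployment at the pre-created template revision; the
# MachineDeployment controller then rolls out replacement Machines according
# to spec.strategy (RollingUpdate by default).
kubectl patch machinedeployment my-cluster-md-0 --type merge -p \
  '{"spec":{"template":{"spec":{"infrastructureRef":{"name":"my-cluster-md-0-rev-1"}}}}}'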

@Arvinderpal
Contributor Author

> +1 this feature makes sense, we might need a small RFE/proposal

More than happy to put together a proposal and a POC if we agree that this is the right way to go about this.

@vincepri
Member

vincepri commented Aug 3, 2020

cc @wfernandes @fabriziopandini

/milestone v0.4.0

@k8s-ci-robot k8s-ci-robot added this to the v0.4.0 milestone Aug 3, 2020
@detiber
Member

detiber commented Aug 3, 2020

+1 from me to the high-level approach as a near-term solution to the problem. It might also make sense to propose support in upstream Kubernetes/kubectl/kubebuilder for a subresource-type interface, so that we could eventually have direct support in kubectl, similar to what we have with the scale subresource today.
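
For comparison, the scale subresource already gives kubectl native handling of Cluster API resources today; a rollout subresource could work the same way. For example:

# Works today because MachineDeployment exposes the scale subresource; no
# CAPI-specific logic is needed in kubectl.
kubectl scale machinedeployment my-cluster-md-0 --replicas=5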

@fabriziopandini
Member

I'm ok with the proposal, but I agree with @detiber that the long-term solution is to make this work in kubectl.

@Arvinderpal
Contributor Author

I added a link to the proposal. PTAL

@Arvinderpal
Contributor Author

I'm going to start implementing a PoC -- focusing just on MachineDeployments for now.
I wanted to ask: are people okay with having a top-level command like clusterctl rollout, or would you prefer something else like (i) clusterctl experimental rollout, (ii) clusterctl workload-cluster rollout, (iii) ...?

@fabriziopandini @wfernandes

@vincepri
Member

clusterctl alpha <>? So we can follow the alpha phases we have in other tools

@fabriziopandini
Member

/area clusterctl

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 4, 2021
@fabriziopandini
Member

/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jul 5, 2021
@Arvinderpal
Contributor Author

The remaining MD commands -- status and history -- depend on conditions in MD. Here is the tracker for that: #3486

@vincepri
Member

/milestone v1.0

@k8s-ci-robot k8s-ci-robot modified the milestones: v0.4, v1.0 Oct 19, 2021
@chrischdi
Member

Just because I did some research, here is some context to consider for an implementation of clusterctl alpha rollout undo for KCP: if it gets implemented, it should take care not to allow downgrades of ControlPlane nodes, which could break a cluster.

  • We should always take https://kubernetes.io/releases/version-skew-policy/ into account, which from a ControlPlane perspective more or less means that no MachineDeployments or MachinePools in the cluster should run a kubelet at a minor version newer than the ControlPlane Kubernetes version we would downgrade to (see the sketch below).
  • Also from an etcd perspective: according to the 3.3 -> 3.4 and 3.4 -> 3.5 upgrade docs, downgrading an etcd minor version is not allowed once a cluster has been fully upgraded to that minor version.
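
A pre-flight check along these lines could catch the kubelet-skew case before an undo (a hypothetical sketch; TARGET_VERSION stands in for the control-plane version the undo would restore, and the jsonpath follows the v1beta1 API types):

# List each MachineDeployment with the Kubernetes version its Machines run,
# to compare against the version the control plane would be rolled back to.
TARGET_VERSION="v1.24.0"  # hypothetical undo target
kubectl get machinedeployments -A -o jsonpath='{range .items[*]}{.metadata.namespace}{"/"}{.metadata.name}{" "}{.spec.template.spec.version}{"\n"}{end}'
# Abort the undo if any listed minor version is newer than $TARGET_VERSION
# (e.g. compare with sort -V).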

Some more context from upstream discussions about downgrades is available at:

(which got closed as rotten, not resolved).

@fabriziopandini fabriziopandini added the triage/accepted Indicates an issue or PR is ready to be actively worked on. label Jul 29, 2022
@fabriziopandini fabriziopandini removed this from the v1.2 milestone Jul 29, 2022
@fabriziopandini fabriziopandini removed the triage/accepted Indicates an issue or PR is ready to be actively worked on. label Jul 29, 2022
@fabriziopandini
Member

/triage accepted

@k8s-ci-robot k8s-ci-robot added the triage/accepted Indicates an issue or PR is ready to be actively worked on. label Aug 5, 2022
@hiromu-a5a
Contributor

/assign

@killianmuldoon
Contributor

@hiromu-a5a Good to see somebody picking up this work! I just wanted to mention that some parts of this - if they involve changes to the MachineDeployment controller - might overlap with work ongoing in #7730

I think it might be a good idea to sync on those parts of the work to ensure stability on main (and have fewer rebases 😄 ).

Thanks again for picking this up though! I think the pieces that impact clusterctl (like #7988) should have no / few clashes with the MD work.

@hiromu-a5a
Contributor

hiromu-a5a commented Feb 15, 2023

While trying the existing rollout undo command, I felt that a rollout could easily and accidentally violate the version skew policy. I'd like to suggest emitting a warning if a user's operation breaks the version skew policy. What do you think?
If you agree, I'll open another issue.

@fabriziopandini
Member

I'm +1 to opening a discussion on how to prevent undo operations that can lead to issues.

@hiromu-a5a
Contributor

Posted the discussion: #8170.

@hiromu-a5a
Contributor

I couldn't find any responses to #8170.
Please let me know if there is a more appropriate forum for discussion.
(If you meant something different, such as raising it at office hours, I apologize.)

@fabriziopandini
Member

@hiromu-a5a I'm not sure I understand why this topic is not gaining traction after call-outs at office hours.
My only guess is that very few users rely on this feature, which somehow matches the fact that no one reported the other issues we found while working on label propagation (off the top of my head: history was not tracking in-place changes, clusterctl rollout was not considering all the versions a MachineSet might have, and probably more).

What I can suggest at this stage is to continue collecting ideas on this issue, or to take the initiative in defining what should be improved in this feature and how.

@hiromu-a5a
Contributor

Thank you for your feedback.

To take the initiative, I've opened an issue for now: #8408. I think this should be discussed in a separate issue rather than as a sub-topic of this one.

@k8s-triage-robot

This issue has not been updated in over 1 year, and should be re-triaged.

You can:

  • Confirm that this issue is still relevant with /triage accepted (org members only)
  • Close this issue with /close

For more details on the triage process, see https://www.kubernetes.dev/docs/guide/issue-triage/

/remove-triage accepted

@k8s-ci-robot k8s-ci-robot added needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. and removed triage/accepted Indicates an issue or PR is ready to be actively worked on. labels Mar 27, 2024
@fabriziopandini
Member

/priority backlog

@k8s-ci-robot k8s-ci-robot added the priority/backlog Higher priority than priority/awaiting-more-evidence. label Apr 11, 2024
@fabriziopandini
Member

/unassign @hiromu-a5a

My personal understanding of this feature is that it is becoming less and less relevant, considering GitOps, ClusterClass, the lack of requests/queries/feedback from the community, etc.

Considering that, the fact that we never completed this feature, that there are pending issues, and that we carry maintenance costs related to it, I think that as a project we should ask ourselves whether it is time to deprecate and remove it.

/remove-lifecycle frozen

@k8s-ci-robot k8s-ci-robot removed the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label Apr 22, 2024