New issue

Jump to bottom

KEP-5901: Add Kubectl Checkpoint KEP #5092

Open

adrianreber wants to merge 1 commit into kubernetes:master from adrianreber:2025-01-27-kubectl-checkpoint

Contributor

adrianreber commented Jan 27, 2025

With "Forensic Container Checkpointing" being Beta and discussions around graduating it to GA, the next step would be kubectl integration of the container checkpointing functionality.

In addition to the "Forensic Container Checkpointing" use case this KEP lists multiple use cases how checkpointing containers can be used.

One of the main motivations for this KEP is to make it easier for users to checkpoint containers, independent of the reason. Having it available via kubectl reduces the complexity of connecting to the node and accessing the kubectl checkpoint API endpoint.

One-line PR description: adding new KEP

Issue link: Kubectl Checkpoint #5091

k8s-ci-robot added cncf-cla: yes kind/kep sig/cli labels

k8s-ci-robot requested review from ardaguclu and mpuckett159

January 27, 2025 17:42

Contributor

k8s-ci-robot commented Jan 27, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: adrianreber
Once this PR has been reviewed and has the lgtm label, please assign ardaguclu for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

keps/sig-cli/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot added the needs-ok-to-test label

Contributor

k8s-ci-robot commented Jan 27, 2025

Hi @adrianreber. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

k8s-ci-robot added the size/XXL label

adrianreber mentioned this pull request

Kubectl Checkpoint #5091

Open

6 tasks

aojea reviewed

View reviewed changes

keps/sig-cli/5091-kubectl-checkpoint/README.md

Comment on lines +195 to +198

+              Beta in Kubernetes 1.30, which means that the corresponding feature gate
+              defaults to the feature being enabled, the next step would be to extend the
+              existing checkpointing functionality from the *kubelet* to *kubectl* for easier
+              user consumption. The main motivation is to make it easier by not requiring

Member

aojea Feb 19, 2025

I'm not familiar with the criteria followed for kubectl commands, but should not wait first for the feature to be GA so it is available? or is the plan to GA KEP-2008 and add the kubectl command at the same time?

Contributor Author

adrianreber Mar 10, 2025

I also don't know. Not sure. But as there is a possibility to add plugins for kubectl or alpha commands I thought it is possible to expose non GA features on the api server level. But I don't know.

aojea reviewed

View reviewed changes

keps/sig-cli/5091-kubectl-checkpoint/README.md

+              Currently the design details are based on the existing pull request: [Add
+              'checkpoint' command to kubectl][pr120898]
+              The API server is extended to handle checkpoint requests from *kubectl*:

Member

aojea Feb 19, 2025

This is the most important change , adding a new endpoint to the apiserver is where you need to expand, Jordan also commented here kubernetes/kubernetes#120898 (comment) along those lines

Contributor Author

adrianreber Mar 10, 2025

I am sorry. But what is necessary. The comment you linked to was talking about the kubelet KEP which didn't had any reference to kubectl or the to the API server. This here is created to address the mentioned comment.

aojea reviewed

View reviewed changes

keps/sig-cli/5091-kubectl-checkpoint/README.md

+              for the initialization to finish. The startup time is reduced to the time
+              necessary to read back all memory pages to their previous location.
+              This feature is already used in production to decrease startup time of

Member

aojea Feb 19, 2025

claims should have links to references

Contributor Author

adrianreber Mar 10, 2025

Added

aojea reviewed

View reviewed changes

keps/sig-cli/5091-kubectl-checkpoint/README.md

+              This feature is already used in production to decrease startup time of
+              containers.
+              Another similar use case for quicker starting containers has been reported in

Member

aojea Feb 19, 2025

reference?

Contributor Author

adrianreber Mar 10, 2025

Added

aojea reviewed

View reviewed changes

keps/sig-cli/5091-kubectl-checkpoint/README.md Outdated

+              #### Optimize Resource Utilization
+              This use case is motivated by interactive long running containers. One very
+              common problem with things like Jupyter notebooks or remote development

Member

aojea Feb 19, 2025

naive questions, is this state not stored in a database or some persistent storage and recovered when it reconnects? at the end you'll have to keep all the state stored somewhere

aojea reviewed

View reviewed changes

keps/sig-cli/5091-kubectl-checkpoint/README.md


		#### Container Migration

		One of the main use cases for checkpointing and restoring containers is

Member

aojea Feb 19, 2025

migration between nodes? the IPs are most likely to be lost so the application has to be agnostic of the IP per example

Contributor Author

adrianreber Mar 10, 2025

Added a paragraph concerning migration of TCP connections.

aojea reviewed

View reviewed changes

keps/sig-cli/5091-kubectl-checkpoint/README.md Outdated

+              migrate containers or processes. It is a well researched topic especially
+              in the field of high performance computing (HPC). To avoid loss of work
+              already done by a container the container is migrated to another node before
+              the current node crashes. There are many scientific papers describing how

Member

aojea Feb 19, 2025

one or two references to this papers will be nice

Contributor Author

adrianreber Mar 10, 2025

Added.

aojea reviewed

View reviewed changes

keps/sig-cli/5091-kubectl-checkpoint/README.md

+              case, only useful for stateful containers.
+              With GPUs becoming a costly commodity, there is an opportunity to help
+              users save on costs by leveraging container checkpointing to prevent

Member

aojea Feb 19, 2025

Workloads are already doing checkpointing , do you know what is the state of the art of existing checkpointing mechanisms vs container checkpointing?

Contributor Author

adrianreber Mar 10, 2025

That is the hard question for the last 25 years. What is better. Application level checkpointing or system level checkpointing. Both approaches have their advantages and drawbacks. As it is unlikely that every application will have application level checkpointing some workloads can only be migrated with system level checkpointing.

Currently there are multiple startups and scientific researchers trying to solve how to better use GPU resources. All of the one I have been following are betting on system level checkpointing as application level checkpointing does not work or exist. The main problem is that for every application it has to be re-implemented. But as I am coming from the system level checkpointing area I am probably biased.

aojea reviewed

View reviewed changes

keps/sig-cli/5091-kubectl-checkpoint/README.md Outdated

+              Container migration for load balancing is something where checkpoint/restore
+              as implemented by CRIU is already used in production today. A prominent example
+              is Google as presented at the Linux Plumbers conference in 2018:
+              [Task Migration at Scale Using CRIU]<[task-migration]>

Member

aojea Feb 19, 2025

The example says that connections are dropped and client must reconnect, this is well understood at google where are librarie and applications that handle the client side reconnection, but my observation is that most people expect to auto-magically reconnect, and AFAIK this will not do it

Contributor Author

adrianreber Mar 10, 2025

Added a paragraph that talks about TCP connection and checkpoint/restore.

aojea reviewed

View reviewed changes

keps/sig-cli/5091-kubectl-checkpoint/README.md

+              ##### Spot Instances
+              Yet another possible use case where checkpoint/restore is already used today
+              are spot instances. Spot instances are usually resources that are cheaper but

Member

aojea Feb 19, 2025

This will need to take into account the time you have for checkpointing, as spot is like that, eventually you'll get destroyed

Contributor Author

adrianreber Mar 10, 2025

Added a reference to an existing solution that handles this.

aojea reviewed

View reviewed changes

keps/sig-cli/5091-kubectl-checkpoint/README.md

Comment on lines +487 to +522

+              Also, *kubectl* is extended to call this new API server interface. The API
+              server, upon receiving a request, will call the kubelet with the corresponding
+              parameters passed from *kubectl*. Once the checkpoint has been successfully written
+              to disk *kubectl* will return the name of the node as well as the location of
+              the checkpoint archive to the user:

Member

aojea Feb 19, 2025 •

edited

Loading

as commented above, this is the most tricky part of the KEP, you need to expand on the technical design here, these endpoints are complex to implement also you need to play with version skews between apiserver, kubelet and container runtimes

Contributor Author

adrianreber Mar 10, 2025

Unfortunately I am not sure what is needed here. I described the API to be just as the API provided by the kubelet. It just forwards everything 1:1 to the kubelet. Concerning different versions of api server and kubelet I described in one section that it will probably just return an error. I guess I do not get it what is required here.

Any existing examples I can take a look at?


          Add Kubectl Checkpoint KEP

cccb3cb

With "Forensic Container Checkpointing" being Beta and discussions
around graduating it to GA, the next step would be kubectl integration
of the container checkpointing functionality.

In addition to the "Forensic Container Checkpointing" use case this KEP
lists multiple use cases how checkpointing containers can be used.

One of the main motivations for this KEP is to make it easier for users
to checkpoint containers, independent of the reason. Having it available
via kubectl reduces the complexity of connecting to the node and
accessing the kubectl checkpoint API endpoint.

Signed-off-by: Adrian Reber <areber@redhat.com>

adrianreber force-pushed the 2025-01-27-kubectl-checkpoint branch from 1b65fed to cccb3cb Compare

March 10, 2025 15:47

Contributor Author

adrianreber commented Mar 10, 2025

@aojea thanks for your review. I added a couple more references.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cncf-cla: yes kind/kep needs-ok-to-test sig/cli size/XXL