New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
In-Place Update of Pod Resources #1287
Comments
/assign @vinaykul |
👋 Hey there @vinaykul. I'm a shadow on the 1.17 Release Team, working on Enhancements. We're tracking issues for the 1.17 release and I wanted to reach out and ask we should track this (or more specifically I guess the In-Place Update of Pod Resources feature) for 1.17? The current release schedule is: Monday, September 23 - Release Cycle Begins We're only 5 days away from the Enhancements Freeze, so if you intend to graduate this capability in the 1.17 release, here are the requirements that you'll need to satisfy:
Thanks @vinaykul |
Hi @jeremyrickard I'll do my best to get this KEP to implementable state by next Tuesday, but it looks like a stretch at this point - the major item is to complete API review with @thockin , and that depends on his availability. The actual code changes are not that big. Nevertheless, the safe option would be to track this for 1.18.0 release, I'll update you by next Monday. |
@jeremyrickard @mrbobbytables This KEP will take some more discussion - key thing is API review. It does not look like @thockin or another API reviewer is available soon. Could we please track this KEP for v1.18? |
/milestone v1.18 |
@PatrickLang Here's a first stab at the proposed CRI change to allow UpdateContainerResources to work with Windows. Please take a look.. let's discuss in tomorrow's sig meeting
|
@vinaykul It looks like since the above PR was merged, this was removed from the API review queue. I believe you need to open a new PR that moves the state to implementable, and then add the API-review label to get it back in the queue and get a reviewer. Edit: you should also include any other changes (e.g. windows CRI changes) required to move the feature to implementable in the PR as well. |
@dashpole Thanks! I've started a provisional mini-KEP per our discussion last week for the CRI changes (Dawn mentioned last week that we should take that up separately). imho the CRI changes does not block the implementation of this KEP, as it is between Kubelet and runtime, and user is not affected by it. In a second commit to the same PR, I've addressed another key issue (update api failure handling), and requested change to move primary KEP to implementable. With this, everything is in one place, and we can use it for API review. |
Hey there @vinaykul -- 1.18 Enhancements shadow here. I wanted to check in and see if you think this Enhancement will be graduating to alpha in 1.18? The current release schedule is:
To be included in the release,
If you would like to include this enhancement, once coding begins please list all relevant k/k PRs in this issue so they can be tracked properly. 👍 We'll be tracking enhancements here: http://bit.ly/k8s-1-18-enhancements Thanks! :) |
@palnabarun Yes, I'm planning to work towards alpha code targets for this feature in 1.18. I've updated the KEP adding test plan and graduation criteria sections that I will be reviewing with SIG-Node this week and hope to get it implementable before Jan 28. I'll update this thread if anything changes. |
Thank you @vinaykul for the updates. :) |
/stage alpha |
/milestone v1.18 |
Hello @vinaykul @Jeffwan 👋, 1.29 Enhancements team here! Just checking in as we approach enhancements freeze on 01:00 UTC, Friday, 6th October, 2023. This enhancement is targeting for stage Here's where this enhancement currently stands:
For this KEP, we would just need to update the following:
The status of this enhancement is marked as |
@npolshakova Thanks for the update and I will file a PR to make the KEP update. At the same time, I had some discussion with @LingyanYin and build up a issue/bug list for v1.29. @LingyanYin please help reorganize the contents and publish a list here for communities to discuss and collaborate. |
sure. Will do |
@LingyanYin, Vinay has listed few issues here https://github.com/vinaykul/kubernetes/wiki/In-Place-Pod-Vertical-Scaling-Issues-and-Status, Have you considered them as well. |
Hi @LingyanYin, just checking in once more as we approach the 1.29 enhancement freeze deadline this week on 01:00 UTC, Friday, 6th October, 2023. The status of this enhancement is marked as Please update the latest-milestone in the KEP yaml to 1.29. If there are major changes to the KEP you will also need a new production readiness review. Let me know if I missed anything. Thanks! |
#4267 @npolshakova Here's the PR to update the |
Is this staying in alpha or moving to beta? To move to beta there are PRR sections missing https://github.com/kubernetes/enhancements/blob/master/keps/sig-node/1287-in-place-update-pod-resources/README.md?plain=1#L988, the stage needs updating, and a PRR update is expected https://github.com/kubernetes/enhancements/blob/master/keps/prod-readiness/sig-node/1287.yaml |
/unassign @vinaykul |
This is staying in alpha as of the last discussions |
With KEP PR #4267 approved, the enhancement is ready for the enhancements freeze. The status is now marked as |
Hello @vinaykul @Jeffwan 👋, v1.29 Docs Shadow here. |
Hi again @vinaykul @Jeffwan @LingyanYin, 👋, 1.29 Enhancements team here! Just checking in as we approach code freeze at 01:00 UTC Wednesday 1st November 2023: . Here's where this enhancement currently stands:
It looks like https://github.com/vinaykul/kubernetes/wiki/In-Place-Pod-Vertical-Scaling-Issues-and-Status tracks the alpha blocker issues. It looks like these PRs have already merged:
Are there additional code related PRs that need to be merged for 1.29? It looks like kubernetes/kubernetes#112599 is mentioned in the issue description and kubernetes/kubernetes#121218 has been opened recently. Also, please let me know if there are other PRs in k/k we should be tracking for this KEP. |
@npolshakova Two related PRs that are low risk fixes were recently merged should be tracked for release purposes (I have updated the description above). PR kubernetes/kubernetes#117615 looks ready to merge with additional review. cc: @mrunalp @Random-Liu @Jeffwan may have additional PRs on the way. Jiaxin please LMK the key ones that I need to look at and I'll do my best to find the time to review. |
Hey @Huang-Wei @SergeyKanzhelev @liggitt The Pod Scheduling Readiness feature empowers users to implement their custom resource quotas. There is a need to incorporate the ability to add a scaling readiness gate, acting as a finalizer/scheduling gate. This enables users to dynamically remove it using their own controller, ensuring the validity of newly allocated resources. |
Hi @vinaykul @Jeffwan @LingyanYin, 👋 from the v1.29 Release Team-Communications! We would like to check if you have any plans to publish a blog for this KEP regarding new features, removals, and deprecations for this release. If so, you need to open a PR placeholder in the website repository. |
@drewhagen yeah. I confirmed with you offline and there won't be doc change yet. @a-mccarthy Not yet as well. current fixed are not enough to publish a blog yet. |
Thanks! With those PRs merged and the issue description updated this is |
Please also track PRs: |
I already have opened an issue about this: |
@npolshakova I really like kubernetes/kubernetes#120432 this one to be included in v1.29 and it's pending review and needs approval. This resolves a critical performance issue. Can we have an exception for this one? /cc @vinaykul Can you help take a look at this link? |
Enhancement Description
One-line enhancement description (can be used as a release note):
This issue tracks a list of KEP review conversations that need resolving before we GA the feature.
Kubernetes Enhancement Proposal: https://github.com/kubernetes/enhancements/tree/master/keps/sig-node/1287-in-place-update-pod-resources
Primary contact (assignee): @Jeffwan
Responsible SIGs: sig-node, sig-autoscaling
Enhancement target (which target equals to which milestone):
Alpha(v1.27~v1.29)
KEP (k/enhancements) update PR(s):
Code (k/k) update PR(s):
Docs (k/website) update PR(s):
Please to keep this description up to date. This will help the Enhancement Team track efficiently the evolution of the enhancement
Identify CRI changes needed for UpdateContainerResources API, define response message for UpdateContainerResourcesExtend UpdateContainerResources API to return info such as ‘not supported’, ‘not enough memory’, ‘successful’, ‘pending page evictions’ etc.Define expected behavior for runtime when UpdateContainerResources is invoked. Define timeout duration of the CRI call.Define behavior when multiple containers are being resized, and UpdateContainerResources fails for one or more containers.
Check with API reviewers if we can keep maps instead list of named sub-objects for ResizePolicy.Can we find a more intuitive name for ResizePolicy?
Can we use ResourceVersion to figure out the ordering of Pod resize requests?
Do we need to add back the ‘RestartPod’ resize policy? Is there a strong use-case for it?Alpha Feature Code Issues:
These are Items and issues discovered during code review that need further discussion and need to be addressed before Beta.
Can we figure out GetPodQOS differently once it is determined on pod create? See In-place Pod Vertical Scaling feature kubernetes#102884 (comment)Add real values to dockershim test for ContainerStatus.Resources In-place Pod Vertical Scaling feature kubernetes#102884 (comment)Change PodStatus.Resources from v1.ResourceRequirements to *v1.ResourceRequirementsThe text was updated successfully, but these errors were encountered: