Robust VolumeManager reconstruction after kubelet restart #3756
Comments
/sig storage |
/label lead-opted-in |
Hello @jsafrane 👋, 1.27 Enhancements team here. Just checking in as we approach enhancements freeze on 18:00 PDT Thursday 9th February 2023. This enhancement is targeting for stage Here's where this enhancement currently stands:
It looks like #3763 will address most of these issues. The status of this enhancement is marked as |
We're targeting beta directly; we had alpha as part of https://github.com/kubernetes/enhancements/tree/master/keps/sig-storage/1710-selinux-relabeling. The KEP is merged, together with PRR approval, so I think everything is fine for 1.27. |
Great! This enhancement meets all the requirements for being included in v1.27 and is now tracked for the release. One thing to note, make sure to update the PRR section in the KEP README. /stage beta |
@npolshakova: Those labels are not set on the issue: In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Hi @jsafrane 👋, I’m reaching out from the 1.27 Release Docs team. This enhancement is marked as ‘Needs Docs’ for the 1.27 release. Please follow the steps detailed in the documentation to open a PR against dev-1.27 branch in the k/website repo. This PR can be just a placeholder at this time, and must be created by March 16. For more information, please take a look at Documenting for a release to familiarize yourself with the documentation requirements for the release. Please feel free to reach out with any questions. Thanks! |
Hi @jsafrane, Checking in as we approach 1.27 code freeze at 17:00 PDT on Tuesday 14th March 2023. Please ensure the following items are completed:
For this enhancement, it looks like the following PRs are open and need to be merged before code freeze: Please let me know if there are any other PRs in k/k I should be tracking for this KEP. |
There are kubernetes/kubernetes#115972 and kubernetes/kubernetes#115965 that are part of this feature and were already merged. |
Docs: kubernetes/website#40038 |
After a forced reboot, kubelet fails to clean up orphaned volume files and directories. These orphaned volumes persist because their directories still hold files; during normal operation they would have been cleaned up, but the forced reboot leaves them behind. The volume directories are tied to the pods' unique identifiers. Since none of the pods are running after the reboot, the associated volume directories and files can be safely removed. The work in this commit will eventually be superseded by kubernetes/enhancements#3756.
Test Plan:
PASS: Build an ISO, install an AIO-SX system, and verify that the cleanup script is installed and the kubelet service is running with ExecStartPre=/usr/local/bin/kubelet-cleanup-orphaned-volumes.sh.
PASS: Reboot the system with active pods that contain files in their volumes directories and verify that all volume directories and their files under /var/lib/kubelet/pods/ are deleted after reboot.
PASS: Verify that explicitly restarting the kubelet service does not attempt to delete kubelet volume directories.
PASS: Verify that volume-subpaths directories and files are cleaned up after reboot.
Closes-Bug: 2027810
Change-Id: Ie7e637c4d5e79ec08d33bd80dade35890b711548
Signed-off-by: Gleb Aronsky <gleb.aronsky@windriver.com>
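The cleanup described in this commit message can be illustrated with a minimal Python sketch. It is not the actual kubelet-cleanup-orphaned-volumes.sh script, whose contents are not shown in this thread; the function name `clean_pod_volume_dirs`, the demo pod UID, and the `kubernetes.io~empty-dir` example directory are assumptions made for illustration. The sketch also assumes that nothing is mounted below the pods directory, which holds right after a reboot but not after a plain kubelet restart.

```python
import shutil
import tempfile
from pathlib import Path

# Subdirectory names taken from the test plan in the commit message.
VOLUME_SUBDIRS = ("volumes", "volume-subpaths")


def clean_pod_volume_dirs(pods_root: Path) -> list[Path]:
    """Remove the volume directories of every pod directory under pods_root.

    Assumption: nothing is mounted below pods_root. This sketch performs
    no mount check of its own, so it must not run against live mounts.
    """
    removed = []
    if not pods_root.is_dir():
        return removed
    for pod_dir in sorted(pods_root.iterdir()):
        for name in VOLUME_SUBDIRS:
            target = pod_dir / name
            if target.is_dir():
                shutil.rmtree(target)
                removed.append(target)
    return removed


def demo() -> list[str]:
    """Build a fake pods directory in a temp dir and clean it."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp) / "pods"
        # Hypothetical leftover from a pod that was running before the reboot.
        leftover = root / "pod-uid-1" / "volumes" / "kubernetes.io~empty-dir" / "cache"
        leftover.mkdir(parents=True)
        (leftover / "stale.dat").write_text("left over after forced reboot")
        (root / "pod-uid-1" / "volume-subpaths" / "cache").mkdir(parents=True)
        removed = clean_pod_volume_dirs(root)
        return [p.relative_to(root).as_posix() for p in removed]
```

Running `demo()` only touches a temporary directory. Pointing the function at a real /var/lib/kubelet/pods would additionally require confirming that no volume is still mounted, since a recursive delete through a live mount would destroy the volume's data.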
/milestone v1.30 |
I think that reconstruction of global directories is worth its own feature + feature gate. We would need to go back to alpha with |
Downgrade/upgrade test report
I tested installation of 1.28.5 (the feature is enabled there) -> downgrade kubelet to 1.27.9 -> upgrade kubelet back to 1.28.5 using a vanilla cluster installed by kops. The feature is limited to kubelet, so I did not downgrade the API server and KCM. No issues found: the new/old kubelet can read /var/lib/kubelet of the old/new kubelet just fine and clean up its volumes.
Force-delete will trigger the volume reconstruction in kubelet - the newly started kubelet cannot see the deleted Pods in the API server and thus has to reconstruct its state from the OS (
All pod-dirs of the force-deleted pods were unmounted + deleted. |
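The force-delete scenario above can be sketched roughly as follows: once the pods are gone from the API server, the only remaining record of their volumes is the directory tree under /var/lib/kubelet/pods. This Python sketch is an illustration, not kubelet's Go implementation. It assumes the `<pod UID>/volumes/<plugin>/<volume name>` layout with `/` in plugin names written as `~`, and the names `FoundVolume`, `scan_pod_volumes`, and `volumes_to_clean` are hypothetical. The real kubelet also has to check and unmount mounts before deleting anything, which is omitted here.

```python
import tempfile
from pathlib import Path
from typing import NamedTuple


class FoundVolume(NamedTuple):
    pod_uid: str
    plugin: str
    volume_name: str


def scan_pod_volumes(pods_root: Path) -> list[FoundVolume]:
    """List volumes found on disk as <pod UID>/volumes/<plugin>/<volume name>."""
    found = []
    for vol_dir in sorted(pods_root.glob("*/volumes/*/*")):
        if vol_dir.is_dir():
            plugin_dir = vol_dir.parent
            pod_dir = plugin_dir.parent.parent
            # Assumed escaping: "/" in the plugin name is stored as "~".
            plugin = plugin_dir.name.replace("~", "/")
            found.append(FoundVolume(pod_dir.name, plugin, vol_dir.name))
    return found


def volumes_to_clean(found: list[FoundVolume], pods_known_to_api: set[str]) -> list[FoundVolume]:
    """Volumes whose pod the API server no longer reports (e.g. force-deleted)."""
    return [v for v in found if v.pod_uid not in pods_known_to_api]


def demo() -> list[FoundVolume]:
    """Two pods on disk, only one still known to the API server."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "uid-a" / "volumes" / "kubernetes.io~csi" / "data").mkdir(parents=True)
        (root / "uid-b" / "volumes" / "kubernetes.io~empty-dir" / "scratch").mkdir(parents=True)
        found = scan_pod_volumes(root)
        return volumes_to_clean(found, pods_known_to_api={"uid-a"})
```

In this demo, `uid-b` stands in for a force-deleted pod: its volume is discovered from the directory tree alone and flagged for cleanup, while the volume of `uid-a` is left untouched.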
Hello {enhancement owner} 👋, Enhancements team here. Just checking in as we approach enhancements freeze on 02:00 UTC Friday 9th February 2024. This enhancement is targeting for stage Here's where this enhancement currently stands:
For this KEP, we would just need to update the following:
The status of this enhancement is marked as |
@AnaMMedina21, both were updated yesterday in #4432 |
@jsafrane Can we also update |
With all the requirements fulfilled this enhancement is now marked as tracked for the upcoming enhancements freeze 🚀 |
Fixed the label. It's moving from beta to GA in 1.30. |
I think we marked KEPs as |
@jsafrane You are correct. The enhancement is only updated to implemented once all code, docs, etc... are merged and the entire feature is finished. Enhancements should not be marked |
Hello @jsafrane, 👋 1.30 Docs Shadow here. |
Hi @jsafrane, 👋 from the v1.30 Communications Team! We'd love for you to opt in to write a feature blog about your enhancement! We encourage blogs for features including, but not limited to: breaking changes, features and changes important to our users, and features that have been in progress for a long time and are graduating. To opt in, you need to open a Feature Blog placeholder PR against the website repository. |
I opened placeholder doc: kubernetes/website#45282 |
Hey again @jsafrane 👋 Enhancements team here. Just checking in as we approach code freeze at 02:00 UTC Wednesday 6th March 2024. Here's where this enhancement currently stands:
For this enhancement, I wanted to loop back on the Also, please let me know if there are other PRs in k/k we should be tracking for this KEP. |
Marking the KEP as implemented: #4532 There are no other code PRs in flight, only docs in kubernetes/website#45282. |
Hi @jsafrane 👋, 1.31 Enhancements Lead here. If you wish to progress this enhancement in v1.31, please have the SIG lead opt-in your enhancement by adding the lead-opted-in label and set the milestone to v1.31 before the Production Readiness Review Freeze. /remove-label lead-opted-in |
This went GA in 1.30 and is marked as implemented, #4532 |
@jsafrane: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
Enhancement Description
(k/enhancements) update PR(s):
(k/k) update PR(s):
(k/website) update(s): Add docs for VolumeManager reconstruction after kubelet restart website#40038
(k/enhancements) update PR(s): Graduate NewVolumeManagerReconstruction GA #4432
(k/k) update PR(s): KEP 3756: Mark NewVolumeManagerReconstruction as GA kubernetes#123442
(k/website) update(s): Mark NewVolumeManagerReconstruction as GA website#45282
Please keep this description up to date. This will help the Enhancement Team to track the evolution of the enhancement efficiently.