-
Notifications
You must be signed in to change notification settings - Fork 38.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix OOM killer #119670
fix OOM killer #119670
Conversation
Signed-off-by: lengrongfu <rongfu.leng@daocloud.io>
Please note that we're already in Test Freeze for the Fast forwards are scheduled to happen every 6 hours, whereas the most recent run was: Sun Jul 30 04:14:48 UTC 2023. |
Hi @lengrongfu. Thanks for your PR. I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/ok-to-test |
LGTM label has been added. Git tree hash: 16c8dcef482da9e55d3bd1534e540c414b5d5e7d
|
/triage accepted |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: aojea, lengrongfu, tzneal The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Adding f48daf5412edac432c28 so this will show up when searching from k8s-triage. |
@lengrongfu The OOM killer tests are still failing in #120258 . Ref : https://prow.k8s.io/view/gs/kubernetes-jenkins/pr-logs/pull/120258/pull-kubernetes-node-e2e-containerd/1696861389160714240/ I think the issue still exists. |
There is more discussion and a linked containerd issue in #119600. From my testing, containerd is not sending the OOM status sometimes and kubelet is behaving correctly. The initial sleep here doesn't appear to help. |
What type of PR is this?
/kind bug
/kind failing-test
What this PR does / why we need it:
Which issue(s) this PR fixes:
Fixes # #119600
Special notes for your reviewer:
The problem with OOM kill looks the same as this pr #116082,
https://storage.googleapis.com/kubernetes-jenkins/logs/ci-containerd-node-e2e-1-7/1683184526936772608/artifacts/tmp-node-e2e-92a26bb7-cos-97-16919-353-1-system.log
Does this PR introduce a user-facing change?
Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.: