-
Notifications
You must be signed in to change notification settings - Fork 453
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add validation when container OOMKilled #1146
Conversation
Welcome @bitoku! |
thanks @bitoku is this still a draft? |
I wanted to check the result of the tests before making it not-draft. I think I have to check the behaviour on my machine. |
related to #1102? |
Yes, sorry I put |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
CI fails because of this change:
[FAILED] in [It] - github.com/kubernetes-sigs/cri-tools/pkg/validate/container_linux.go:202 @ 05/02/23 07:12:14.529
STEP: stop PodSandbox @ 05/02/23 07:12:14.529
STEP: delete PodSandbox @ 05/02/23 07:12:14.692
<< Timeline
[FAILED] Timed out after 60.000s.
Expected
<v1.ContainerState>: 1
to equal
<v1.ContainerState>: 2
In [It] at: github.com/kubernetes-sigs/cri-tools/pkg/validate/container_linux.go:202 @ 05/02/23 07:12:14.529
------------------------------
Summarizing 1 Failure:
[FAIL] [k8s.io] Container OOM runtime should output OOMKilled reason [It] should terminate with exitCode 137 and reason OOMKilled
github.com/kubernetes-sigs/cri-tools/pkg/validate/container_linux.go:202
Ran 84 of 89 Specs in 85.944 seconds
I found the failure is because of containerd/containerd#7749. Is there any specific reason to stay on the So there are two solutions I came up with:
Which one do you think is better? |
I added |
It seems it timed out in the build step. |
Looking at output
and based on the ordering the |
Yes It passes OK on github actions in my forked repo.
Are you mentioning about containerd version? I looked over the past commits and some commits failed Do we need to set build timeout 20 mins? |
Sounds reasonable for me. @saschagrunert? |
Yeah let’s increase the timeout. This test flakes often because of that. |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: bitoku, saschagrunert, SergeyKanzhelev The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What type of PR is this?
/kind feature
What this PR does / why we need it:
This PR adds validation for OOMKilled container.
It checks if exitCode==137 && reason==OOMKilled
Which issue(s) this PR fixes:
Fixes #1102
Special notes for your reviewer:
Does this PR introduce a user-facing change?