-
Notifications
You must be signed in to change notification settings - Fork 38.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
e2e_node: fixes after dynamic configuration removal #106210
e2e_node: fixes after dynamic configuration removal #106210
Conversation
/sig node |
/test |
@cynepco3hahue: The
The following commands are available to trigger optional jobs:
Use
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/test pull-kubernetes-node-kubelet-serial-cpu-manager |
@SergeyKanzhelev Hi, can you please review, it is the follow-up after dropping the dynamic kubelet configuration. |
b7b51ae
to
bc5b20b
Compare
@@ -85,7 +85,7 @@ func registerNodeFlags(flags *flag.FlagSet) { | |||
// It is hard and unnecessary to deal with the complexity inside the test suite. | |||
flags.BoolVar(&framework.TestContext.NodeConformance, "conformance", false, "If true, the test suite will not start kubelet, and fetch system log (kernel, docker, kubelet log etc.) to the report directory.") | |||
flags.BoolVar(&framework.TestContext.PrepullImages, "prepull-images", true, "If true, prepull images so image pull failures do not cause test failures.") | |||
flags.BoolVar(&framework.TestContext.RestartKubelet, "restart-kubelet", true, "If true, restart Kubelet unit when the process is killed.") | |||
flags.BoolVar(&framework.TestContext.RestartKubelet, "restart-kubelet", false, "If true, restart Kubelet unit when the process is killed.") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why this change?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Now that we do not have DynamicKubelet configuration, we do not need to run a restart loop.
How did it work before:
- Test updated kubelet via DynamicKubeletConfiguration
- DynamicKubeletConfiguration controller applied updated configuration and stopped the kubelet.
- The restart loop monitors the kubelet health link and once it does not respond restart the kubelet.
How does it work now:
- The test stops the kubelet.
- The test update kubelet static configuration.
- The test restarts kubelet.
So now it totally tests responsibility to make sure the kubelet runs after the test. It makes restarts more predictable because it is not external to the test component that can restart the kubelet in the middle of the update configuration.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm thinking if there were other uses like when kubelet paniced or something. I don't remember any tests explicitly stopping kubelet. But I saw supressed panics before (#101719).
I tend to ok this change since panic is bad enough condition to make test grid red.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm except I don't understand the change of default for the flag
/test pull-kubernetes-node-kubelet-serial |
- CPU manager - Memory Manager - Topology Manager Signed-off-by: Artyom Lukianov <alukiano@redhat.com>
bc5b20b
to
117141e
Compare
/test pull-kubernetes-node-kubelet-serial |
@cynepco3hahue: The following test failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
@SergeyKanzhelev Thanks for the review. I answered the comment and tests passed, except one that failed because of some unrelated problem
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/priority important-soon
/triage accepted
/lgtm
/skip
test failures seems unrelated
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: cynepco3hahue, SergeyKanzhelev The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What type of PR is this?
/kind failing-test
/kind bug
What this PR does / why we need it:
The PR should fix a number of tests that were broken after Dynamic Kubelet configuration removal.
Fixes:
Which issue(s) this PR fixes:
Fixes #
Special notes for your reviewer:
Does this PR introduce a user-facing change?
Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.: