sig-windows-gce test jobs are failing consistently for a long time #124047
Comments
@AnishShah: Guidelines
Please ensure that the issue body includes answers to the following questions:
For more details on the requirements of such an issue, please see here and ensure that they are met. If this request no longer meets these requirements, the label can be removed.
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
/triage accepted
I can give this a shot
/triage accepted
Thanks! Reach out on Slack if you need help.
I think I can work on this @AnishShah
@AryaMoghaddam are you still working on this issue?
Which jobs are failing?
gce-windows-2019-containerd-master and gce-windows-2022-containerd-master are failing.
Which tests are failing?
The Windows nodes are failing to come up because the startup scripts are failing.
Since when has it been failing?
It has been failing for a long time. It is red on the whole testgrid.
Testgrid link
https://testgrid.k8s.io/sig-windows-gce
Reason for failure (if possible)
The Windows startup scripts are failing to create the NPD kubeconfig. I found this in the Windows node serial log from a failed test run.
Anything else we need to know?
On the Linux node, we run NPD in standalone mode and use the kubeconfig generated from the token specified in NODE_PROBLEM_DETECTOR_TOKEN, or use the kubelet kubeconfig. ref.
But on Windows, we do not run NPD because ENABLE_NODE_PROBLEM_DETECTOR is set to none, based on the serial logs.
Based on the Linux node behavior, maybe we should set ENABLE_NODE_PROBLEM_DETECTOR to standalone on Windows and use the kubelet kubeconfig if NODE_PROBLEM_DETECTOR_TOKEN is missing?
Relevant SIG(s)
/sig windows
/good-first-issue
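The fallback suggested under "Anything else we need to know?" could be sketched roughly as follows. This is Python for illustration only, not the actual Windows startup script. The function name and the "token" / "kubelet" return labels are hypothetical, and treating an unset ENABLE_NODE_PROBLEM_DETECTOR as none is an assumption.

```python
def choose_npd_kubeconfig(env):
    """Return which kubeconfig NPD would use, or None if NPD should not run.

    Hypothetical helper: "token" and "kubelet" are illustrative labels,
    not names taken from the real startup scripts.
    """
    # Assumption: an unset variable behaves like "none".
    mode = env.get("ENABLE_NODE_PROBLEM_DETECTOR", "none")
    if mode != "standalone":
        return None  # NPD is not started in standalone mode
    if env.get("NODE_PROBLEM_DETECTOR_TOKEN"):
        return "token"  # build a kubeconfig from the NPD token
    return "kubelet"  # token missing: fall back to the kubelet kubeconfig
```

With this logic, a Windows node configured as standalone but without a token would reuse the kubelet kubeconfig instead of failing during startup.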