-
Notifications
You must be signed in to change notification settings - Fork 38.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WIP DNM testing CI #119590
WIP DNM testing CI #119590
Conversation
Signed-off-by: Francesco Romani <fromani@redhat.com>
Skipping CI for Draft Pull Request. |
Adding the "do-not-merge/release-note-label-needed" label because no release-note block was detected, please follow our release note process to remove it. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
This cherry pick PR is for a release branch and has not yet been approved by Release Managers. To merge this cherry pick, it must first be approved ( If you didn't cherry-pick this change to all supported release branches, please leave a comment describing why other cherry-picks are not needed to speed up the review process. If you're not sure is it required to cherry-pick this change to all supported release branches, please consult the cherry-pick guidelines document. AFTER it has been approved by code owners, please leave the following comment on a line by itself, with no leading whitespace: /cc kubernetes/release-managers (This command will request a cherry pick review from Release Managers and should work for all GitHub users, whether they are members of the Kubernetes GitHub organization or not.) For details on the patch release process and schedule, see the Patch Releases page. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
This issue is currently awaiting triage. If a SIG or subproject determines this is a relevant issue, they will accept it by applying the The Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/test pull-kubernetes-e2e-gce-device-plugin-gpu |
The gpu jobs on 1.27 branch (and likely earlier) are failing at startup stage (xref: kubernetes/kubernetes#119590 https://prow.k8s.io/view/gs/kubernetes-jenkins/pr-logs/pull/119590/pull-kubernetes-e2e-gce-device-plugin-gpu/1684154811563380736/ ) A clear stand out is the error message ``` I0726 11:21:54.238] Begin Captured GinkgoWriter Output >> I0726 11:21:54.238] ... I0726 11:21:54.238] Jul 26 11:21:52.897: INFO: Nvidia GPUs not available on Node: "e2e-4a2d554360-8baae-minion-group-dhf5" I0726 11:21:54.238] Jul 26 11:21:52.925: INFO: Get container nvidia-driver-installer-28m4c/nvidia-driver-installer usage on node e2e-4a2d554360-8baae-minion-group-trvc. CPUUsageInCores: 1.146806814, MemoryUsageInBytes: 4332822528, MemoryWorkingSetInBytes: 621940736 I0726 11:21:54.238] Jul 26 11:21:52.925: INFO: Get container nvidia-gpu-device-plugin-tdtrj/nvidia-gpu-device-plugin usage on node e2e-4a2d554360-8baae-minion-group-trvc. CPUUsageInCores: 3.468e-05, MemoryUsageInBytes: 1650688, MemoryWorkingSetInBytes: 1650688 I0726 11:21:54.238] Jul 26 11:21:53.342: INFO: Get container nvidia-gpu-device-plugin-2zrxr/nvidia-gpu-device-plugin usage on node e2e-4a2d554360-8baae-minion-group-dhf5. CPUUsageInCores: 4.1366e-05, MemoryUsageInBytes: 1503232, MemoryWorkingSetInBytes: 1503232 I0726 11:21:54.238] Jul 26 11:21:53.342: INFO: Get container nvidia-driver-installer-fvzlz/nvidia-driver-installer usage on node e2e-4a2d554360-8baae-minion-group-dhf5. CPUUsageInCores: 0.996069722, MemoryUsageInBytes: 4413071360, MemoryWorkingSetInBytes: 271339520 I0726 11:21:54.238] Jul 26 11:21:53.898: INFO: Getting list of Nodes from API server I0726 11:21:54.238] Jul 26 11:21:53.941: INFO: gpuResourceName nvidia.com/gpu I0726 11:21:54.238] Jul 26 11:21:53.941: INFO: Nvidia GPUs not available on Node: "e2e-4a2d554360-8baae-minion-group-dhf5" I0726 11:21:54.238] Jul 26 11:21:54.170: INFO: Get container nvidia-driver-installer-5clvp/nvidia-driver-installer usage on node e2e-4a2d554360-8baae-minion-group-nbjs. CPUUsageInCores: 0.977369235, MemoryUsageInBytes: 4303605760, MemoryWorkingSetInBytes: 269389824 I0726 11:21:54.238] Jul 26 11:21:54.170: INFO: Get container nvidia-gpu-device-plugin-8nbnk/nvidia-gpu-device-plugin usage on node e2e-4a2d554360-8baae-minion-group-nbjs. CPUUsageInCores: 3.7186e-05, MemoryUsageInBytes: 1507328, MemoryWorkingSetInBytes: 1507328 I0726 11:21:54.238] << End Captured GinkgoWriter Output ``` Crosschecking the diff with the working pre-submit job, the lack of preset stands out. From prow docs it seems presets aren't inherited across files, so let's clone them from the working job. Signed-off-by: Francesco Romani <fromani@redhat.com>
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: ffromani The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/test pull-kubernetes-e2e-gce-device-plugin-gpu |
2101f7e
to
96a2f73
Compare
96a2f73
to
f381603
Compare
/test pull-kubernetes-e2e-gce-device-plugin-gpu |
f381603
to
4969ae6
Compare
/test pull-kubernetes-e2e-gce-device-plugin-gpu |
4969ae6
to
35e11d7
Compare
/test pull-kubernetes-e2e-gce-device-plugin-gpu |
1 similar comment
/test pull-kubernetes-e2e-gce-device-plugin-gpu |
35e11d7
to
ed49601
Compare
/test pull-kubernetes-e2e-gce-device-plugin-gpu |
Signed-off-by: Francesco Romani <fromani@redhat.com>
ed49601
to
55fe670
Compare
/test pull-kubernetes-e2e-gce-device-plugin-gpu |
/test pull-kubernetes-e2e-capz-windows-1-27 |
1 similar comment
/test pull-kubernetes-e2e-capz-windows-1-27 |
/test pull-kubernetes-e2e-gce-device-plugin-gpu |
de02300
to
55fe670
Compare
/test pull-kubernetes-e2e-gce-device-plugin-gpu |
/test pull-kubernetes-e2e-capz-windows-1-27 |
@ffromani: The following tests failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
there were real issues, fixed by kubernetes/test-infra#30450 and kubernetes/test-infra#30352 |
DNM Test CI