🐛 Fix or reduce frequency or switch off perma-failing jobs #8784

dims · 2023-06-05T01:31:47Z

Data as of June 3rd, 9:00 PM Eastern. Latest data can be found here:
http://storage.googleapis.com/k8s-metrics/failures-latest.json

CI Job	Days Failed
post-cluster-api-provider-cloudstack-push-images	201
pull-cluster-api-e2e-scale-main-experimental	100
post-cluster-api-operator-push-images	89
periodic-cluster-api-provider-digitalocean-conformance-ci-artifacts	53
pull-cluster-api-e2e-mink8s-main	3
pull-cluster-api-provider-aws-e2e-eks-release-1-5	108

Additional Context:
kubernetes/test-infra#18600

Folks, I am aware that not all of these jobs may be related to this repository per se. Please feel free to divvy this up among the respective teams/folks/repos.

sbueringer · 2023-06-05T09:12:59Z

To split this up

core CAPI:

pull-cluster-api-e2e-scale-main-experimental
- We added this job as an experiment and will probably eventually use it to run some sort of scale e2e tests (with a mock provider).
- I think the job shouldn't be a problem: it's an optional presubmit that only runs when it is manually triggered.
pull-cluster-api-e2e-mink8s-main
- Again this is an optional presubmit. We recently added it and had an initial configuration error. It should work now (https://prow.k8s.io/job-history/gs/kubernetes-jenkins/pr-logs/directory/pull-cluster-api-e2e-mink8s-main)
- EDIT: Might not be stable yet, but it's supposed to be stable. I'll take a closer look :)

CloudStack

post-cluster-api-provider-cloudstack-push-images
- @rohityadavcloud @davidjumani @jweite-amazon @dims @g-gaston @chrisdoherty4 @weizhouapache Can one of you please take a look?

Cluster API Operator:

post-cluster-api-operator-push-images
- @JoelSpeed @Fedosin @alexander-demicev @damdo Can one of you please take a look?

Digital Ocean:

periodic-cluster-api-provider-digitalocean-conformance-ci-artifacts
- @cpanato @gottwald @prksu @timoreimann Can one of you please take a look?

AWS:

pull-cluster-api-provider-aws-e2e-eks-release-1-5
- @richardcase @sedefsavas @Skarlso @Ankitasw @dlipovetsky Can one of you please take a look?

killianmuldoon · 2023-06-05T09:19:37Z

/triage accepted

Ankitasw · 2023-06-05T10:51:19Z

Raised fix for pull-cluster-api-provider-aws-e2e-eks-release-1-5: kubernetes-sigs/cluster-api-provider-aws#4314

furkatgofurov7 · 2023-06-05T11:33:31Z

Cluster API Operator:

post-cluster-api-operator-push-images

@JoelSpeed @Fedosin @alexander-demicev @damdo Can one of you please take a look?

Fix is up: kubernetes-sigs/cluster-api-operator#133

cpanato · 2023-06-08T15:33:01Z

CAPDO is fixed now :)

killianmuldoon · 2023-06-08T15:37:51Z

To keep track:

post-cluster-api-provider-cloudstack-push-images: "post-cluster-api-provider-cloudstack-push-images" job failed for 201 days (cluster-api-provider-cloudstack#266)
pull-cluster-api-e2e-scale-main-experimental: experimental presubmit, not running regularly
post-cluster-api-operator-push-images: 🐛 build: fix image push job by disabling CGO when building kustomize (cluster-api-operator#133)
periodic-cluster-api-provider-digitalocean-conformance-ci-artifacts: not fail the command if is not able to delete the key (cluster-api-provider-digitalocean#498)
pull-cluster-api-e2e-mink8s-main: bad config, fixed
pull-cluster-api-provider-aws-e2e-eks-release-1-5: [release-1.5] [E2E] Fix kubernetes version for EKS upgrade tests (cluster-api-provider-aws#4314)

killianmuldoon · 2023-06-08T22:07:33Z

I think that's all of the jobs fixed - thanks again @dims for bringing this up, and thanks everyone for getting the fixes in quickly!

dims · 2023-06-08T22:25:02Z

thanks for the prompt responses @killianmuldoon

sbueringer · 2023-06-09T06:47:54Z

Should / can we close the issue?

killianmuldoon · 2023-06-09T09:46:05Z

/close

(Thought I did that 🙂)

k8s-ci-robot · 2023-06-09T09:46:09Z

@killianmuldoon: Closing this issue.


k8s-ci-robot added the needs-triage label Jun 5, 2023

sbueringer added the triage/accepted label Jun 5, 2023

k8s-ci-robot removed the needs-triage label Jun 5, 2023

weizhouapache mentioned this issue Jun 5, 2023

"post-cluster-api-provider-cloudstack-push-images" job failed for 201 days kubernetes-sigs/cluster-api-provider-cloudstack#266

Closed

furkatgofurov7 mentioned this issue Jun 5, 2023

CI: post-cluster-api-operator-push-images jobs failing for 89 days kubernetes-sigs/cluster-api-operator#132

Closed

Ankitasw mentioned this issue Jun 5, 2023

[release-1.5] [E2E] Fix kubernetes version for EKS upgrade tests kubernetes-sigs/cluster-api-provider-aws#4314

Merged

furkatgofurov7 mentioned this issue Jun 5, 2023

🐛 build: fix image push job by disabling CGO when building kustomize kubernetes-sigs/cluster-api-operator#133

Merged

cpanato mentioned this issue Jun 7, 2023

not fail the command if is not able to delete the key kubernetes-sigs/cluster-api-provider-digitalocean#498

Merged

g-gaston mentioned this issue Jun 8, 2023

Disable CGO when running generate deep copy kubernetes-sigs/cluster-api-provider-cloudstack#268

Merged

k8s-ci-robot closed this as completed Jun 9, 2023
