Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

🐛 Fix or reduce frequency or switch off perma-failing jobs #8784

Closed
dims opened this issue Jun 5, 2023 · 11 comments
Closed

🐛 Fix or reduce frequency or switch off perma-failing jobs #8784

dims opened this issue Jun 5, 2023 · 11 comments
Labels
triage/accepted Indicates an issue or PR is ready to be actively worked on.

Comments

@dims
Copy link
Member

dims commented Jun 5, 2023

Data as of June 3rd, 9:00 PM Eastern. Latest data can be found here:
http://storage.googleapis.com/k8s-metrics/failures-latest.json

CI Job Days Failed
post-cluster-api-provider-cloudstack-push-images 201
pull-cluster-api-e2e-scale-main-experimental 100
post-cluster-api-operator-push-images 89
periodic-cluster-api-provider-digitalocean-conformance-ci-artifacts 53
pull-cluster-api-e2e-mink8s-main 3
pull-cluster-api-provider-aws-e2e-eks-release-1-5 108

Additional Context:
kubernetes/test-infra#18600

Folks, i am aware that not all jobs may not be related to this repository per se, please feel free to divvy this up to respective teams/folks/repos

@k8s-ci-robot k8s-ci-robot added the needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. label Jun 5, 2023
@sbueringer
Copy link
Member

sbueringer commented Jun 5, 2023

To split this up

core CAPI:

  • pull-cluster-api-e2e-scale-main-experimental
    • We added this job as an experiment and will probably eventually use it to run some sort of scale e2e tests (with a mock provider).
    • I think the jobs shouldn't be a problem, it's an optional presubmit that only runs when it is manually triggered
  • pull-cluster-api-e2e-mink8s-main

CloudStack

Cluster API Operator:

Digital Ocean:

AWS:

@sbueringer sbueringer added the triage/accepted Indicates an issue or PR is ready to be actively worked on. label Jun 5, 2023
@k8s-ci-robot k8s-ci-robot removed the needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. label Jun 5, 2023
@killianmuldoon
Copy link
Contributor

/triage accepted

@Ankitasw
Copy link
Member

Ankitasw commented Jun 5, 2023

Raised fix for pull-cluster-api-provider-aws-e2e-eks-release-1-5: kubernetes-sigs/cluster-api-provider-aws#4314

@furkatgofurov7
Copy link
Member

Cluster API Operator:

Fix is up: kubernetes-sigs/cluster-api-operator#133

@cpanato
Copy link
Member

cpanato commented Jun 8, 2023

for CAPDO is fixed now :)

@killianmuldoon
Copy link
Contributor

killianmuldoon commented Jun 8, 2023

To keep track:

@killianmuldoon
Copy link
Contributor

killianmuldoon commented Jun 8, 2023

I think that's all of the jobs fixed - thanks again @dims for bringing this up, and thanks everyone for getting the fixes in quickly!

@dims
Copy link
Member Author

dims commented Jun 8, 2023

thanks for the prompt responses @killianmuldoon

@sbueringer
Copy link
Member

Should / can we close the issue?

@killianmuldoon
Copy link
Contributor

/close

(Thought I did that 🙂)

@k8s-ci-robot
Copy link
Contributor

@killianmuldoon: Closing this issue.

In response to this:

/close

(Thought I did that 🙂)

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
triage/accepted Indicates an issue or PR is ready to be actively worked on.
Projects
None yet
Development

No branches or pull requests

7 participants