Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Migrate the robustness tests to prow #18136

Open
3 of 5 tasks
serathius opened this issue Jun 6, 2024 · 14 comments
Open
3 of 5 tasks

Migrate the robustness tests to prow #18136

serathius opened this issue Jun 6, 2024 · 14 comments
Assignees
Labels
area/robustness-testing priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. type/feature

Comments

@serathius
Copy link
Member

serathius commented Jun 6, 2024

What would you like to be added?

After the last robustness team meeting it was clear how superior Prow + TestGrid is over GitHub actions.

https://testgrid.k8s.io/sig-etcd-robustness#Summary vs https://github.com/etcd-io/etcd/actions/workflows/robustness-nightly.yaml

Advantages:

  • More stable, 7% vs 56% failure rate running the same code
  • Testgrid being much better UI to monitor failures plus addition of more advanced features like creating bugs alerting.
  • Prow being much better tool to view logs from tests, it parses logs, groups them by test, doesn't crash browser, doesn't fail on downloading compressed logs, doesn't created archives.

TODO:

cc @jmhbnz @ivanvc

Why is this needed?

Migration to Prow opens a new chapter for stability and debuggability of robustness test with the goal of making the process more approachable for new contributors.

@henrybear327
Copy link
Contributor

@ArkaSaha30

@ivanvc
Copy link
Member

ivanvc commented Jun 6, 2024

Do we have access to arm nodes in the Prow infra? The last I remember is that we were waiting for them. I don't see any updates regarding this on kubernetes/k8s.io#6102. So, it may be a blocker for the second point.

@serathius
Copy link
Member Author

serathius commented Jun 6, 2024

Not great, but I will not block the migration regardless. Robustness tests only bring value if there is someone willing to review them. With Prow being much better, no-one will be willing to review arm robustness failures.

@ivanvc
Copy link
Member

ivanvc commented Jun 6, 2024

I can see two options: pause running robustness for the ARM architecture (not ideal) or keep ARM tests running on GitHub actions.

I don't see much activity in kubernetes/k8s.io#6102. Who or where would be a good place to ask for a status update/ETA for ARM nodegroups?

@jmhbnz
Copy link
Member

jmhbnz commented Jun 6, 2024

Hi @upodroid - We spoke at KubeCon EU Paris about a dedicated arm64 cluster for prow. Can you please provide an update on the timeline for it being available?

@serathius
Copy link
Member Author

I can see two options: pause running robustness for the ARM architecture (not ideal) or keep ARM tests running on GitHub actions.

I was thinking about the second option, however due to sub-par user experience I expect it would be equal the first one.

@ivanvc
Copy link
Member

ivanvc commented Jun 7, 2024

Discussed on Slack with Arka, we'll be working on the following at the moment:

/assign @ArkaSaha30 @ivanvc

@k8s-ci-robot
Copy link

@ivanvc: GitHub didn't allow me to assign the following users: ArkaSaha30.

Note that only etcd-io members with read permissions, repo collaborators and people who have commented on this issue/PR can be assigned. Additionally, issues/PRs can only have 10 assignees at the same time.
For more information please see the contributor guide

In response to this:

Discussed on Slack with Arka, we'll be working on the following at the moment:

/assign @ArkaSaha30 @ivanvc

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@ivanvc ivanvc removed the help wanted label Jun 7, 2024
@ArkaSaha30
Copy link
Contributor

/assign

@ArkaSaha30
Copy link
Contributor

Currently, the robustness tests on Github Actions run only on main or PRs to main. Do we need to run it on release-3.5 and release-3.4?
The existing robustness periodic and presubmit can be configured to handle all the 3 branches.

@serathius
Copy link
Member Author

There are no robustness test on other branches beside main. We develop and run robustness test from main branch and validate binaries build from older branches.

@jmhbnz jmhbnz added the priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. label Jun 11, 2024
@ivanvc
Copy link
Member

ivanvc commented Jun 13, 2024

We have finished the first and the third tasks. When would you think is a good time to remove the GitHub action @serathius?

We can't move forward with the second, as we don't have a timeline on when ARM runners are going to be available.

@serathius
Copy link
Member Author

We have finished the first and the third tasks. When would you think is a good time to remove the GitHub action @serathius?

We can keep arm64 on Github actions to not block on it.

@ivanvc
Copy link
Member

ivanvc commented Jun 14, 2024

@ArkaSaha30, can you help with

Remove non-arm robustness tests from github actions.?

Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/robustness-testing priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. type/feature
Development

No branches or pull requests

6 participants