Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate Consistent CI Failure for Follower Bootup Integration Test #4632

Open
AshtonStephens opened this issue Apr 3, 2024 · 9 comments
Assignees

Comments

@AshtonStephens
Copy link
Collaborator

AshtonStephens commented Apr 3, 2024

Investigate Consistent CI Failure for Follower Bootup Integration Test

A number of PRs began failing integration tests after seemingly no changes were made to the areas of code that would impact the integration tests. The purpose of this ticket is to identify why this one test uniformly fails and define a path forward to either resolving or ignoring it.

This is blocking the following PRs submitted by @jferrant:

@obycode
Copy link
Contributor

obycode commented Apr 3, 2024

This test passes locally on next.

@obycode
Copy link
Contributor

obycode commented Apr 3, 2024

Passes locally on bugfix/resubmit-dkg-vote-inc-nonce (#4621) as well.

@obycode
Copy link
Contributor

obycode commented Apr 3, 2024

Passes locally on bugfix/enable-dkg-in-current-reward-cycle (#4591).

@obycode
Copy link
Contributor

obycode commented Apr 3, 2024

Passes locally on feat/stacks-signer-fee-estimate (#4583) as well.

@obycode
Copy link
Contributor

obycode commented Apr 3, 2024

The issue just seems to be that it takes too long and times out in CI.

@obycode
Copy link
Contributor

obycode commented Apr 3, 2024

It takes about 4 minutes to run locally, but is timing out after 30 minutes in the CI runner. I'm not sure what the best approach should be for tests like this. It might be nice to ignore it during CI, but have a daily job that runs all the slow tests. @wileyj do we have anything like that for other slow tests at the moment?

@AshtonStephens
Copy link
Collaborator Author

@obycode Thank you so much for the quick research. Truly an MVP

@wileyj
Copy link
Contributor

wileyj commented Apr 3, 2024

It takes about 4 minutes to run locally, but is timing out after 30 minutes in the CI runner. I'm not sure what the best approach should be for tests like this. It might be nice to ignore it during CI, but have a daily job that runs all the slow tests. @wileyj do we have anything like that for other slow tests at the moment?

we do - it's a workflow literally called "slow-tests". was literally speaking to @jferrant in slack about this.

https://github.com/stacks-network/stacks-core/blob/master/.github/workflows/slow-tests.yml#L33C1-L34C63

this worfklow has 2 tests that i identified a while back as tests that can end up running indefinitely, but if they are to pass they each take around 20 minutes on a github runner. these tests are only run during a release process, but can be run manually as a github action with the https://github.com/stacks-network/stacks-core/actions/workflows/standalone-tests.yml workflow (select "slow tests" from the dropdown).

running these daily won't give us anything useful - the tests in that slow-tests workflow will either work or they won't and because it's a black blox they're running on, it's rare we'll know why.
but what i've seen is that the test is simply waiting for something to happen that never will.

i recommended to @jferrant that this specific test, if it's going to be unreliable, is to move it to the slow-tests workflow.

@wileyj
Copy link
Contributor

wileyj commented Apr 3, 2024

i also have seen this test pass or fail on the same commit, so i have a strong feeling it's what i wrote above - the workflow is just waiting for the test to do something that is either taking too long or will never happen for reasons.
we can increase the timeout, but if the test passes, it's pretty quick - i see it complete in < 10 minutes reliably when it passes.

@saralab saralab added the CI label Apr 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Status: 🆕 New
Development

No branches or pull requests

5 participants