-
Notifications
You must be signed in to change notification settings - Fork 26.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
More Linux/Android resources for staging are needed #96864
Comments
We have three linux bots running tasks in staging, there are 4 in idle status (1 motog4 (M), 3 samsung) which are not being scheduled to run tests ever. A couple of things to move forward:
As a short-time workaround, we can limit the number of devicelab tests running in the staging pool, giving room for the real flaky tests validation. |
https://flutter-review.googlesource.com/c/infra/+/25440 to skip ~35 tests migrated from mac/android. |
This issue has been mitigated removing a subset of tests from staging. |
Thanks! I think we can close this as fixed, and I'll re-open or file a new one as needed. |
Yeah, builds are queued up quickly especially when we have several commits merged around the same time. There are currently 3 linux staging bots running 25 staging linux/android tests, whereas there are 17 linux prod bots running 83 prod linux/android test (with current 90th% queue time 22min, SLO 35 min). It makes sense to me to migrate, say 2 bots, from prod to staging to help validate the flaky tests for now. We can expand the prod capacity when new bots are available. For windows bots, opened #97017. |
Instead of migrating bots between prod and staging, https://flutter-review.googlesource.com/c/infra/+/25566 to run devicelab staging linux tests in A02 testbeds. |
Builds are running at a frequent pace. Closing. |
This thread has been automatically locked since there has not been any recent activity after it was closed. If you are still experiencing a similar issue, please open a new bug, including the output of |
Flaky tests and benchmark get moved to staging and need some number of consecutive non-flaky runs in order to move back to prod. However, it looks like staging is under-provisioned, and so achieving that number of runs is going to take a long time. As an example
Linux_android opacity_peephole_fade_transition_text_perf__e2e_summary
, has only been run on 6 out of the last 70 framework commits. There are several other benchmarks that appear to be receiving similar treatment. For contrastLinux_android animated_placeholder_perf__e2e_summary
has been run on 50 out of the last 70 commits.Marking P2 to determine whether this is really due to insufficient resources or rather due to a bug in scheduling.
/cc @godofredoc
The text was updated successfully, but these errors were encountered: