Gardening: mark flaky tests flaky. #82754
Please cross-reference the individual issues for the flaky tests in the PR description so that github will add links, and link to the individual issues for the flaky tests in |
For the tests exceeding 2%, the proposal is to remove them out of the dashboard. They need to go through the staging environment for validation before being re-enabled: https://github.com/flutter/flutter/wiki/Reducing-Test-Flakiness#fixing-flaky-tests
Can I get a link to a failure for |
@keyonghan If "remove them out of the dashboard" means that they won't run at all, then I don't think that's the right thing to do. Especially since these are benchmarks, removing them entirely will mean that we will miss regressions.
"remove them out of the dashboard" means they will not show up in the Flutter build dashboard, but they will still be triggered and run. The Milo dashboard will still show the build info.
I've removed the entries (and so didn't add the bug cross-references, since there's nothing left to connect them to), but I'll wait to commit this until you LGTM this updated PR again, @zanderso.
I think we need to leave these on the dashboard so that we have a visual reminder of what is marked flaky.
I guess our intention is to unblock the tree and make sure the test passes 50 successful runs in the staging pool before it is re-enabled. We can start this way and adjust the process if needed. I will update the doc accordingly.
OK, done. Since this is pure JSON (no comments), I added an extra field called "issue_url" to the flaky tests, which will hopefully be ignored. If not, well, I'll revert it.
dev/prod_builders.json
@@ -82,7 +82,8 @@
       "name": "Linux complex_layout_scroll_perf__devtools_memory",
       "repo": "flutter",
       "task_name": "linux_complex_layout_scroll_perf__devtools_memory",
-      "flaky": false
+      "flaky": true,
+      "issue_url": "https://github.com/flutter/flutter/issues/82741"
I haven't announced it yet, but I'm aiming to deprecate this JSON config Wednesday night in favor of the .ci.yaml. Just commenting here as I'll need to move these to comments in the migration PR (@CaseyHillers @christopherfujino)
I think you just announced it. :-)
I suppose new properties could be naturally supported in .ci.yaml?
Yes, PRs welcome if you want to add automation around this
> I suppose new properties could be naturally supported in .ci.yaml?
You would have to update the protobuf in flutter/cocoon first. I actually see this as a feature: if you add a new target and misspell a field, CI will fail on an unknown field rather than silently ignoring it.
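To illustrate the "fail on an unknown field" behavior described here, a minimal Python sketch follows. It is not the cocoon protobuf schema: `ALLOWED_FIELDS` and `validate_builder` are hypothetical names, and the allowed keys are just the four visible in the diff above.

```python
# Hypothetical allow-list: only the keys visible in the diff in this thread.
ALLOWED_FIELDS = {"name", "repo", "task_name", "flaky"}


def validate_builder(entry: dict) -> dict:
    """Return the entry unchanged, or raise ValueError on any unknown field."""
    unknown = set(entry) - ALLOWED_FIELDS
    if unknown:
        # Failing loudly catches misspelled or unsupported keys
        # instead of silently ignoring them.
        raise ValueError(f"unknown field(s): {sorted(unknown)}")
    return entry
```

Under this kind of strict check, an entry carrying an extra key such as "issue_url" would be rejected until the key is added to the schema.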
This reverts commit f568f4e139f0f7f94e3a060e99bccaf8cadc9735 to mark tests flaky instead of removing them.
Sorry, @zanderso, I can't add extra fields or comments to the JSON, so I had to remove the cross-linking for the issues. @CaseyHillers might be able to add a field for that into |
When re-enabling these tests, they should still probably go through the "ran 50 times and didn't flake" soak time, right?
Yes (https://github.com/flutter/flutter/wiki/Reducing-Test-Flakiness#fixing-flaky-tests)
Will it show up in |
It will still show up in the 15 day dashboard. Created #82767 to enable an easy validation. Until then, the flutter build dashboard will be the source of truth (no task box with exclamation point for the top 50 consecutive commits).
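The "50 consecutive clean runs" check discussed in this thread could be sketched roughly as below. This is only an illustration, not the tooling tracked in #82767: `ready_to_reenable` and the list-of-booleans input are assumptions.

```python
REQUIRED_CONSECUTIVE_PASSES = 50  # soak count mentioned in this thread


def ready_to_reenable(results: list[bool],
                      required: int = REQUIRED_CONSECUTIVE_PASSES) -> bool:
    """results is ordered oldest to newest; True means the run passed.

    Returns True only if the most recent `required` runs all passed.
    """
    if len(results) < required:
        return False  # not enough history yet
    return all(results[-required:])
```

A single failure anywhere in the most recent 50 runs keeps the test marked flaky; older failures outside that window are ignored.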
This marks the following tests as flaky, per the gardener rotation:
linux_complex_layout_scroll_perf__devtools_memory: complex_layout_scroll_perf__devtools_memory is 4.65% flaky #82741
linux_large_image_changer_perf_android: large_image_changer_perf_android is 2.44% flaky #82747
linux_new_gallery__crane_perf: new_gallery__crane_perf is 2.56% flaky #82745
linux_platform_channels_benchmarks: platform_channels_benchmarks is 3.33% flaky #82743
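As a quick check of the 2% threshold mentioned earlier in the conversation against the percentages listed in this description, here is a small Python sketch. The dictionary simply restates the figures above; `exceeds_threshold` is a hypothetical helper, and the strict "greater than" comparison is an assumption about how the threshold is applied.

```python
FLAKY_THRESHOLD_PERCENT = 2.0  # threshold cited in the conversation

# Flakiness percentages as reported in the linked issues.
reported = {
    "linux_complex_layout_scroll_perf__devtools_memory": 4.65,
    "linux_large_image_changer_perf_android": 2.44,
    "linux_new_gallery__crane_perf": 2.56,
    "linux_platform_channels_benchmarks": 3.33,
}


def exceeds_threshold(percent: float,
                      threshold: float = FLAKY_THRESHOLD_PERCENT) -> bool:
    """True if the reported flakiness is strictly above the threshold."""
    return percent > threshold


over = sorted(name for name, pct in reported.items() if exceeds_threshold(pct))
```

All four tasks land in `over`, which matches the decision to mark each of them flaky.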