Allow a higher number of flaky test attempts by dzbarsky · Pull Request #28635 · bazelbuild/bazel

dzbarsky · 2026-02-11T14:59:26Z

I'm interested in having an always-flaky test to integration test our build tooling's handling of flaky test reporting.

Currently, there is no great way for a test to know that's it is a flaky re-run. Previously I saw the test drop a marker file in /tmp, fail, and then succeed on the retry when it detects the marker file. However, this is incompatible with remote execution or hermetic /tmp.

It would be ideal to inject a flaky test attempt count via ENV var so we can deterministically know we are a retry (similar to how --runs_per_test is injected), but I can see that behavior being a little worrisome.

In the absence of that, we can fail the test with probability 7/8, which gives a ~15% false positive and false negative rate at 10 retries. Bumping the max retries to 1000 allows to fail the test with probability 199/200, which is roughly .5% false positive and false negative rate.

I know this is a bit of a weird use case, but hopefully this is a harmless-enough bump!

gemini-code-assist

Code Review

This pull request increases the maximum number of flaky test attempts from 10 to 1000. While I understand the motivation to improve the testing of flaky test reporting, a 100x increase presents a significant risk of misuse, potentially leading to builds consuming excessive resources. My review includes a suggestion to use a more moderate, safer limit.

gemini-code-assist · 2026-02-11T15:02:42Z

src/main/java/com/google/devtools/build/lib/exec/ExecutionOptions.java

  public static class TestAttemptsConverter extends PerLabelOptions.PerLabelOptionsConverter {
    private static final int MIN_VALUE = 1;
-    private static final int MAX_VALUE = 10;
+    private static final int MAX_VALUE = 1000;


Increasing the maximum number of attempts by 100x from 10 to 1000 is a very large increase. While this enables your specific use case, it introduces a significant risk of misuse. A user setting --flaky_test_attempts=1000 could cause builds with flaky tests to consume extreme amounts of CI resources and time, potentially appearing to hang for hours.

A more moderate limit, such as 100, would still be a 10x increase over the current value and would likely be sufficient for most practical purposes of testing flaky test infrastructure, while mitigating the risk of extreme resource consumption. A limit of 1000 seems excessive and potentially dangerous for general use.

Suggested change

private static final int MAX_VALUE = 1000;

private static final int MAX_VALUE = 100;

100 retries allows 1-(29/30)^100 which is around 3% FN/FP rate, I think that will still trigger often enough to be annoying

Allow a higher number of flaky test attempts

8da1d9d

github-actions bot added the awaiting-review PR is awaiting review from an assigned reviewer label Feb 11, 2026

gemini-code-assist bot reviewed Feb 11, 2026

View reviewed changes

iancha1992 added the team-Core Skyframe, bazel query, BEP, options parsing, bazelrc label Feb 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow a higher number of flaky test attempts#28635

Allow a higher number of flaky test attempts#28635
dzbarsky wants to merge 1 commit intobazelbuild:masterfrom
dzbarsky:zbarsky/flaky

dzbarsky commented Feb 11, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Feb 11, 2026

Uh oh!

dzbarsky Feb 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	private static final int MAX_VALUE = 1000;
	private static final int MAX_VALUE = 100;

Conversation

dzbarsky commented Feb 11, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

dzbarsky Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants