You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Flaky test report: committed-code failures on 2026-04-09
Summary
13 test failures were detected across committed-code builds (Timer/main and Post Merge Action) in the past 24 hours, representing 10 distinct tests across 9 unique builds. None of the failures reproduced locally with the original seed, confirming they are non-deterministic (flaky).
Methodology
Queried the OpenSearch metrics cluster (gradle-check-* indices) for FAILED tests in Timer/main and Post Merge Action builds from the past 24 hours
Extracted reproduction seeds from Jenkins console logs
Attempted local reproduction with the original seed for each test (where feasible)
Queried historical failure data across all build types (including PR builds) using monthly aggregations with unique build counts
Summary Table (sorted by total builds affected)
#
Test
Builds Affected
First Seen
Recent Build
Reproduced?
Pattern
1
ClientYamlTestSuiteIT (string profiler via global ordinals)
Pattern: Long-standing stable flake. Intermittent across all months with no clear trend. The January 2026 spike (24 builds) is notable but didn't persist.
Error: replica shards haven't caught up with primary expected:<18> but was:<15>
Reproduced locally: No — passed with same seed
First seen: October 2024
Total unique builds: 107
Monthly trend (last 6 months): 0, 0, 0, 17, 13, 3
Pattern: Worsening. Same pattern as test Bump com.diffplug.spotless from 5.6.1 to 6.2.2 #3 — dormant until February 2026, then started failing. Both FullRollingRestartIT SEGMENT tests share the same root cause.
Flaky test report: committed-code failures on 2026-04-09
Summary
13 test failures were detected across committed-code builds (Timer/main and Post Merge Action) in the past 24 hours, representing 10 distinct tests across 9 unique builds. None of the failures reproduced locally with the original seed, confirming they are non-deterministic (flaky).
Methodology
gradle-check-*indices) forFAILEDtests in Timer/main and Post Merge Action builds from the past 24 hoursSummary Table (sorted by total builds affected)
ClientYamlTestSuiteIT(string profiler via global ordinals)IndexingIT.testIndexingWithSegRepFullRollingRestartIT.testFullRollingRestart(SEGMENT)AzureBlobStoreRepositoryTests.testContainerCreationAndDeletionFullRollingRestartIT.testFullRollingRestart_withNoRecoveryPayloadAndSource(SEGMENT)SharedClusterSnapshotRestoreIT.testSnapshotFileFailureDuringSnapshotAzureBlobStoreRepositoryTests.testWriteReadRestoreShallowSnapshotV2IT.classMethodRestoreShallowSnapshotV2IT.testContinuousIndexingIngestFromKinesisIT.testPluginsAreInstalledDetailed Findings
1. ClientYamlTestSuiteIT — string profiler via global ordinals
82DE96594B7080D2field [profile.shards.0.aggregations.0.debug.segments_with_single_valued_ords] is not greater than [0]2. IndexingIT.testIndexingWithSegRep
A2B0C92946BE16ECexpected:<0> but was:<1>3. FullRollingRestartIT.testFullRollingRestart (SEGMENT)
1F5B6F8715288B33replica shards haven't caught up with primary expected:<22> but was:<17>4. AzureBlobStoreRepositoryTests.testContainerCreationAndDeletion
6DD75D74EB2363B1RepositoryVerificationException: path is not accessible on cluster-manager node5. FullRollingRestartIT.testFullRollingRestart_withNoRecoveryPayloadAndSource (SEGMENT)
500B52348ACEC49Breplica shards haven't caught up with primary expected:<18> but was:<15>6. SharedClusterSnapshotRestoreIT.testSnapshotFileFailureDuringSnapshot
BA8E73CB0D6E8F2BExpected: <0L> but: was <1L>7. AzureBlobStoreRepositoryTests.testWriteRead
63E29F9F5650C5BFRepositoryVerificationException: path is not accessible on cluster-manager node8. RestoreShallowSnapshotV2IT.classMethod
F2EFD250CE679D6ESuite timeout exceeded (>= 1200000 msec)9. RestoreShallowSnapshotV2IT.testContinuousIndexing
F2EFD250CE679D6ETest abandoned because suite timeout was reached10. IngestFromKinesisIT.testPluginsAreInstalled
A74676B701B5AD6EResourceInUseException: Stream test already existsAdditional tests not in top 10
IndexFieldDataServiceTests.testClearField(build 74250, 9 builds total, did not reproduce locally)IndexFieldDataServiceTests.testExceptionWhileRemovingKey(build 74250, 3 builds total, did not reproduce locally)RemoteSplitIndexIT.testSplitFromOneToN(build 74189, 29 builds total, did not reproduce locally)