Skip to content

Flaky Test: Tester.HeterogeneousSilosTests.HeterogeneousTests.StatelessWorkerPlacementWithClientRefreshTests #9560

Open
@ReubenBond

Description

@ReubenBond

Description

This test has been identified as flaky in the CI pipeline.

Test:
Failure Rate: Not specified in the data provided

Test Details

  • Test Class:
  • Test Method:
  • Test Category: Functional

Test Description

This test verifies stateless worker grain placement with client refresh by:

  1. Setting up a heterogeneous cluster with certain grain types blacklisted
  2. Testing placement strategies (RandomPlacement and PreferLocalPlacement)
  3. Verifying grain resolution after adding new silos
  4. Testing without restarting the client (relying on type map refresh)

Test Configuration

  • Client refresh delay: 1 second
  • Type map refresh interval: 200ms
  • Tests heterogeneous silo configurations

Failure Pattern

The test involves:

  • Dynamic cluster topology changes
  • Type map refresh mechanisms
  • Client-side grain type resolution
  • Timing dependencies on type map propagation

Failures may be related to:

  • Race conditions in type map refresh
  • Timing issues with cluster membership updates
  • Client refresh delay not being sufficient
  • Type resolution inconsistencies during topology changes

Next Steps

  • Investigate type map refresh timing reliability
  • Review if 200ms refresh interval is sufficient for all scenarios
  • Add diagnostic logging for type resolution failures
  • Consider increasing timeouts for CI environments
  • Analyze race conditions in heterogeneous silo scenarios

Related

  • Test file:

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions