CI test linux://rllib:TestLearnerGroupAsyncUpdate is flaky #45088

Closed
can-anyscale opened this issue May 1, 2024 · 54 comments
Labels
- bug: Something that is supposed to be working; but isn't
- ci-test
- flaky-tracker: Issue created via Flaky Test Tracker https://flaky-tests.ray.io/
- ray-test-bot: Issues managed by OSS test policy
- rllib: RLlib related issues
- stability
- triage: Needs triage (eg: priority, bug/not-bug, and owning component)

Comments

@can-anyscale
Collaborator

CI test linux://rllib:TestLearnerGroupAsyncUpdate is consistently_failing. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/4274#018f35f7-a42e-40b4-947d-6d3f9782cace
- https://buildkite.com/ray-project/postmerge/builds/4272#018f3593-7dcd-4acd-921e-9441047e9568

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END
Managed by OSS Test Policy

@can-anyscale added the bug, ci-test, flaky-tracker, ray-test-bot, rllib, stability, triage, and weekly-release-blocker (Issues that will be blocking Ray weekly releases) labels on May 1, 2024
@can-anyscale
Collaborator Author

This test is now considered flaky because it has been failing on postmerge for too long. Flaky tests do not run on premerge.

@can-anyscale
Collaborator Author

fixed by #45110

@can-anyscale reopened this on Jun 6, 2024
@can-anyscale
Collaborator Author

CI test linux://rllib:TestLearnerGroupAsyncUpdate is consistently_failing. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/4760#018feeae-22ab-4db4-b5e9-88b29e1a961a

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END
Managed by OSS Test Policy

@can-anyscale
Collaborator Author

Blamed commit: 937a8fd found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1214

@can-anyscale reopened this on Jun 16, 2024
@can-anyscale
Collaborator Author

CI test linux://rllib:TestLearnerGroupAsyncUpdate is consistently_failing. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/4970#01901fa7-10ae-438d-8aa4-f5b05e7107f9

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END
Managed by OSS Test Policy

@can-anyscale
Collaborator Author

Blamed commit: 70e5e78 found by bisect job https://buildkite.com/ray-project/release-tests-bisect/builds/1242

@can-anyscale reopened this on Jun 21, 2024
@can-anyscale
Collaborator Author

CI test linux://rllib:TestLearnerGroupAsyncUpdate is consistently_failing. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/5053#01903b8e-53db-405a-9776-4885b7b36841

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END
Managed by OSS Test Policy

@can-anyscale
Collaborator Author

CI test linux://rllib:TestLearnerGroupAsyncUpdate is flaky. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/6564#01929498-502e-490c-815d-f40accbed504
- https://buildkite.com/ray-project/postmerge/builds/6555#019292d6-8d2a-4168-ba63-0e723662e28d
- https://buildkite.com/ray-project/postmerge/builds/6549#01928d83-5502-4036-afd4-6cbd5f07df29

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END
Managed by OSS Test Policy

@can-anyscale reopened this on Oct 30, 2024
@can-anyscale
Collaborator Author

CI test linux://rllib:TestLearnerGroupAsyncUpdate is flaky. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/6788#0192e50e-b6be-48d9-ba87-160e0bca3d08
- https://buildkite.com/ray-project/postmerge/builds/6764#0192e022-32ec-4839-ab1d-fb30e7fff41a
- https://buildkite.com/ray-project/postmerge/builds/6759#0192de2f-e59f-429b-88e9-c4e174d292ee

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END
Managed by OSS Test Policy

@can-anyscale
Collaborator Author

CI test linux://rllib:TestLearnerGroupAsyncUpdate is flaky. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/6838#0192fd4c-4d05-42fd-a7a8-19615e213e9a
- https://buildkite.com/ray-project/postmerge/builds/6833#0192faef-fe02-4a25-893f-4b2c2f4c4af8
- https://buildkite.com/ray-project/postmerge/builds/6809#0192ef24-2094-47fd-93d1-394aba6c1527

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END
Managed by OSS Test Policy

@can-anyscale
Collaborator Author

CI test linux://rllib:TestLearnerGroupAsyncUpdate is flaky. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/6935#01932494-27bd-4c7b-81d6-345a935772d6
- https://buildkite.com/ray-project/postmerge/builds/6926#01932385-2bd0-4d01-94ea-0ef48cfe0bd9
- https://buildkite.com/ray-project/postmerge/builds/6913#01931d0d-53ad-4618-aab0-34885d52a640

DataCaseName-linux://rllib:TestLearnerGroupAsyncUpdate-END
Managed by OSS Test Policy
