New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CI] DatafeedJobsRestID: Cancelled recovery not cleaning up #100589
Labels
blocker
:Distributed/Recovery
Anything around constructing a new shard, either from a local or a remote source.
Team:Distributed
Meta label for distributed team
>test-failure
Triaged test failures from CI
Comments
DaveCTurner
added
:Distributed/Recovery
Anything around constructing a new shard, either from a local or a remote source.
>test-failure
Triaged test failures from CI
blocker
labels
Oct 10, 2023
Pinging @elastic/es-distributed (Team:Distributed) |
DaveCTurner
added a commit
to DaveCTurner/elasticsearch
that referenced
this issue
Oct 10, 2023
`IndexShard#markAllocationIdAsInSync` is interruptible because it may block the thread on a monitor waiting for the local checkpoint to advance, but we lost the ability to interrupt it on a recovery cancellation in elastic#95270. Closes elastic#96578 Closes elastic#100589
DaveCTurner
added a commit
to DaveCTurner/elasticsearch
that referenced
this issue
Oct 11, 2023
`IndexShard#markAllocationIdAsInSync` is interruptible because it may block the thread on a monitor waiting for the local checkpoint to advance, but we lost the ability to interrupt it on a recovery cancellation in elastic#95270. Closes elastic#96578 Closes elastic#100589
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
blocker
:Distributed/Recovery
Anything around constructing a new shard, either from a local or a remote source.
Team:Distributed
Meta label for distributed team
>test-failure
Triaged test failures from CI
Although it's an ML test suite, the problems all look to be because an index was deleted while a recovery was ongoing and but the recovery task never goes away.
Build scan:
https://gradle-enterprise.elastic.co/s/e3zjigozbae3y/tests/:x-pack:plugin:ml:qa:native-multi-node-tests:javaRestTest/org.elasticsearch.xpack.ml.integration.PyTorchModelIT/testInferWithMultipleDocs
Reproduction line:
Applicable branches:
main
Reproduces locally?:
Didn't try
Failure history:
https://gradle-enterprise.elastic.co/scans/tests?tests.container=org.elasticsearch.xpack.ml.integration.PyTorchModelIT&tests.test=testInferWithMultipleDocs
Failure excerpt:
The text was updated successfully, but these errors were encountered: