-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Flaky integration tests with gRPC errors on close #228
Comments
Does the test in question hang? |
It shouldn't be possible by contract of our resource handling protocol that something would try to still use a grpc connection that is already closed from what I recall. If this is use-after-close then it's definitely a bug. |
It'd failed after 32s, so unlikely. I have a strong suspicion that there might be a problem where the main thread finishes before the gRPC thread due to In this test we are starting From reading the code, it looks like the problem is in the closing logic itself, so not the typical use-after-close, but more like, use-during-close We might want to just upgrade the gRPC to latest and see what happens. |
In case of main thread finishing due to sigterm/sigkill you wouldn't see that error because a) we don't handle signals to trigger resource closing logic, b) this exception is thrown on the main thread. It's actually quite possible that it is a bug in resource handling logic. Do we have any way to replicate this? |
We might be missing I was unsuccessful in replicating it locally, or replicating reliably in CI. |
Oh, yeah, that might do it. |
This might help, esp. the SIGINT handling: #245 |
…therefore breaking the ordering guarantees by Result.resource
We are experiencing failing gRPC shutdown form our
core
SDK in integration tests, e.g.:source: https://github.com/VirtusLab/besom/actions/runs/6432539960/job/17467662339?pr=227
Looks similar to this old issue: grpc/grpc-java#7752
In short, an exception is being thrown while closing gRPC internal executor when its status in "not OK" at the point in time when the code expects it to be "OK" - I don't have enough context to understand it fully at this point.
The text was updated successfully, but these errors were encountered: