rust agent crashes on shutdown #160
Comments
Hi @jodh-intel, from your description, did the panic occur here: https://github.com/kata-containers/kata-containers/blob/master/src/agent/src/grpc.rs#L1234 ?
Hi @lifupan - The problem actually seems to be the custom VSOCK grpc-rs/grpc-sys crate we're using: shutdown seems to be unreliable with it, but not with an upstream version. I now have a test program that can shut down its gRPC server cleanly using
I really don't want to have to fix this behaviour in our custom crate as it was only ever supposed to be a temporary fix. So, this is a good time to talk about two things:
@teawater - could you provide any update on grpc/grpc#21121?
Hi @jodh-intel, the ttrpc branch https://github.com/lifupan/kata-containers/commits/ttrpc is a PoC that we did
@lifupan Is the ttrpc code ready to be PR'd/merged? @bergwolf @gnawux Do you have any insights on the above? @jodh-intel is currently blocked on adding tracing support because of agent crashes seen on shutdown.
Fixes kata-containers#160 Signed-off-by: jinda.ljd <q8886888@qq.com>
This has been fixed. Please reopen if you still see it with the rust agent @jodh-intel
Some architectures and setups do not support DIMM/NUMA. However, they can still use memory backends, provided a memory backend of the same ID is specified under -machine. This was introduced in QEMU 5.0. Enable this functionality in appendMemoryKnobs. Fixes: kata-containers#160 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>
tarfs reliability fixes
Tracking issue for a problem I'm seeing related to adding tracing (which requires a clean shutdown):
This problem is seen calling destroySandbox on master and appears to be caused by a thread (created by a dependent crate?) failing when operating on an mpsc channel. Debugging is complicated by the fact that the crash is random (threads ;) and the generated backtrace is not super-helpful:

@lifupan - I'll add more specific details to this issue on Monday but any thoughts? Can you recreate?
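
For anyone trying to recreate: one general way a thread can fail on an mpsc channel during shutdown is sending after the receiving side has already been dropped. The sketch below uses only std::sync::mpsc and is not taken from the agent or from grpc-rs; the names worker, send_after_receiver_dropped and run_worker_during_shutdown are hypothetical, and whether this is the actual cause of the crash here is an assumption.

```rust
use std::sync::mpsc;
use std::thread;

/// Hypothetical worker: tries to send `count` messages and stops quietly
/// once the receiver has gone away. Returns how many sends succeeded.
fn worker(tx: mpsc::Sender<u32>, count: u32) -> u32 {
    let mut sent = 0;
    for i in 0..count {
        // send() returns Err(SendError) once the Receiver has been dropped.
        // Calling unwrap() on that result would panic this thread instead.
        if tx.send(i).is_err() {
            break;
        }
        sent += 1;
    }
    sent
}

/// Shows that a send fails once the receiving end is gone.
fn send_after_receiver_dropped() -> bool {
    let (tx, rx) = mpsc::channel::<u32>();
    drop(rx);
    tx.send(1).is_err()
}

/// Simulates a shutdown that tears down the receiver before the worker runs.
fn run_worker_during_shutdown() -> u32 {
    let (tx, rx) = mpsc::channel::<u32>();
    drop(rx); // assumption: "shutdown" drops the receiving side first
    let handle = thread::spawn(move || worker(tx, 3));
    handle.join().expect("worker should not panic")
}

fn main() {
    assert!(send_after_receiver_dropped());
    println!(
        "messages sent after shutdown: {}",
        run_worker_during_shutdown()
    );
}
```

If a dependent crate unwraps the result of send() (or recv()) on a channel whose other end is torn down first, the panic would depend on thread timing, which would fit a crash that only shows up randomly.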