Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pause on ephemeral agent breaks #2012

Closed
jorros opened this issue Oct 18, 2023 · 3 comments · Fixed by #2459
Closed

pause on ephemeral agent breaks #2012

jorros opened this issue Oct 18, 2023 · 3 comments · Fixed by #2459
Labels
bug Something isn't working user

Comments

@jorros
Copy link

jorros commented Oct 18, 2023

Bug Description

It seems that the container is not unpausing after stopping the debug session (using Rider). The first time it works (ephemeral agent + pause), but any subsequent attempt fails

Steps to Reproduce

  1. Run mirrord with pause and ephemeral agent, it should work the first time
  2. Run it again, this time it should break and will only work again once the pod has been restarted

Backtrace

"Timeout requesting for target container pause.",

�[2m2023-10-18T11:46:09.106207Z�[0m �[31mERROR�[0m �[2mmirrord_kube::api�[0m�[2m:�[0m Daemon sent log message Failed to pause target container: PauseFailedCgroup(CgroupV2Error(NoPidsFoundInCgroup))

Relevant Logs

No response

Your operating system and version

macOS 14

Local process

dotnet

Local process version

No response

Additional Info

No response

@jorros jorros added the bug Something isn't working label Oct 18, 2023
@aviramha
Copy link
Member

This happened to me on docker runtime in CI but not on containerd - I assume it's something related to the Host OS + container runtime. In the case of what I had in CI - we had a race condition where our cgroup was cleaned up by the host cgroup manager since we had a "rogue" one. This is quite tricky but I'd guess the first step is to reproduce it.

@aviramha
Copy link
Member

btw, just to verify, you used the privileged flag right?

@jorros
Copy link
Author

jorros commented Oct 19, 2023

Yeah, running without that gives me an error about missing access rights.

@eyalb181 eyalb181 added the user label Dec 25, 2023
This was referenced May 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working user
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants