-
Notifications
You must be signed in to change notification settings - Fork 1.2k
actions-runner min Pod is '0'. #4073
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hello! Thank you for filing an issue. The maintainers will triage your issue shortly. In the meantime, please take a look at the troubleshooting guide for bug reports. If this is a feature request, please review our contribution guidelines. |
Hey @oliverpark999, Your runner has failed more than 5 times, marking the runner as failed. You can see it from the log line:
The failed runners are counted in when calculating how much we need to scale. This is expected behavior, so I'm going to close the issue. You should investigate the reason for the failure. |
Hi @nikola-jokic , This issue needs to be discussed rather than just closed. The only "automatic" solution for this problem is to delete the failed EphemeralRunner via a CronJob: We also failed at the same time as everyone else, and it didn't recover until we manually deleted the failed |
Hey @DingGGu, It might have been the back-end issue, but ARC is still working as expected. Currently, by design, 5 failed attempts would cause ephemeral runner to reach the failed state and is left for further inspection. There are currently efforts to create a self-recovery for failed ephemeral runners that should be released in the 0.12.0. Regardless of that, from the ARC's perspective, 5 failed attempts would leave the ephemeral runner pod in the failed state, which is exactly what happened. We try to keep issues related to ARC itself here; that is why I closed it. |
Hi @nikola-jokic, Is there any way to find a hint as to why the EpmeralRunner failed 5 times? I can't find any related event in the EphemeralRunner or log of arc-gha-rs-controller or listener. |
It is usually in the reason field under the ephemeral runner's status. Otherwise, you probably need to inspect the Kubelet log. We copy the pod termination reason to the ephemeral runner status reason, so if it is not present in the pod, the ephemeral runner will not have it as well. |
Why was this issue closed? |
Did you find any solution? |
Uh oh!
There was an error while loading. Please reload this page.
Checks
Controller Version
0.11.0
Deployment Method
ArgoCD
Checks
To Reproduce
Describe the bug
I set the actions-runner min/max pod count as below.
at least 1 Runner Pod was always running,
but at some point, the Runner Pod is not visible, and when the build is run, the Runner Pod is created.
The version I'm using is '0.11.0' and I configured it with Helm.
Why is that?
Describe the expected behavior
There must be at
least one Runner Pod
running.Additional Context
Controller Logs
Runner Pod Logs
The text was updated successfully, but these errors were encountered: