Add/Identify livenessProbe for argocd-application-controller
Motivation
argocd-application-controller became unresponsive and needed to be restarted in #18334. Argo CD should self-heal where possible, without manual intervention.
Proposal
I am unsure of the best approach for adding a livenessProbe. We could copy the existing readinessProbe, which uses the metrics endpoint on port 8082, but that does not seem like a viable long-term solution.
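For discussion, copying the readiness check into a liveness probe might look roughly like the fragment below. This is only a sketch: the `/healthz` path is my assumption about what the existing readinessProbe targets, and every timing value is a hypothetical placeholder rather than a tested setting.

```yaml
# Sketch of a livenessProbe for the argocd-application-controller container.
# The path and all timing values are assumptions, not verified settings.
livenessProbe:
  httpGet:
    path: /healthz          # assumed to match the existing readinessProbe
    port: 8082              # metrics port mentioned above
  initialDelaySeconds: 30   # hypothetical: allow time for startup
  periodSeconds: 30         # hypothetical
  timeoutSeconds: 5         # hypothetical
  failureThreshold: 5       # hypothetical: kept high to avoid restarts on brief stalls
```

A generous `failureThreshold` and `periodSeconds` would at least reduce the chance of the kubelet restarting the controller while it is merely busy, though it does not address whether the metrics endpoint reflects real liveness.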
I am hesitant to force a livenessProbe without consulting the community. The Helm chart does not accept any livenessProbe inputs for the controller, so I am concerned that adding one may cause more problems.
My biggest fear is that the application-controller is restarted mid-process and disrupts the resources it manages. Any documentation on how the application-controller actually manages applications would be greatly appreciated and would also help with identifying a valid probe.