argoCD resource events impacts to etcd db size #10529

daro1337 · 2022-09-06T13:46:57Z

Checklist:

I've searched in the docs and FAQ for my answer: https://bit.ly/argocd-faq.
I've included steps to reproduce the bug.
I've pasted the output of argocd version.

Describe the bug

I had a network problem to the kubernetes API (flaps), so argoCD applications got a timeout when trying to sync.
This led to constant changes in the status of the app, and I hadt housands of events like this:

kubectl get events -n argocd
...
45h         Normal    ResourceUpdated      application/some-app    Updated health status: Healthy -> Missing
45h         Normal    ResourceUpdated      application/some-app    Updated sync status: OutOfSync -> Unknown
45h         Normal    ResourceUpdated      application/some-app   Updated health status: Healthy -> Missing
45h         Normal    ResourceUpdated      application/some-app    Updated sync status: Unknown -> OutOfSync
45h         Normal    ResourceUpdated      application/some-app    Updated health status: Missing -> Healthy
45h         Normal    ResourceUpdated      application/some-app    Updated sync status: Unknown -> OutOfSync
45h         Normal    ResourceUpdated      application/some-app    Updated health status: Missing -> Healthy
45h         Normal    ResourceUpdated      application/some-app   Updated sync status: OutOfSync -> Unknown
...

I have like 200+ apps in my argoCD so it make scale and this leads to grow my etcd to 600MB+ in couple days and continued to grow.
I've made etcd snapshot and I checked where this data increase comes from. Because I have dedicated k8s cluster for argo it was easy to tell that issue is with argo. After inspecting etcd

To Reproduce

make network related issue so k8s API is flapping
argo will try to sync apps every 3min (default)
monitor etcd size

Expected behavior

argoCD should cleanup events resource because it can easily generate thousands of them

Workaround
As a workaround to restore etcd space:

kubectl delete events -n argocd --all -v10 --grace-period 0 --force
make standard etcd procedure (compact & defrag)

Screenshots
ETCD database size increase over time and decrease when I start cleaning up events

Version

v2.3.4

Logs

45h         Normal    ResourceUpdated      application/some-app    Updated health status: Healthy -> Missing
45h         Normal    ResourceUpdated      application/some-app    Updated sync status: OutOfSync -> Unknown
45h         Normal    ResourceUpdated      application/some-app   Updated health status: Healthy -> Missing
45h         Normal    ResourceUpdated      application/some-app    Updated sync status: Unknown -> OutOfSync
45h         Normal    ResourceUpdated      application/some-app    Updated health status: Missing -> Healthy
45h         Normal    ResourceUpdated      application/some-app    Updated sync status: Unknown -> OutOfSync
45h         Normal    ResourceUpdated      application/some-app    Updated health status: Missing -> Healthy
45h         Normal    ResourceUpdated      application/some-app   Updated sync status: OutOfSync -> Unknown

The text was updated successfully, but these errors were encountered:

daro1337 added the bug Something isn't working label Sep 6, 2022

rumstead mentioned this issue May 14, 2024

Option to disable writing k8s events #18205

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

argoCD resource events impacts to etcd db size #10529

argoCD resource events impacts to etcd db size #10529

daro1337 commented Sep 6, 2022

argoCD resource events impacts to etcd db size #10529

argoCD resource events impacts to etcd db size #10529

Comments

daro1337 commented Sep 6, 2022