-
Notifications
You must be signed in to change notification settings - Fork 72
Closed
Description
Hi guys,
we discovered a state in one of our clusters managed by the operator, where
- 3 agencies running
- 2 primaries are stuck in Terminating state (the finalizers are keeping the pods from being deleted)
- 1 primary is running, but couldn't sync with the error:
WARNING [de0be] {httpclient} retrying failed HTTP request for endpoint 'tcp://hugo-plus-arango-dbserver-ps3uajld.hugo-plus-arango-int.hugoplus.svc:8529' for replication applier in database 'staging'
All Coordinators are contineously restarting due to no primary available. The cluster is not available.
Arango-Version is 3.5.1, Operator Version is 0.4.4.
Do you need more information? How can I recover the cluster from this state?
Thanks in advance for your help!
Metadata
Metadata
Assignees
Labels
No labels