[Feature] Auto-recover if database directory is locked #124
Labels
kind/enhancement
Enhancement, improvement, extension
lifecycle/rotten
Nobody worked on this for 12 months (final aging stage)
priority/3
Priority (lower number equals higher priority)
Feature (What you would like to be added):
In some infrastructures (Azure), abnormal termination of etcd container/pod leads to the database directory lock not being released and prevents the backup-restore to hang while opening the database for verification on etcd container restart.
We should try to detect this scenario and try to recover from it automatically.
Motivation (Why is this needed?):
This happens rarely (so far only a couple of times in Azure) but requires manual intervention. Typically, a pod restart resolves the issue. But we should try and automate this.
Approach/Hint to the implement solution (optional):
Typically, a pod restart resolves the issue.
The text was updated successfully, but these errors were encountered: