[Feature] Auto-recover if database directory is locked #124

amshuman-kr · 2020-12-03T10:28:54Z

Feature (What you would like to be added):
In some infrastructures (Azure), abnormal termination of the etcd container/pod leaves the database directory lock unreleased. When the etcd container restarts, this causes backup-restore to hang while opening the database for verification.

We should try to detect this scenario and try to recover from it automatically.

Motivation (Why is this needed?):
This happens rarely (so far only a couple of times, in Azure) but requires manual intervention. Typically, a pod restart resolves the issue, but we should try to automate this.

Approach/Hint to implement the solution (optional):
Typically, a pod restart resolves the issue.

amshuman-kr added the kind/enhancement Enhancement, improvement, extension label Dec 3, 2020

gardener-robot added the lifecycle/stale Nobody worked on this for 6 months (will further age) label Sep 22, 2021

gardener-robot added lifecycle/rotten Nobody worked on this for 12 months (final aging stage) and removed lifecycle/stale Nobody worked on this for 6 months (will further age) labels Mar 24, 2022

abdasgupta added priority/4 Priority (lower number equals higher priority) priority/3 Priority (lower number equals higher priority) and removed priority/4 Priority (lower number equals higher priority) labels Jan 9, 2023
