Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

crash recovery: Deal with un-open-able LevelDBs archived_fingerprint_to_timerange and archived_fingerprint_to_metric #2210

Closed
beorn7 opened this Issue Nov 21, 2016 · 1 comment

Comments

Projects
None yet
1 participant
@beorn7
Copy link
Member

beorn7 commented Nov 21, 2016

Crash recovery deals properly with inconsistent data in the leveldb directories archived_fingerprint_to_timerange and archived_fingerprint_to_metric. However, in rare cases, the leveldb can be corrupted in a way that already opening it fails.

In that case, the whole crash recovery bails out.

Instead, we should nuke the respective LevelDBs and continue recovery as far as possible. In fact, nuking archived_fingerprint_to_timerange can be completely recovered (it just will take a long time because all archived time series have to be unarchived, note that this will require additional RAM and has to be taken into account for #2139). Nuking archived_fingerprint_to_metric will mean the loss of all archived series, but that's still better than losing everything.

@lock

This comment has been minimized.

Copy link

lock bot commented Mar 23, 2019

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

@lock lock bot locked and limited conversation to collaborators Mar 23, 2019

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
You can’t perform that action at this time.