Prometheus stuck in repairing block #4003
Comments
dannyk81 commented Mar 29, 2018
Just wondering if you ever got around this? I have a few servers running v2.1.0 that I was planning to upgrade to v2.2.1, but now kind of getting second thoughts after reading this...
I upgraded 3 instances and only one was stuck. I had to wipe the data to get it up again. However, the other 2 were repaired and up and running quite fast. Didn't observe storage issues since then.
dannyk81 commented Mar 31, 2018
@auhlig thanks for the update, still haven't upgraded our instances, figured I'll take a snapshot first, just in case
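For anyone else wanting a safety net before upgrading: the comment above doesn't say what kind of snapshot was meant (it could just as well be a disk or VM snapshot). One option is the TSDB snapshot endpoint of the Prometheus admin API. A rough sketch, assuming the server was started with `--web.enable-admin-api`; the `http://localhost:9090` address and the helper names are made up for illustration:

```python
import json
import urllib.request


def snapshot_url(base_url: str) -> str:
    """Build the TSDB snapshot endpoint URL (admin API must be enabled)."""
    return base_url.rstrip("/") + "/api/v1/admin/tsdb/snapshot"


def parse_snapshot_name(body: bytes) -> str:
    """Extract the snapshot directory name from the API's JSON response."""
    payload = json.loads(body)
    if payload.get("status") != "success":
        raise RuntimeError(f"snapshot request failed: {payload}")
    return payload["data"]["name"]


def take_snapshot(base_url: str, timeout: float = 60.0) -> str:
    """POST to the snapshot endpoint and return the snapshot directory name."""
    req = urllib.request.Request(snapshot_url(base_url), method="POST")
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return parse_snapshot_name(resp.read())


if __name__ == "__main__":
    # Hypothetical address; replace with your own server.
    print(take_snapshot("http://localhost:9090"))
```

The returned name should correspond to a directory under `snapshots/` in the Prometheus data directory, which can then be copied somewhere safe before the upgrade.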
gouthamve added the component/local storage label May 9, 2018
dannyk81 commented May 15, 2018
Upgraded our Prometheus fleet (12 servers) to v2.2.1 (from v2.1.) and didn't encounter this issue.
I'm closing this issue as it seems to have been a flaky error. Feel free to reopen if this is still a problem for you.
simonpasquier closed this Aug 7, 2018
Haven't seen it since we started running v2.2.3.
thanks for the heads-up @auhlig
Hashfyre commented Feb 4, 2019
We are seeing this again: #4324 (comment)
auhlig commented Mar 23, 2018
What did you do?
Upgraded Prometheus from v2.1.0 to v2.2.1.
What did you expect to see?
Prometheus up and running after repairing broken data on 1st start.
What did you see instead? Under which circumstances?
Logs (see below) show it does repair some blocks but then stops. It stayed like that for 30+ min and is still neither accessible nor responsive.
Environment
System information:
Linux 4.14.19-coreos x86_64
Prometheus version:
Prometheus configuration file:
Can be found here