Prometheus not recovering gracefully after disk fill event #4194
hoffie commented May 26, 2018
Thanks @hoffie, those look to already cover this.
brian-brazil closed this Jun 13, 2018
krasi-georgiev referenced this issue Sep 19, 2018
Closed: Fatal error handling (when writes to wal file fail) #247
lock bot commented Mar 22, 2019
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
lock bot locked and limited conversation to collaborators Mar 22, 2019
xginn8 commented May 24, 2018 • edited
Bug Report
The partition containing Prometheus data (as set by --storage.tsdb.path) filled, and subsequent writes failed with "no space left on partition". This condition persists until the process is restarted, but the /-/healthy endpoint and HTTP API stay up and keep reporting "Prometheus is Healthy."
What did you expect to see?
After space is freed on the partition, Prometheus should resume writing data to it.
What did you see instead? Under which circumstances?
Even after clearing space, Prometheus continues to fail all writes with the same error message:
Environment
Tested against Prometheus 2.2.0, 2.2.1, and HEAD.