Join GitHub today
GitHub is home to over 31 million developers working together to host and review code, manage projects, and build software together.
Sign upPrometheus fills up disk after restart #2542
Comments
This comment has been minimized.
This comment has been minimized.
|
If series are quarantined during normal operation, they end up in the A possible explanation for that "temporary deadlock" might be the following: A busy Prometheus server is stressing your disk quite a bit. Also, an SSD device that is almost full (like in your case) degrades dramatically in performance. If those two things come together, your SSD might lock up for minutes. (On Linux, This doesn't look like a bug, just like an operational issue. It makes most sense to discuss problems like this on the prometheus-users mailinglist rather than in a GitHub issue. In that way, more people are available to help you, and others can benefit more easily from presented solution. |
beorn7
closed this
Apr 2, 2017
This comment has been minimized.
This comment has been minimized.
|
Thanks for the response I would almost guarantee this is an operational issue. I'll try my luck on the mailing list thanks. |
This comment has been minimized.
This comment has been minimized.
lock
bot
commented
Mar 23, 2019
|
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
michaeljs1990 commentedMar 28, 2017
•
edited
Restarted prometheus after seeing that the main prometheus box has entered rush mode with a score of one for the period of about a minute then dropped back down to our configured 0.8... then instantly following this prometheus_local_storage_indexing_queue_length started growing resulting in a value of 16k before restarting. After restarting 600GB of data was filled on disk and the logs below occurred.
Environment
System information:
Linux 3.13.0-105-generic x86_64
Prometheus version:
prometheus, version 1.0.0 (branch: v1.0.0-marathon-auth, revision: 710c7da)
build user: root@967a46ea24e5
build date: 20160728-18:51:38
go version: go1.6.2
Logs:
I am unsure of why so much data was used from disk it also seems like a deadlock occured as prometheus was out of rush mode and everything returned to normal after the 1 minute spike where it went into rush mode however the queue for indexing just kept growing.