No more metrics when remote storage queue full #2486
goettl79 changed the title from "No more metrics collected wehen Remote storage queue full" to "No more metrics when remote storage queue full" on Mar 9, 2017
Hi @goettl79 - a few questions:
The remote storage client is very decoupled from Prometheus' own storage - it doesn't ever give feedback to the scrape loop - see https://github.com/prometheus/prometheus/blob/master/storage/remote/remote.go#L83. I'd be surprised if this is causing Prometheus to stop scraping. I suspect the system is getting overwhelmed by something else, and one of the symptoms is that it can't flush samples to remote storage quickly enough.
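To illustrate the kind of decoupling described above, here is a minimal sketch of a bounded queue whose producer never blocks: when the queue is full, new samples are dropped instead of stalling the caller. This is not the code from remote.go; the names `sample`, `boundedQueue`, `newBoundedQueue` and `Append` are hypothetical and chosen only for this example.

```go
package main

import "fmt"

// sample is a hypothetical stand-in for a scraped metric sample.
type sample struct {
	name  string
	value float64
}

// boundedQueue is a hypothetical fixed-capacity queue sitting between a
// scrape loop (producer) and a remote-write sender (consumer).
// The dropped counter is not synchronized, so this sketch assumes a
// single producer goroutine.
type boundedQueue struct {
	ch      chan sample
	dropped int
}

func newBoundedQueue(capacity int) *boundedQueue {
	return &boundedQueue{ch: make(chan sample, capacity)}
}

// Append enqueues s if there is room and drops it otherwise.
// It never blocks, so a stalled consumer cannot slow down the caller.
func (q *boundedQueue) Append(s sample) bool {
	select {
	case q.ch <- s:
		return true
	default:
		// Queue full: discard the sample rather than wait.
		q.dropped++
		return false
	}
}

func main() {
	q := newBoundedQueue(3)
	// Simulate a scrape loop producing samples while nothing drains the queue.
	for i := 0; i < 5; i++ {
		q.Append(sample{name: "up", value: float64(i)})
	}
	fmt.Printf("queued=%d dropped=%d\n", len(q.ch), q.dropped)
}
```

With a design like this, a full queue costs the remote side some samples but gives the producer no back-pressure, which is why a full remote storage queue alone would not be expected to stop scraping.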
Indeed, I verified locally that just having a full storage queue does not cause ingestion to get throttled by itself. Is the machine generally overloaded (or even swapping) perhaps?
Regrettably, we have deleted the Prometheus logs by now. The machine didn't have a high load. We generally do not use a swap file, so a process is either working or gets killed by the OOM killer. However, the remote wasn't fully down: TCP sockets could be opened, as far as I remember, but no data could be sent. Nginx reverse proxy...
If you have further debugging information, please reopen.
brian-brazil closed this on Mar 27, 2017
lock bot commented Mar 23, 2019
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
goettl79 commented Mar 9, 2017 (edited)
What did you do?
When the remote_write service runs into problems, Prometheus no longer delivers query results.
What did you expect to see?
I'd have expected Prometheus to behave as usual even if remote_write fails, so I'd have expected to see a normal Prometheus Grafana dashboard.
What did you see instead? Under which circumstances?
No querying of data from Prometheus was possible when the remote storage queue was full. Scrape duration went foo.
Environment
Official Docker container for Prometheus v1.5.2 on Docker on CoreOS
System information:
Linux 4.7.3-coreos-r3 x86_64
Prometheus version: