Join GitHub today
GitHub is home to over 31 million developers working together to host and review code, manage projects, and build software together.
Sign upToo many warnings logged about remote storage queue being full #2177
Comments
This comment has been minimized.
This comment has been minimized.
|
I'm just putting together a PR for this. |
tomwilkie
referenced this issue
Feb 23, 2017
Merged
Limit 'discarding sample' logs to 1 every 10s #2446
juliusv
closed this
in
#2446
Feb 23, 2017
This comment has been minimized.
This comment has been minimized.
lock
bot
commented
Mar 23, 2019
|
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
lock
bot
locked and limited conversation to collaborators
Mar 23, 2019
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
jml commentedNov 8, 2016
What did you do?
Ran a Prometheus instance configured to use remote storage on a system with a broken DNS configuration.
I don't have the exact command-line on me. I can construct a likely equivalent if necessary.
What did you expect to see?
Error messages in the logs informing me that the remote host couldn't be discovered due to DNS lookup failures.
What did you see instead? Under which circumstances?
That is, there were 30000 times as many
Remote storage queue full, discarding samplemessages as there weredial tcp: lookup hostname on 10.96.0.10:53: dial udp 10.96.0.10:53: i/o timeoutfailures.This lead to the problem being misreported to me ("it says the queue is full") and made it harder than necessary to diagnose the root cause (i.e. that DNS was broken).
Environment
Ubuntu 16.04 LTS images running on EC2. Prometheus itself was running as a container in a Kubernetes cluster.
Alas, system has been wiped out now.
Alas, I didn't capture exact output, but it was reported to me as 1.3.1.
Can provide equivalent on request.
All I have left is the summary above.