-
-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Return error code when scrape fails #48
Comments
Could this lead to the container going into an 'unhealthy' state? I'm using this in a Pi-based internet monitoring setup, and it seems like a couple times a day the container just goes unhealthy, and the metrics endpoint returns nothing. In the logs, there are no errors that are output:
And running
(So you can see it's been a few hours it was down.) I'm monitoring a Starlink connection, so I'm wondering if maybe if the network goes away completely sometimes, it causes an unhandled exception or something (but even then... you'd think it would kill the flask app, not just keep it running doing nothing). |
Hey, @geerlingguy I don't know what to say eheh, I'm one of your subscribers. That's really good to see that you are using my project WOW.
Basically the check is just checking if the flask app is still working. When your exporter stops scraping, what is the status of exporter in Prometheus client? Are you using an specific server to do the tests? Thanks, |
Hey again, @geerlingguy can you try the Thanks, |
@MiguelNdeCarvalho - Thanks! I just updated the Pi to 3.1 after seeing the same dropout this morning. There are no additional logs (just ... stops it seems), and if I log into the container and run I will keep monitoring the speedtest container and see if it stays up longer this time. Note that I have another Pi running on my local cable Internet connection with the exact same config, and it keeps running (has been going two weeks now). So I wonder if something about the connection with Starlink (vs. Cable) is throwing the app for a loop (in some weird way that is not triggering an exception). |
Hey again @geerlingguy, Basically yeasterday @Doacola has deployed the stack that you have done in his Pi4 (32 Bits) and he didn't got that weird behaviour that you are getting in the Pi4 connected to the Starlink. Right now I have done some tests: 1st - Deployed exporter in docker and I have done the first trigger I think this should be related to the Thanks, Ps. I hope that you talk about how you are monitoring it in next video 😉 |
Everything is working fine now, so I'm going to close this |
https://prometheus.io/docs/instrumenting/writing_exporters/#failed-scrapes
The text was updated successfully, but these errors were encountered: