Join GitHub today
GitHub is home to over 31 million developers working together to host and review code, manage projects, and build software together.
Sign upPrometheus Alerting on Pods with HTTPS Scheme #4741
Comments
simonpasquier
added
the
kind/more-info-needed
label
Oct 16, 2018
This comment has been minimized.
This comment has been minimized.
|
You probably have a mismatch between your kubernetes annotations and the relabeling configuration. Can you share the Targets page? It looks as if this is actually a question about usage and not development. For the future, I suggest that you use our user mailing list, which you can also search. If you prefer more interactive help, join or our IRC channel, #prometheus on irc.freenode.net. Please be aware that our IRC channel has no logs, is not searchable, and that people might not answer quickly if they are busy or asleep. If in doubt, you should choose the mailing list. |
This comment has been minimized.
This comment has been minimized.
|
@simonpasquier Thanks for responding. Do you want a private email with a PDF of the targets page? I originally tried asking the question in the user mail list but never heard back from anyone. Here's my post in the user list. |
This comment has been minimized.
This comment has been minimized.
|
The correct link https://groups.google.com/d/msg/prometheus-users/UPCau6BB4AQ/Ex2tLYzyAgAJ ;-) |
This comment has been minimized.
This comment has been minimized.
|
Looks like the root cause is how Prometheus reports the uptime for these pods. I tested this by querying for pods via "up == 0". This does seem like a bug though I haven't dug into the code to solidify this theory but from a user perspective it's not working as one would think. For the time being, I've refactored my alerting to use this instead and so far so good: expr: kube_deployment_status_replicas_unavailable > 0 |
This comment has been minimized.
This comment has been minimized.
|
Closing for now as you've found an alternative approach. |
spirrello commentedOct 15, 2018
Bug Report
What did you do?
Apply a job to monitor pods over HTTP and HTTPS.
What did you expect to see?
To be able to monitor pods with HTTP and HTTPS endpoints without alerting when pods are up.
What did you see instead? Under which circumstances?
All pods that are using HTTPS for a metric endpoint are in an Up state but are alerting as down as well.
Environment
uname -srm
Linux 4.4.0-116-generic x86_64
prometheus, version 2.3.2 (branch: HEAD, revision: 71af5e2)
build user: root@5258e0bd9cc1
build date: 20180712-14:02:52
go version: go1.10.3
No logs indicating these pods are down.
Screenshots: