-
Notifications
You must be signed in to change notification settings - Fork 736
Instance down #24
Comments
Node exporter and cadvisor are running on each Swarm node, so you can configure an alert for |
I don't think it is effective enough. As the value 0 of that certain node-exporter will not be present for long. Also, it shows only the instance IP and not the node_name.. I tried grouping it with node_name but it will not show up at all please see photos below |
You can use IF absent(node_meta) FOR 5m |
Hi @stefanprodan , what should be the expected value on the The photo below is what returned when I intentionally downed my swarm-node-2 |
@Dean-Christian-Armada , I am also facing the same problem. I want to create a rule whenever a node is down. |
@abhisheks-cuelogic , "Container down", you mean if you have a python container that went down then it will alert? I don't think it's possible with the container part. Prometheus needs node-exporter or other scraping like tool to determine metrics. Unless, there is an agent that can be installed inside the container to determine if it went down. |
@stefanprodan , we need your advise. |
Have you ever tried creating a rule like if the node went down then it will throw an alert?
The text was updated successfully, but these errors were encountered: