Join GitHub today
GitHub is home to over 31 million developers working together to host and review code, manage projects, and build software together.
Sign upadd metric for alertrule template rendering failure #4634
Comments
This comment has been minimized.
This comment has been minimized.
|
@simonpasquier I would like to help about this issue. I think, a metric like |
This comment has been minimized.
This comment has been minimized.
|
@mucahitkurt sure you're more than welcome! You're mostly correct. Maybe we can limit the counter to |
mucahitkurt
added a commit
to mucahitkurt/prometheus
that referenced
this issue
Oct 16, 2018
mucahitkurt
referenced this issue
Oct 19, 2018
Merged
add alert template expanding failure metric #4747
brian-brazil
closed this
in
#4747
Nov 6, 2018
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
juliantaylor commentedSep 19, 2018
The alertrules can have templated values which can be filled out based on metric labels for summaries.
This template rendering can fail due to typos in the template. In that case no alert is sent to the alertmanager which can be a major problem.
E.g. forgetting the
.Labelscauses:There should be a metric that is increased on rule template rendering failures similar to prometheus_rule_evaluation_failures_total and prometheus_notifications_errors_total so one can alert on that failure instead.
We are currently using prometheus 2.3.2.