New ruler alerts that ignore user-errors #4281

Merged on Jun 17, 2021 (6 commits)

@pstibrany pstibrany commented Jun 15, 2021

What this PR does: This PR adds new metrics that track the total number of queries and push requests to ingesters from rulers, as well as the number of failed queries and push requests. Unlike the existing cortex_prometheus_rule_evaluation_failures_total, the new failure counters only count "real" failures and not user errors (hitting limits, duplicate samples, etc.). The idea is to use these new metrics in alerts, so that we are not alerted on issues that users can already solve themselves.

This PR also moves errorTranslateQueryable from the api package to the querier package, for simpler reuse.

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

pstibrany and others added 5 commits June 16, 2021 08:58
Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>
Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>
Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>
Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>
Signed-off-by: Peter Štibraný <pstibrany@gmail.com>
@jtlisi jtlisi left a comment

LGTM

pkg/ruler/compat.go (outdated, resolved)
pkg/ruler/compat.go (outdated, resolved)
pkg/ruler/compat_test.go (resolved)
Signed-off-by: Peter Štibraný <peter.stibrany@grafana.com>
@pracucci pracucci left a comment

Good job! 👏

@pracucci pracucci merged commit fbf0677 into cortexproject:master Jun 17, 2021