Add playbook for CortexRequestErrors and config option to exclude specific routes #338

Merged (2 commits) on Jun 23, 2021

pracucci
Collaborator

What this PR does:
In this PR I've added a config option to exclude specific routes from the CortexRequestErrors and CortexRequestLatency alerts. I've also added a playbook for CortexRequestErrors. The alert is very generic and can fire for many reasons. Any suggestions on which common cases the playbook should cover?
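
To illustrate what a route exclusion could look like in an alert query, here is a minimal Python sketch. It assumes the option is a list of route names joined into a negative regex matcher on the `route` label. The helper names and the simplified expression are hypothetical and are not taken from the mixin, and the route names in the usage note are examples only.

```python
def route_exclusion_matcher(excluded_routes):
    """Return a PromQL label matcher that excludes the given routes.

    Assumption: route names are plain identifiers (no regex metacharacters),
    so they can be joined with '|' without escaping.
    """
    if not excluded_routes:
        return ""
    return 'route!~"%s"' % "|".join(excluded_routes)


def request_errors_expr(excluded_routes):
    """Build a simplified 5xx-rate expression (hypothetical, not the real alert)."""
    matchers = ['status_code=~"5.."']
    exclusion = route_exclusion_matcher(excluded_routes)
    if exclusion:
        matchers.append(exclusion)
    return (
        "sum by (route) "
        "(rate(cortex_request_duration_seconds_count{%s}[1m]))" % ",".join(matchers)
    )
```

Calling `request_errors_expr(["debug_pprof"])` would add `route!~"debug_pprof"` to the selector, while an empty list leaves the selector unchanged.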

Which issue(s) this PR fixes:
N/A

Checklist

  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

…cific routes

Signed-off-by: Marco Pracucci <marco@pracucci.com>
pracucci requested a review from pstibrany on June 23, 2021 08:29
pracucci requested a review from a team as a code owner on June 23, 2021 08:29
@@ -355,10 +366,6 @@ WAL corruptions are only detected at startups, so at this point the WAL/Checkpoi
2. Equal or more than the quorum number but less than replication factor: There is a good chance that there is no data loss if it was replicated to desired number of ingesters. But it's good to check once for data loss.
3. Equal or more than the replication factor: Then there is definitely some data loss.

### CortexRequestErrors
Collaborator Author

Was duplicated.

Member

pstibrany left a comment

LGTM

CHANGELOG.md Outdated
@@ -19,6 +19,7 @@
* [ENHANCEMENT] cortex-mixin: Make `cluster_namespace_deployment:kube_pod_container_resource_requests_{cpu_cores,memory_bytes}:sum` backwards compatible with `kube-state-metrics` v2.0.0. #317
* [ENHANCEMENT] Added documentation text panels and descriptions to reads and writes dashboards. #324
* [ENHANCEMENT] Dashboards: defined container functions for common resources panels: containerDiskWritesPanel, containerDiskReadsPanel, containerDiskSpaceUtilization. #331
* [ENHANCEMENT] cortex-mixin: Added `alert_excluded_routes` config to exclude specific routes from alerts. #326
Member

PR is 338.

Signed-off-by: Marco Pracucci <marco@pracucci.com>
pracucci merged commit f83ae25 into main on Jun 23, 2021
pracucci deleted the playbook-for-request-errors branch on June 23, 2021 12:31
simonswine pushed a commit to grafana/mimir that referenced this pull request Oct 18, 2021
…or-request-errors

Add playbook for CortexRequestErrors and config option to exclude specific routes