Skip to content

Separate kube-apiserver-availability.rules into own group (w/ 3min interval) #403

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Conversation

metalmatze
Copy link
Member

Recording Rules

First, I've added a more explicit code_verb:apiserver_request_total:increase30d recording rule for which we now iterate over for code in ['2..', '3..', '4..', '5..'] and for verb in ['LIST', 'GET', 'POST', 'PUT', 'PATCH', 'DELETE'] evaluating these individual combinations on their own. This will add 24 more recording rules, but nicely split those up :)

photo_2020-04-26_17-06-38

The 3min evaluation interval has the nice side effect of really saving some CPU again (as reported by others resulting in higher usage, hi @smoke)

Screenshot from 2020-04-27 10-25-14

Additionally, I also had to fix the scope=~"resource|" for Kubernetes 1.18+, as the scope="resource" doesn't seem to exist anymore.


Dashboard

The dashboard need some cleaning up and I put a bit time into it:
Screenshot from 2020-04-27 12-17-03

The biggest change here would be that both Read and Write SLI - Requests panels now show the requests by (code). I've deleted a few older and now duplicate panels too.
Each panel now has a description explaining a bit about what they show.

/cc @brancz @povilasv @jamie-34254 @smoke @anyname2 @ekeih @billimek

Closes #394 prometheus-operator/kube-prometheus#503 helm/charts#22003

…val)

Additionally I also had to fix the scope=~"resource|" for Kubernetes
1.18+, as the scope="resource" doesn't seem to exist anymore.
@metalmatze metalmatze requested review from brancz and povilasv April 27, 2020 10:23
@metalmatze metalmatze changed the base branch from master to release-0.4 April 27, 2020 10:25
@brancz
Copy link
Member

brancz commented Apr 28, 2020

nice 👍 lgtm

Copy link

@smoke smoke left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍 lgtm!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

SLO dashboards/alerts are not working in 1.18+
3 participants