Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve Grafana panels regarding Request latency #5477

Merged
merged 3 commits into from Feb 25, 2022

Conversation

istvanballok
Copy link
Contributor

How to categorize this PR?

/area monitoring
/kind enhancement

What this PR does / why we need it:

Improve Grafana panels regarding Request latency

Which issue(s) this PR fixes:
Fixes #

Special notes for your reviewer:

Release note:

Improve Grafana panels regarding Request latency

istvanballok and others added 2 commits February 22, 2022 16:17
The subresources like log, portforward and exec might have a longer
request duration by design and that does not indicate an issue in the
api server.

These subresources are filtered out from this graph.

Co-authored-by: Wesley Bermbach <wesley.bermbach@sap.com>
Co-authored-by: Istvan Zoltan Ballok <istvan.zoltan.ballok@sap.com>
The panel used an incorrect query before: it applied the
rate function to the apiserver_latency_seconds:quantile metric.

The apiserver_latency_seconds:quantile is the average latency of the
requests, in seconds - the rate function is not meaningful in that case.

The latency values can not be summed up in a meaningful way,
but we can pick the maximum value for plotting on the panel.

Co-authored-by: Wesley Bermbach <wesley.bermbach@sap.com>
Co-authored-by: Istvan Zoltan Ballok <istvan.zoltan.ballok@sap.com>
@istvanballok istvanballok requested a review from a team as a code owner February 22, 2022 15:32
wyb1
wyb1 previously approved these changes Feb 22, 2022
@gardener-robot gardener-robot added area/monitoring Monitoring (including availability monitoring and alerting) related kind/enhancement Enhancement, improvement, extension needs/lgtm size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Feb 22, 2022
…etes-control-plane-status-dashboard.json

Co-authored-by: Ismail Alidzhikov <i.alidjikov@gmail.com>
Copy link
Member

@ialidzhikov ialidzhikov left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm
/squash

@rfranzke rfranzke merged commit be6feb2 into gardener:master Feb 25, 2022
krgostev pushed a commit to krgostev/gardener that referenced this pull request Apr 21, 2022
* Exclude some subresources from the Request latency panel

The subresources like log, portforward and exec might have a longer
request duration by design and that does not indicate an issue in the
api server.

These subresources are filtered out from this graph.

Co-authored-by: Wesley Bermbach <wesley.bermbach@sap.com>
Co-authored-by: Istvan Zoltan Ballok <istvan.zoltan.ballok@sap.com>

* Fix the request latency panel

The panel used an incorrect query before: it applied the
rate function to the apiserver_latency_seconds:quantile metric.

The apiserver_latency_seconds:quantile is the average latency of the
requests, in seconds - the rate function is not meaningful in that case.

The latency values can not be summed up in a meaningful way,
but we can pick the maximum value for plotting on the panel.

Co-authored-by: Wesley Bermbach <wesley.bermbach@sap.com>
Co-authored-by: Istvan Zoltan Ballok <istvan.zoltan.ballok@sap.com>

* Update charts/seed-monitoring/charts/grafana/dashboards/owners/kubernetes-control-plane-status-dashboard.json

Co-authored-by: Ismail Alidzhikov <i.alidjikov@gmail.com>

Co-authored-by: Wesley Bermbach <wesley.bermbach@sap.com>
Co-authored-by: Ismail Alidzhikov <i.alidjikov@gmail.com>
krgostev pushed a commit to krgostev/gardener that referenced this pull request Jul 5, 2022
* Exclude some subresources from the Request latency panel

The subresources like log, portforward and exec might have a longer
request duration by design and that does not indicate an issue in the
api server.

These subresources are filtered out from this graph.

Co-authored-by: Wesley Bermbach <wesley.bermbach@sap.com>
Co-authored-by: Istvan Zoltan Ballok <istvan.zoltan.ballok@sap.com>

* Fix the request latency panel

The panel used an incorrect query before: it applied the
rate function to the apiserver_latency_seconds:quantile metric.

The apiserver_latency_seconds:quantile is the average latency of the
requests, in seconds - the rate function is not meaningful in that case.

The latency values can not be summed up in a meaningful way,
but we can pick the maximum value for plotting on the panel.

Co-authored-by: Wesley Bermbach <wesley.bermbach@sap.com>
Co-authored-by: Istvan Zoltan Ballok <istvan.zoltan.ballok@sap.com>

* Update charts/seed-monitoring/charts/grafana/dashboards/owners/kubernetes-control-plane-status-dashboard.json

Co-authored-by: Ismail Alidzhikov <i.alidjikov@gmail.com>

Co-authored-by: Wesley Bermbach <wesley.bermbach@sap.com>
Co-authored-by: Ismail Alidzhikov <i.alidjikov@gmail.com>
@istvanballok istvanballok deleted the request-latency branch July 22, 2022 10:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/monitoring Monitoring (including availability monitoring and alerting) related kind/enhancement Enhancement, improvement, extension size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

8 participants