Add monitoring for API Server Watches #2743

Merged
rfranzke merged 1 commit into gardener:master on Aug 21, 2020

Contributor

@wyb1 wyb1 commented Aug 18, 2020

How to categorize this PR?

/area monitoring
/kind enhancement
/priority normal

What this PR does / why we need it:

Adds the metric shoot:apiserver_watch_duration:quantile and a dashboard that shows the duration of API server watches per resource. These metrics help identify resources with troublesome watches: a watch should normally last a long time, so a resource with many very short watches may point to an issue.
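To illustrate the idea behind the metric, here is a minimal offline sketch in Python. It is not the recording rule or dashboard query from this PR (those are not shown in the conversation). It only mimics the reasoning: group watch durations by resource, take a quantile, and flag resources whose watches tend to be short. The function names, the sample data, and the 60-second threshold are all hypothetical.

```python
import math
from collections import defaultdict


def quantile(values, q):
    """Nearest-rank quantile of a non-empty list, for 0 < q <= 1."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q * len(ordered)))
    return ordered[rank - 1]


def short_watch_resources(samples, q=0.5, min_seconds=60.0):
    """Return {resource: quantile} for resources whose q-quantile
    watch duration falls below min_seconds (a hypothetical threshold)."""
    by_resource = defaultdict(list)
    for resource, seconds in samples:
        by_resource[resource].append(seconds)

    flagged = {}
    for resource, durations in by_resource.items():
        value = quantile(durations, q)
        if value < min_seconds:
            flagged[resource] = value
    return flagged


# Made-up (resource, watch duration in seconds) pairs for illustration only.
samples = [
    ("pods", 1800.0), ("pods", 2400.0), ("pods", 2100.0),
    ("configmaps", 2.0), ("configmaps", 5.0), ("configmaps", 1900.0),
]

print(short_watch_resources(samples))  # {'configmaps': 5.0}
```

In this made-up data the median watch on pods lasts 35 minutes, while the median watch on configmaps lasts 5 seconds, so only configmaps is flagged as a resource worth investigating.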

Which issue(s) this PR fixes:
Fixes #

Special notes for your reviewer:

Release note:

Add monitoring for API Server Watches

@wyb1 wyb1 requested a review from a team as a code owner August 18, 2020 12:59
@gardener-robot gardener-robot added area/monitoring Monitoring (including availability monitoring and alerting) related kind/enhancement Enhancement, improvement, extension priority/normal labels Aug 18, 2020
Contributor Author

wyb1 commented Aug 18, 2020

/invite @timebertt

@timebertt
Member

Thank you very much for the PR!
I will check it out in detail tomorrow :)

Member

@timebertt timebertt left a comment

Very nice PR!
/invite @istvanballok

@timebertt timebertt self-assigned this Aug 20, 2020
Contributor Author

wyb1 commented Aug 20, 2020

Thanks for the review @timebertt! I've addressed the comments. PTAL

timebertt previously approved these changes Aug 20, 2020
Member

@timebertt timebertt left a comment

Thanks! Great work, I like it :)
/lgtm

@timebertt
Member

/retest

@gardener-robot

@timebertt Command /retest is not known.

@timebertt
Member

/ok-to-test

* Add metric shoot:apiserver_watch_duration:quantile
* Add dashboard to show length of API Server watches based on resource
Contributor

@istvanballok istvanballok left a comment

/lgtm
Kudos! 👍

Contributor Author

wyb1 commented Aug 21, 2020

Added thresholds to the panels as visual cues for which values are acceptable.

Member

@timebertt timebertt left a comment

Good idea!
/lgtm

@rfranzke rfranzke merged commit a5ad7c9 into gardener:master Aug 21, 2020
@wyb1 wyb1 deleted the add-apiserver-watch-metrics branch August 21, 2020 12:39

9 participants