New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Expose metrics about resource requests and limits that represent the pod model #1748
Comments
/sig instrumentation |
Hey there @smarterclayton -- 1.19 Enhancements shadow here. I wanted to check in and see if you think this Enhancement will be graduating in 1.19? In order to have this part of the release:
The current release schedule is:
If you do, I'll add it to the 1.19 tracking sheet (http://bit.ly/k8s-1-19-enhancements). Once coding begins please list all relevant k/k PRs in this issue so they can be tracked properly. 👍 Thanks! |
I don't think we'll make implementable and merged by Tuesday, so should be targeted for 1.20 |
Hey @smarterclayton Thanks for confirming the inclusion state. I've marked the Enhancement as /milestone v1.20 |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
/remove-lifecycle stale |
Hi @smarterclayton ! Enhancements Lead here, do you still intend to target this for alpha in 1.20? Thanks! |
Yes, this is target alpha for 1.20 assuming we can close the remaining questions in the KEP |
Thanks Clayton!! As a reminder, by Enhancements Freeze (October 6th), KEPs must be:
Best, I also added the PR link to the Issue description we can update again once merged. |
Hi @smarterclayton 👋! I'm one of the Enhancement shadows for the 1.20 release cycle. This is a friendly reminder that the Enhancement freeze is on the 6th of October, i'm repeating the requirements needed by then:
Thanks! |
Thanks for the reminder, updated those. Will be working with the sig. |
The current PR looks complete from a enhancements freeze POV, we'll monitor to see if it merges in time. |
Enhancements Freeze is now in effect. Unfortunately, your KEP PR has not yet merged. If you wish to be included in the 1.20 Release, please submit an Exception Request as soon as possible. Best, |
/lifecycle frozen |
@dashpole @logicalhan do you happen to find some volunteers to continue the work? |
Oh man, we didn't take this to beta?! This is my fault. Let me talk to @dgrisonnet who pinged me about it a day ago - originally the delay was gathering feedback from admins doing capacity planning, and I had been working with a few people on leveraging it more widely. The use I was most familiar with was OpenShift and we replaced the dashboards that were using the (old, incorrect, not complete) kube-state-metrics for this - among the folks who did the change there was general agreement that the new metrics were superior and the cost of cardinality was worth it to replace the generally incorrect metrics from kube-state-metrics (at the time we felt that completely replicating the pod resource model code in ksm was not appropriate, and this was a better solution).. Next phase was getting community user input on building metric based capacity dashboards and whether the dimensions worked for the audience. I did a few analysis when planning out e2e CI runs and found the metrics provided better human visibility of comparing bulk "used vs requested". |
@Huang-Wei re:
The original intent was to allow admins to build capacity planning dashboards, and to pair the resource vs pod level resource metrics (like cpu, memory, etc). So the intent was very much to have a pod dimension. Do we have a proposal to remove or make optional pod level cpu consumption or memory consumption dimensions? If so, such a change would apply to this metric as well, but as this is already an optional endpoint for users who are concerned about cardinality. |
To clarify - this is in beta since 1.21 (#1748 (comment)). Was there some belief that it was not beta? It would be last step to go to GA, I'm happy to push that over the line with @dgrisonnet |
I also thought this was still in Alpha for some reason even though we have a label marking the stability 😅 Yet let's try to get this over the finish and gather some feedback from users to know if they encountered any issues with these new metrics. /assign @smarterclayton @dgrisonnet |
We already have a cardinality protection mechanism in Kubernetes: https://github.com/kubernetes/enhancements/tree/master/keps/sig-instrumentation/2305-metrics-cardinality-enforcement so users could already tweak the dimensions if needed. |
I opened kubernetes/kube-state-metrics#1846 in kube-state-metrics to make the transition to the kube-scheduler metrics. |
@smarterclayton we are now recommending users to switch to the scheduler resource metrics in kube-state-metrics and there is an open PR to propagate the use of the new metrics in the Prometheus mixins: kubernetes-monitoring/kubernetes-mixin#815. Based on that, we should have a good enough user base for these metrics, so would it make sense to graduate the effort to stable? |
Hello @dgrisonnet 👋, 1.27 Enhancements team here. Just checking in as we approach enhancements freeze on 18:00 PDT Thursday 9th February 2023. This enhancement is targeting for stage Here's where this enhancement currently stands:
It looks like kubernetes/kubernetes#115454 and #3810 will address most of these issues. The status of this enhancement is marked as |
Hi @npolshakova, I completed the different action items, please let me know if there is anything else I need to do. |
Great, I'm marking this enhancement as tracked for v1.27. /remove-label tracked/no |
@npolshakova: Those labels are not set on the issue: In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Hi @dgrisonnet, Checking in as we approach 1.27 code freeze at 17:00 PDT on Tuesday 14th March 2023. Please ensure the following items are completed:
Please let me know if there are any other PRs in k/k I should be tracking for this KEP. |
Hi @npolshakova, I have updated everything, thank you for the reminder :) |
Hi @dgrisonnet @smarterclayton, I’m reaching out from the 1.27 Release Docs team. This enhancement is marked as ‘Needs Docs’ for the 1.27 release. Please follow the steps detailed in the documentation to open a PR against dev-1.27 branch in the k/website repo. This PR can be just a placeholder at this time, and must be created by March 16. For more information, please take a look at Documenting for a release to familiarize yourself with the documentation requirements for the release. Please feel free to reach out with any questions. Thanks! |
Hi @LukeMwila, I completely missed the fact that we add to write a doc for this KEP. I opened a placeholder PR for now kubernetes/website#39970. @smarterclayton do you perhaps remember what you had in mind for the doc? I was thinking about drafting something about capacity planning in general. |
Enhancement Description
k/enhancements
) update PR(s):k/k
) update PR(s):k/website
) update(s):The text was updated successfully, but these errors were encountered: