-
Notifications
You must be signed in to change notification settings - Fork 321
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[enhancement] Add support for monitoring node runtime & system resource usage #101
Comments
Hi @stevehipwell, Could you share which metrics you have in mind and what is your use-case? |
@dotdc I'd like to see the metrics showing the runtime and system resource utilization, the panels could be hidden with a tip on how to enable them if the metrics aren't present. A rough stab at the queries based on an EKS cluster follows. Runtime Usage
System Usage (mem only)
AFAIK they're not high cardinality metrics and were disabled via a fairly opaque discussion. |
I have the metrics, but they don't have the same
Do you think you can make generic queries that would work everywhere (with |
@dotdc the metrics above are container metrics, you need the following metrics. Due to AKS currently using
|
@stevehipwell I'm sorry, but I don't understand what you're trying to achieve. Which information are you missing in this dashboard (panel title) ? |
Describe the enhancement you'd like
I'd like the nodes dashboard to show the runtime and system resource usage, as are exported by kubelet.
Additional context
This requires that the cAdvisor metrics for cgroup slices aren't being dropped. For this to work with Kube Prometheus Stack the kubelet ServiceMonitor
cAdvisorMetricRelabelings
value needs to be overridden to keep the required values.The text was updated successfully, but these errors were encountered: