
sum(container_memory_usage_bytes{...}) rule doubles values #136

Closed
rrichardson opened this issue Jan 17, 2019 · 7 comments


@rrichardson

The sum(container_* ...) rules sum up duplicates of the data provided by cAdvisor within the kubelet; the duplicates are reported under the same metric names, albeit with different labels.

The label selectors in the rules in the default rules file collect both the NodeExporter (I think?) records and the kubelet cAdvisor records. This results in values that are exactly double reality.

I think the solution here is to just use the service="kubelet" and container_name!="" label selectors; then there is no need for a sum().

Originally posted here:
prometheus-operator/prometheus-operator#2302

What did you do?
Installed the Prometheus chart and friends via Helm in a K8s cluster created by kubeadm 1.11.

What did you expect to see?
Correct values aggregated by the rules:


| Rule | State | Error | Last Evaluation | Evaluation Time |
| --- | --- | --- | --- | --- |
| record: pod_name:container_memory_usage_bytes:sum expr: sum by (pod_name) (container_memory_usage_bytes{container_name!="POD",pod_name!=""}) | OK | | 16.737s ago | 17.95ms |
| record: pod_name:container_spec_cpu_shares:sum expr: sum by (pod_name) (container_spec_cpu_shares{container_name!="POD",pod_name!=""}) | OK | | 16.719s ago | 14.89ms |
| record: pod_name:container_cpu_usage:sum expr: sum by (pod_name) (rate(container_cpu_usage_seconds_total{container_name!="POD",pod_name!=""}[5m])) | OK | | 16.704s ago | 19.75ms |
| record: pod_name:container_fs_usage_bytes:sum expr: sum by (pod_name) (container_fs_usage_bytes{container_name!="POD",pod_name!=""}) | | | | |

If the rules were changed to just use the output from the kubelet, a sum() would not be necessary. This would require setting {service="kubelet", container_name!=""}, roughly as sketched below.
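A minimal sketch of one reading of that suggestion, applied to the first rule from the table above; the service label name comes straight from the comment and is not verified, and the sum() is kept here only so the record keeps its existing shape:

```yaml
groups:
  - name: k8s.rules
    rules:
      # Sketch only: restrict the rule to kubelet/cAdvisor series and exclude the
      # pod-level parent-cgroup series (container_name=""). Label names are taken
      # from the issue text, not verified against a live cluster.
      - record: pod_name:container_memory_usage_bytes:sum
        expr: >
          sum by (pod_name) (
            container_memory_usage_bytes{service="kubelet", container_name!="POD",
                                         container_name!="", pod_name!=""}
          )
```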

What did you see instead? Under which circumstances?
In addition to NodeExporter (I think?) exporting data under these metric names, the kubelet also reports data under the same names, albeit with different labels. The kubelet reports the exact sum of all containers in the Pod, so the above rules report a value that is exactly double the actual value.

Environment

  • Prometheus Operator version:

    Image ID: docker-pullable://quay.io/coreos/prometheus-operator@sha256:faa9f8a9045092b9fe311016eb3888e2c2c824eb2b4029400f188a765b97648a

  • Kubernetes version information:

Client Version: version.Info{Major:"1", Minor:"12", GitVersion:"v1.12.4", GitCommit:"f49fa022dbe63faafd0da106ef7e05a29721d3f1", GitTreeState:"clean", BuildDate:"2018-12-14T07:10:00Z", GoVersion:"go1.10.4", Compiler:"gc", Platform:"linux/amd64"}
Server Version: version.Info{Major:"1", Minor:"11", GitVersion:"v1.11.0", GitCommit:"91e7b4fd31fcd3d5f436da26c980becec37ceefe", GitTreeState:"clean", BuildDate:"2018-06-27T20:08:34Z", GoVersion:"go1.10.2", Compiler:"gc", Platform:"linux/amd64"}
  • Kubernetes cluster kind:

    kubeadm on bare metal

  • Manifests:

https://github.com/coreos/prometheus-operator/blob/master/contrib/kube-prometheus/manifests/prometheus-rules.yaml#L22

  • Prometheus Operator Logs:

not relevant

@tomwilkie
Member

Hi @rrichardson; thanks for the issue! I'd be surprised if node_exporter is exporting container_* metrics, but cAdvisor (embedded in the kubelet) exports metrics in a hierarchical fashion, and hence if we aggregate lower levels of the hierarchy with upper levels, we can get doubling...

We solve this by dropping lower levels at scrape time, see https://github.com/grafana/jsonnet-libs/blob/master/prometheus-ksonnet/lib/prometheus-config.libsonnet#L287

Could you confirm that adding image!="" to the rules fixes this for you? We should add that here, and see if we can have the Prometheus Operator also drop those series.
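A rough sketch of the scrape-time drop described above, written as a plain Prometheus scrape config rather than the linked jsonnet; the job name and exact regex here are assumptions, not a copy of that config:

```yaml
scrape_configs:
  - job_name: kubelet                 # illustrative job name
    # kubernetes_sd_configs, tls_config, etc. omitted for brevity
    metric_relabel_configs:
      # Drop cAdvisor series that have an empty image label (the aggregate,
      # parent-cgroup series), keeping only per-container series.
      - source_labels: [__name__, image]
        regex: 'container_.+;'        # container_* metric joined with image=""
        action: drop
```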

@rrichardson
Author

rrichardson commented Jan 19, 2019 via email

@metalmatze
Member

The Prometheus-Operator supports relabeling on the Endpoints of a ServiceMonitor:
https://github.com/coreos/prometheus-operator/blob/master/pkg/apis/monitoring/v1/types.go#L558

What we would need to do is to update this endpoint in the kube-prometheus stack:
https://github.com/coreos/prometheus-operator/blob/master/contrib/kube-prometheus/jsonnet/kube-prometheus/prometheus/prometheus.libsonnet#L280

Would you like to open a PR, @rrichardson? Let me know; otherwise I can make the change too, but it's a nice little contribution if you want it.
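A hedged sketch of what that could look like on the kubelet ServiceMonitor's cAdvisor endpoint; the port, path, selector, and regex below are illustrative assumptions rather than the actual kube-prometheus manifest:

```yaml
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: kubelet
  namespace: monitoring
spec:
  endpoints:
    - port: https-metrics             # illustrative endpoint settings
      path: /metrics/cadvisor
      metricRelabelings:
        # Drop cAdvisor series with an empty image label (the parent-cgroup
        # "total" series) before ingestion, so sums over containers stay correct.
        - sourceLabels: [__name__, image]
          regex: 'container_.+;'
          action: drop
  selector:
    matchLabels:
      k8s-app: kubelet                # illustrative selector
```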

@rrichardson
Author

@metalmatze Sure. I'll take a whack at it.

@rrichardson
Author

I still don't fully understand Prometheus or the scope of cAdvisor and its hierarchy of metrics.

But shouldn't https://github.com/grafana/jsonnet-libs/blob/master/prometheus-ksonnet/lib/prometheus-config.libsonnet#L286-L292 already solve my problem?

It should be dropping container_.* series where image=="". When I filter out image="", my result is correct. But shouldn't those series not exist in the first place?
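For concreteness, the image filter being described would slot into the recording rules from the issue body roughly like this (a sketch of the image!="" variant only, not necessarily the change that was eventually merged):

```yaml
groups:
  - name: k8s.rules
    rules:
      # Sketch: excluding the parent-cgroup series (image="") yields the correct
      # per-pod sum, per the observation above.
      - record: pod_name:container_memory_usage_bytes:sum
        expr: >
          sum by (pod_name) (
            container_memory_usage_bytes{container_name!="POD", pod_name!="", image!=""}
          )
```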

@Sayrus
Contributor

Sayrus commented Jul 6, 2020

Hey @rrichardson ,

Thanks for documenting your findings. I'm currently having the exact same problem with the metrics from cAdvisor not playing well with sum, for the exact same reason you opened this issue: the "total" (image="") is not filtered out because the relabeling rule does not exist in kube-prometheus.

> I still don't fully understand Prometheus or the scope of cAdvisor and its hierarchy of metrics.
>
> But shouldn't https://github.com/grafana/jsonnet-libs/blob/master/prometheus-ksonnet/lib/prometheus-config.libsonnet#L286-L292 already solve my problem?
>
> It should be dropping container_.* series where image=="". When I filter out image="", my result is correct. But shouldn't those series not exist in the first place?

I see that the PR on kube-prometheus was closed because this metric is not redundant and only the Grafana dashboards should be fixed to filter it out.
However, I can't seem to find any patch on the dashboard side.

How did you end up fixing the issue? Were the changes upstreamed and I'm missing something?

Edit:

To add a bit more context, here is an example of a value I get doubled:
https://github.com/kubernetes-monitoring/kubernetes-mixin/blob/master/dashboards/resources/pod.libsonnet#L106
[screenshot: dashboard panel showing two series for the same pod, one with container="" and one with container="manager"]

Just like in your issue, one series is the pod total (container=""), the other is the container itself (container="manager"); container="POD" is already filtered out by the query.

Sayrus pushed a commit to Sayrus/kubernetes-mixin that referenced this issue Jul 6, 2020
…ner_cpu_cfs

Currently, Kubelet cAdvisor exports metrics for the parent cgroup as
well as for each container. This leads to having "duplicate metrics" and
especially leads to strange or wrong visualisations.

Filtering by `container!=""` excludes metrics from the parent cgroup.
This patch avoids having two time series in the CPU Throttling panel.

Related to kubernetes-monitoring#136.
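The change the commit describes boils down to adding container!="" to the affected expressions. As a rough illustration (the record name below is made up and the real kubernetes-mixin dashboard query differs), the pattern is:

```yaml
groups:
  - name: example.rules
    rules:
      # Hypothetical rule, purely illustrative: container!="" excludes the
      # parent-cgroup series so only per-container throttling data remains.
      - record: pod_container:container_cpu_cfs_throttled_periods:increase5m
        expr: >
          sum by (namespace, pod, container) (
            increase(container_cpu_cfs_throttled_periods_total{container!="", container!="POD"}[5m])
          )
```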
csmarchbanks pushed a commit that referenced this issue Jul 6, 2020
…ner_cpu_cfs (#456)

Currently, Kubelet cAdvisor exports metrics for the parent cgroup as
well as for each container. This leads to having "duplicate metrics" and
especially leads to strange or wrong visualisations.

Filtering by `container!=""` excludes metrics from the parent cgroup.
This patch avoids having two time series in the CPU Throttling panel.

Related to #136.

Signed-off-by: Mathis Raguin <mathis@cri.epita.fr>
@paulfantom
Member

I think this can be closed after #512 was merged.
