
cAdvisor should not report irrelevant metrics for systemd units relating to mounts #2037

Open
bboreham opened this issue Sep 7, 2018 · 4 comments

Comments

bboreham (Contributor) commented Sep 7, 2018

(using cAdvisor inside kubelet 1.11.2)

I see a bunch of metrics reported like this:

container_cpu_load_average_10s{container_name="",id="/system.slice/run-r0adb997525644e3e97b5a387c0404b5d.scope",image="",name="",namespace="",pod_name=""} 0
container_cpu_load_average_10s{container_name="",id="/system.slice/run-r0afc4fa09f17427cbdd3cc0f78e7f03f.scope",image="",name="",namespace="",pod_name=""} 0
container_cpu_load_average_10s{container_name="",id="/system.slice/run-r13a18ccb82ce44ceb230128b545a7561.scope",image="",name="",namespace="",pod_name=""} 0
...

and these cgroups appear to correspond to scope units that systemd creates whenever a mount is created:

# systemctl status run-r6fc4ce2068d9421fbc4334d9c799b19c.scope
● run-r6fc4ce2068d9421fbc4334d9c799b19c.scope - Kubernetes transient mount for /mnt/containers/kubernetes/pods/44fdf928-a871-11e8-817c-12ffd34f0094/volumes/kubernetes.io~secret/default-token-s7sk7
   Loaded: loaded
Transient: yes
  Drop-In: /run/systemd/system/run-r6fc4ce2068d9421fbc4334d9c799b19c.scope.d
           └─50-Description.conf
   Active: active (running) since Sat 2018-08-25 14:15:08 UTC; 1 weeks 6 days ago

Unless I misunderstand, there is never going to be any load average, run queue, memory failures, etc., for these cgroups.

I see 45 different metrics reported per unit, so many thousands across our cluster.

dashpole (Collaborator) commented Sep 7, 2018

hmmm... maybe we need to ignore .scope cgroups as well? I am no expert, but I thought mount cgroups ended in .mount...
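For illustration, a minimal sketch of what such a filter could look like, assuming it matched the transient run-r<32 hex chars>.scope names shown above (the function name and regex here are hypothetical, not cAdvisor's actual code):

package main

import (
	"fmt"
	"regexp"
)

// Hypothetical pattern: the transient mount scopes in the report above
// all look like /system.slice/run-r<32 hex chars>.scope.
var transientMountScope = regexp.MustCompile(`^/system\.slice/run-r[0-9a-f]{32}\.scope$`)

// isTransientMountScope reports whether a cgroup path names one of
// systemd's transient mount scopes.
func isTransientMountScope(cgroupPath string) bool {
	return transientMountScope.MatchString(cgroupPath)
}

func main() {
	fmt.Println(isTransientMountScope("/system.slice/run-r0adb997525644e3e97b5a387c0404b5d.scope")) // true
	fmt.Println(isTransientMountScope("/system.slice/docker.service"))                              // false
}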

bboreham (Contributor, Author) commented Sep 8, 2018

I am no expert in systemd either.

From systemd.scope I read:

Scope units are [...] created programmatically using the bus interfaces of systemd. [...]
The main purpose of scope units is grouping worker processes of a system service for organization and for managing resources.

I might speculate that these units are created internally by systemd to run processing relating to each mount, which would negate my claim that there is "never going to be any load ..." above.

Maybe a different approach would be to limit the depth to which certain cgroups are monitored? E.g. I could declare to cAdvisor I'm happy to monitor /system.slice as a whole, and don't need to see detail under it?
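A rough sketch of that depth idea, assuming a per-prefix depth limit (withinDepth and its parameters are made up here, just to illustrate the proposal):

package main

import (
	"fmt"
	"strings"
)

// withinDepth reports whether cgroupPath is at most maxDepth levels
// below prefix; paths outside the prefix are not limited by this rule.
func withinDepth(cgroupPath, prefix string, maxDepth int) bool {
	if cgroupPath == prefix {
		return true
	}
	if !strings.HasPrefix(cgroupPath, prefix+"/") {
		return true // unrelated subtree; some other rule decides
	}
	rel := strings.TrimPrefix(cgroupPath, prefix+"/")
	return strings.Count(rel, "/")+1 <= maxDepth
}

func main() {
	// With maxDepth 0, /system.slice itself is monitored but nothing under it.
	fmt.Println(withinDepth("/system.slice", "/system.slice", 0))                     // true
	fmt.Println(withinDepth("/system.slice/run-r0adb9975.scope", "/system.slice", 0)) // false
	fmt.Println(withinDepth("/kubepods/pod1/ctr1", "/system.slice", 0))               // true
}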

jraby commented Oct 16, 2018

Ignoring these would be really useful for us. We have thousands of k8s pods with a few mounts each, and these system.slice/run-*.scope metrics account for the vast majority of the metrics reported by cadvisor.

Having a way to ignore them at the source would be awesome.

junta:tmp jean$ grep -E '^[^#]'  host1.cadvisor | wc -l
   30523
junta:tmp jean$ grep -E '^[^#]'  host1.cadvisor | grep system.slice/run | wc -l
   24312

bboreham (Contributor, Author) commented Jun 25, 2019

Update: the underlying issue seems to be a systemd thing; see kubernetes/kubernetes#64137
or kubernetes/kubernetes#57345
