CPU usage of pod always zero #17889

Open
edoardottt opened this issue Jan 4, 2024 · 10 comments
Labels
co/runtime/docker Issues specific to a docker runtime lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed.

Comments

@edoardottt

What Happened?

Context: #13898

I've been using the --disable-optimizations flag, but that doesn't solve the problem for me.
I need to gather statistics about the utilization and requests of some pods, but I can't, because the output always reports zero, even under stress tests:

minikube   default                hello-node-59684db6fc-tzkzx                  0m (0%)        0m (0%)      1m (0%)     0Mi (0%)          0Mi (0%)        7Mi (0%)

Attach the log file

log.txt

Operating System

Ubuntu

Driver

Docker

@afbjorklund
Collaborator

afbjorklund commented Jan 4, 2024

You can compare with the output of docker stats. If they differ, it is an issue with cri-dockerd.

EDIT: There is also a dedicated crictl statsp command that can be used to check the explicit CRI feature
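
A minimal sketch of that comparison, run inside the minikube node (the cri-dockerd socket path below is an assumption; adjust it to your setup):

minikube ssh                                                              # open a shell on the minikube node, then:
docker stats --no-stream                                                  # per-container usage straight from the Docker engine
sudo crictl --runtime-endpoint unix:///var/run/cri-dockerd.sock statsp    # pod sandbox stats via the CRI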

@afbjorklund afbjorklund added the co/runtime/docker Issues specific to a docker runtime label Jan 4, 2024
@afbjorklund
Collaborator

afbjorklund commented Jan 4, 2024

I think it is related to ListPodSandboxStats not being implemented for Docker, so it returns nothing.

There is a hardcoded kubelet workaround for cri-o, but I think ListPodSandboxStats itself is only implemented for containerd.

// UsingLegacyCadvisorStats returns true if container stats are provided by cadvisor instead of through the CRI.
// CRI integrations should get container metrics via CRI.
// TODO: cri-o relies on cadvisor as a temporary workaround. The code should
// be removed. Related issue:
// https://github.com/kubernetes/kubernetes/issues/51798
func UsingLegacyCadvisorStats(runtimeEndpoint string) bool {
        return strings.HasSuffix(runtimeEndpoint, CrioSocketSuffix)
}
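
Since that fallback is keyed on the socket suffix, a docker-driver node whose kubelet points at cri-dockerd.sock never takes the cAdvisor path. A quick, hedged way to confirm which CRI endpoint the kubelet is actually using (tool availability and flag name on the node are assumptions):

minikube ssh -- "pgrep -a kubelet | grep -o 'container-runtime-endpoint=[^ ]*'"   # expect something ending in cri-dockerd.sock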

@edoardottt
Author

Thanks for the answer @afbjorklund, really appreciated!
So should I change the container runtime? Something like containerd?
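
A hedged sketch of trying containerd on a fresh profile (the start flag exists; whether it changes the zero readings is not confirmed here):

minikube delete                                  # throws away the current profile
minikube start --container-runtime=containerd    # recreate the cluster with containerd as the CRI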

@afbjorklund
Collaborator

afbjorklund commented Jan 4, 2024

I think the function was implemented incorrectly, so the fallback doesn't work either; it is probably a bug.

// From the generated CRI API stubs: the default server returns "unimplemented".
func (*UnimplementedRuntimeServiceServer) ListPodSandboxStats(ctx context.Context, req *ListPodSandboxStatsRequest) (*ListPodSandboxStatsResponse, error) {
        return nil, status.Errorf(codes.Unimplemented, "method ListPodSandboxStats not implemented")
}

// From cri-dockerd: the Docker shim likewise returns an explicit "not implemented" error.
func (ds *dockerService) ListPodSandboxStats(context.Context, *runtimeapi.ListPodSandboxStatsRequest) (*runtimeapi.ListPodSandboxStatsResponse, error) {
        return nil, fmt.Errorf("ListPodSandboxStats is not implemented")
}

If it (cri-dockerd) is indeed the issue here, there should be some evidence of the error in the kubelet logs.
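
One rough way to look for that evidence (the exact error text to grep for is an assumption):

minikube ssh -- "sudo journalctl -u kubelet --no-pager | grep -i ListPodSandboxStats"
minikube logs --file=minikube-logs.txt    # or dump everything and search the file offline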

@edoardottt
Author

Wow... @afbjorklund, I didn't expect that.
Since I'm a newbie here... how can I collect some kind of resource utilization data?

Is there a way (or more than one) to collect information about the cluster and its components?

@afbjorklund
Collaborator

afbjorklund commented Jan 4, 2024

I can't reproduce it here, though (v1.32.0).

top node

NAME       CPU(cores)   CPU%   MEMORY(bytes)   MEMORY%   
minikube   229m         2%     866Mi           5%        

top pods -A

NAMESPACE     NAME                               CPU(cores)   MEMORY(bytes)   
kube-system   coredns-5dd5756b68-wl86q           4m           14Mi            
kube-system   etcd-minikube                      31m          34Mi            
kube-system   kube-apiserver-minikube            79m          218Mi           
kube-system   kube-controller-manager-minikube   31m          40Mi            
kube-system   kube-proxy-hgtrw                   1m           16Mi            
kube-system   kube-scheduler-minikube            5m           17Mi            
kube-system   metrics-server-7c66d45ddc-gpvt7    6m           17Mi            
kube-system   storage-provisioner                3m           9Mi        

It uses cri-dockerd 0.3.3; I will try 0.3.9 too
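
Presumably the numbers above come from the metrics-server addon plus kubectl top; a minimal sketch of that kind of check:

minikube addons enable metrics-server
kubectl top node
kubectl top pods -A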

@edoardottt
Author

edoardottt commented Jan 4, 2024

This is the command I'm executing:

kubectl resource-capacity --sort cpu.util --util --pods --pod-count

And this is the complete output I get:

NODE       NAMESPACE              POD                                          CPU REQUESTS   CPU LIMITS   CPU UTIL    MEMORY REQUESTS   MEMORY LIMITS   MEMORY UTIL   POD COUNT
                                                                                                                                                                       
minikube   *                      *                                            850m (7%)      0m (0%)      122m (1%)   370Mi (1%)        170Mi (0%)      642Mi (2%)    11/110
minikube   kube-system            kube-apiserver-minikube                      250m (2%)      0m (0%)      26m (0%)    0Mi (0%)          0Mi (0%)        220Mi (0%)    
minikube   kube-system            etcd-minikube                                100m (0%)      0m (0%)      10m (0%)    100Mi (0%)        0Mi (0%)        43Mi (0%)     
minikube   kube-system            kube-controller-manager-minikube             200m (1%)      0m (0%)      8m (0%)     0Mi (0%)          0Mi (0%)        51Mi (0%)     
minikube   kube-system            coredns-5dd5756b68-8fb7w                     100m (0%)      0m (0%)      2m (0%)     70Mi (0%)         170Mi (0%)      14Mi (0%)     
minikube   kube-system            kube-scheduler-minikube                      100m (0%)      0m (0%)      2m (0%)     0Mi (0%)          0Mi (0%)        18Mi (0%)     
minikube   kube-system            metrics-server-7c66d45ddc-6qfxq              100m (0%)      0m (0%)      2m (0%)     200Mi (0%)        0Mi (0%)        18Mi (0%)     
minikube   kube-system            storage-provisioner                          0m (0%)        0m (0%)      2m (0%)     0Mi (0%)          0Mi (0%)        9Mi (0%)      
minikube   kubernetes-dashboard   dashboard-metrics-scraper-7fd5cb4ddc-4hqsv   0m (0%)        0m (0%)      1m (0%)     0Mi (0%)          0Mi (0%)        7Mi (0%)      
minikube   kubernetes-dashboard   kubernetes-dashboard-8694d4445c-4n4d4        0m (0%)        0m (0%)      1m (0%)     0Mi (0%)          0Mi (0%)        10Mi (0%)     
minikube   default                hello-node-59684db6fc-tzkzx                  0m (0%)        0m (0%)      1m (0%)     0Mi (0%)          0Mi (0%)        7Mi (0%)      
minikube   kube-system            kube-proxy-rnzw2                             0m (0%)        0m (0%)      1m (0%)     0Mi (0%)          0Mi (0%)        15Mi (0%)

So it seems to work for me as well; what I want to say is that when the hello-node test pod is under a stress test, its metrics don't change, they keep reporting 0 for everything... they are not responsive, let's say
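
For reference, kubectl resource-capacity is the kube-capacity plugin (installable via krew; plugin name as documented upstream). A hedged sketch of generating CPU load and re-checking whether the utilization column reacts; the pod name, image, and busy-loop are just one possible approach:

kubectl krew install resource-capacity
kubectl run cpu-burn --image=busybox --restart=Never -- sh -c 'yes > /dev/null'   # simple CPU burner (hypothetical helper pod)
kubectl resource-capacity --sort cpu.util --util --pods --pod-count               # CPU UTIL should move if stats are flowing
kubectl delete pod cpu-burn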

@afbjorklund
Collaborator

afbjorklund commented Jan 4, 2024

I think it is a known issue with cri-dockerd; there might be some workarounds available.

The dockershim code was removed from Kubernetes before the new functionality was available in cri-dockerd:

https://kubernetes.io/docs/tasks/administer-cluster/migrating-from-dockershim/

You can mitigate this issue by running cAdvisor as a standalone DaemonSet.
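
A rough sketch of that mitigation, using the manifests shipped in the upstream cAdvisor repository (the repository layout, kustomize path, and namespace are assumptions; check the cAdvisor deploy docs):

git clone https://github.com/google/cadvisor.git
cd cadvisor
kubectl kustomize deploy/kubernetes/base | kubectl apply -f -
kubectl -n cadvisor get pods    # assumes the manifests create a dedicated cadvisor namespace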

@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 3, 2024
@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle rotten
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels May 3, 2024