-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[v1.24] use container stats from cadvisor for cri-dockerd #74
Conversation
Note: This PR is still marked as WIP because yet to add drone build logic for it. But following tags have the linked fix for testing purposes: They have been pulled into rancher/hyperkube for |
c0a45ed
to
2c8a88b
Compare
4bc359b
to
ff1043a
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reviewed non-vendor updates commits, looks sane to me
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good to me!
Issue
The main issue around cpu spike with >=v1.24 RKE1 clusters is Kubernetes deprecating internal dockershim with 1.24 and Rancher now using cri-dockerd - this means that all stats are now collected by cri-dockerd talking to docker api directly instead of getting them from cadvisor. The docker calls themselves seem to be the bottleneck, which changes from system to system. (This is confirmed by internal testing and varying results on different user setups).
rancher/rancher#38816
Fix
The workaround would be to build our own kubelet that installs docker plugin for cadvisor and continue reading stats the old way.