Increase the period of getting container filesystem/network stats #898
Comments
/cc @dchen1107
From the referenced issue it seems running |
AFAICT this du check only runs on aufs-backed docker, which basically means docker running on Ubuntu. Firstly, I don't really like that inconsistency, and secondly, I'm not sure I see the value of this check anyway. What do you think about removing it? In the meantime I'll have a think about how we might implement it in a better fashion. Alternatively we could poll less frequently, but that just seems to delay the problem until the number of containers rises.
Do we not check disk usage in other cases? I think we want more disk usage information exposed to kubelet. @vishh is probably the one who added this, so he could defend the choice better than I can. I agree that polling less frequently is only a short-term solution, but it would help for the time being since we've had quite a few reports from users. A better implementation is more than welcome :)
Yes, only aufs. See cadvisor/container/docker/handler.go, line 232 in b22a085.
If this check is required, the way to perform it depends on the backing storage used for docker. In its current form the check would probably also work for devicemapper loopback, but it isn't going to work for a more production-like deployment such as one using direct-lvm. However, I can't think of a better way to do it, to be honest, and I can't see this performing well enough at scale. I still vote to drop the check and open an issue to think about whether we can do this better.
You probably already know this, but thought it worth just noting why |
I am okay with disabling this for now since kubelet hasn't started utilizing it, but I think we still want the disk usage information in the near future. @vishh, WDYT?
Filesystem stats are useful mainly to figure out which container is hogging disk space on a given node.
See #771 for another reason not to do |
@jimmidyson du is a temporary workaround for disk usage tracking in cAdvisor without disk quota. We are working on a proposal and prototype for better disk usage tracking; one proposal is to use disk quota tracking. But before we get there, we need signals at least to detect an out-of-disk condition and to propagate that information to the upstream layers for management. So increasing the interval for filesystem stats might be an OK workaround for the short term.
@dchen1107 If we could somehow swap to |
@jimmidyson: We do not use |
Frequent checking could cause high cpu usage, as reported by a kubernetes user in kubernetes/kubernetes#10451 (comment)
edit: The current housekeeping period is 1s
/cc @vishh