Hang with df -h #1630
Comments
keimoon
commented
Oct 24, 2016
|
Some related error in
|
|
The timing plus the journal entry are suspicious, and may hint at some rkt pods which are not being garbage-collected quickly enough. @keimoon can you please manually run a |
keimoon
commented
Oct 25, 2016
|
@lucab thanks, I will paste the output when it hangs again |
PAStheLoD
commented
Dec 15, 2016
|
Hello, On 1248.1.0. Drastic things can happen if someone tries a rkt-ified toolbox, such as un-GC-able pods.
Results in And then it takes a few reboots to clean up the resulting mess if more and more containers' things get tangled up below that pod's mounts (so it seemed that more and more mounts got accessible from that directory). |
|
@PAStheLoD be careful about this:
This is nesting run and usr under an existing root volume. Due to "shared" back propagation, they will re-appear back on the host, layered on top of your rootfs. That's an almost pathological case to clean. I strongly suggest using separate non-nested targets for each volume and, if possible, not sharing root at all (i.e. only share the rootfs directories you need). |
This was referenced Dec 19, 2016
crawford
added
area/usability
component/kernel
component/rkt
kind/friction
team/os
labels
Jan 12, 2017
lucab
referenced this issue
Mar 6, 2017
Closed
Upgrade to CoreOS Alpha 1339.0.0 breaks connectivity to node-exporter container after instance reboot #1844
btalbot
commented
Jun 23, 2017
|
This seems to be happening again with Access to /proc/sys/fs/binfmt_misc seems to hang. Rebooting does seem to clear the issue for a very short time, maybe until something tries to access the binfmt_misc filesystem.
Logs about
and
|
denderello
commented
Jun 23, 2017
|
We are facing the same issue with CoreOS
We are running a similar setup of EC2 and EBS volumes to the one @btalbot described. |
|
This is typically due to mounts piling up, or similar strange scenarios, as a result of some other container-mounting activity. This is just a symptom, however, and the root causes may vary. I'd suggest starting from a fresh system, keeping track of which containers/services are being started/stopped/gc-ed, and then looking into |
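To make "mounts piling up" measurable while tracking containers, the mount table can be snapshotted and compared over time. The following is only a minimal sketch, assuming the standard `/proc/self/mountinfo` layout in which the fifth whitespace-separated field is the mount point. The function names and the threshold are hypothetical and not taken from this thread.

```python
from collections import Counter


def mount_point_counts(mountinfo_text):
    """Count how many mount entries exist for each mount point.

    Expects text in /proc/self/mountinfo format, where the fifth
    whitespace-separated field of every line is the mount point.
    """
    counts = Counter()
    for line in mountinfo_text.splitlines():
        fields = line.split()
        if len(fields) < 5:
            continue  # skip blank or malformed lines
        counts[fields[4]] += 1
    return counts


def stacked_mount_points(mountinfo_text, threshold=2):
    """Return the mount points that appear at least `threshold` times."""
    counts = mount_point_counts(mountinfo_text)
    return {path: n for path, n in counts.items() if n >= threshold}
```

On an affected host, one could read `/proc/self/mountinfo`, pass its contents to `stacked_mount_points`, and log the result each time a container is started, stopped or garbage-collected, to see which activity makes the numbers grow.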
lucab
added
the
needs/more-information
label
Jun 23, 2017
keimoon
commented
Jun 23, 2017
|
I haven't seen this error in my system for months. But when this occurred I can just mount the |
denderello
commented
Jun 23, 2017
|
@lucab We can try to gather this data next week, although it will take a bit of time. One thing that I have not mentioned yet: this happens when we update our cluster machines from |
idleyoungman
commented
Jun 23, 2017
|
We have a container w/ a read-only volume mount of the host's root vol (like Contents of |
idleyoungman
commented
Jun 23, 2017
|
Just a note re: our current workaround: We are starting the docker container including the volume mount of the host's root vol via a systemd unit. Adding |
|
I suspect this issue will be fixed by systemd/systemd#5916. That change should be in the next systemd release. The easiest workaround in the meantime is probably to just mask that automount ( If you do need binfmt_misc mounted, you could still mask the automount and depend on the regular |
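Before and after applying such a workaround, it can help to confirm what is actually mounted at `/proc/sys/fs/binfmt_misc`. The sketch below assumes the standard `/proc/self/mountinfo` format, in which the filesystem type is the first field after the ` - ` separator, and assumes that a systemd automount point shows up with type `autofs`. The function name is hypothetical.

```python
def filesystem_types_at(mountinfo_text, mount_point):
    """List the filesystem types of all mounts on `mount_point`.

    Expects text in /proc/self/mountinfo format: the mount point is the
    fifth field, and the filesystem type follows the " - " separator.
    """
    types = []
    for line in mountinfo_text.splitlines():
        left, sep, right = line.partition(" - ")
        left_fields = left.split()
        right_fields = right.split()
        if not sep or len(left_fields) < 5 or not right_fields:
            continue  # skip blank or malformed lines
        if left_fields[4] == mount_point:
            types.append(right_fields[0])
    return types
```

If the returned list for `/proc/sys/fs/binfmt_misc` contains `autofs`, the automount is still active. A list containing only `binfmt_misc` would suggest the regular mount is in place, and an empty list that nothing is mounted there.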
euank
added
component/systemd
dependency/external
and removed
needs/more-information
dependency/external
labels
Jun 26, 2017
|
This should now be fixed in the beta channel, as well as in the next alpha and stable releases, due shortly. |
bgilbert
closed this
Jul 6, 2017
felixbuenemann
commented
Jul 6, 2017
|
@bgilbert It would be great if you could add a link to the PR or commit that introduced the fix. |
|
@felixbuenemann the upstream fix is systemd/systemd#5916 and will land in systemd-234, while the backport for ContainerLinux systemd-233 is coreos/systemd#82. |
keimoon commented Oct 24, 2016
Issue Report
Bug
CoreOS Version
Environment
AWS EC2, deployed using `kube-aws`
Expected Behavior
`df -h` returns a list of mounted storage devices
Actual Behavior
`df -h` hangs indefinitely
Reproduction Steps
Other Information
Kubernetes version:
version.Info{Major:"1", Minor:"4", GitVersion:"v1.4.3+coreos.0", GitCommit:"7819c84f25e8c661321ee80d6b9fa5f4ff06676f", GitTreeState:"clean", BuildDate:"2016-10-17T21:19:17Z", GoVersion:"go1.6.3", Compiler:"gc", Platform:"linux/amd64"}
Kubernetes Controller Instance type:
t2.small
Kubernetes Worker Instance type:
m4.xlarge
We ran into an issue where commands like `df -h` hang on a worker server. We ran it under `strace` and it hangs at:
stat("/proc/sys/fs/binfmt_misc",
Rebooting the worker instances resolves the problem temporarily, but after about 1 day it occurs again.
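Because the hang happens inside a `stat` call, a monitoring script that stats the path directly would hang as well. One way around that, sketched here under the assumption that a Python interpreter is available on the host (or in a toolbox container), is to run the `stat` in a child process and give up after a timeout. The function name and the default timeout are hypothetical.

```python
import subprocess
import sys


def stat_returns_in_time(path, timeout=5.0):
    """Return True if a stat() of `path` finishes within `timeout` seconds.

    The stat runs in a child process so that a hung mount point cannot
    block the caller. A stat that fails quickly (for example because the
    path does not exist) still counts as "returned".
    """
    child = subprocess.Popen(
        [sys.executable, "-c", "import os, sys; os.stat(sys.argv[1])", path],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    try:
        child.wait(timeout=timeout)
        return True
    except subprocess.TimeoutExpired:
        # Best effort only: do not wait after kill(), since a child that
        # is stuck in the kernel may not exit promptly.
        child.kill()
        return False
```

Calling `stat_returns_in_time("/proc/sys/fs/binfmt_misc")` periodically would let a node flag itself before commands like `df -h` start hanging for users.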