Bind mounts in container not updated after volume is remounted again #66567

cofyc · 2018-07-24T11:10:42Z

Is this a BUG REPORT or FEATURE REQUEST?:

Uncomment only one, leave it on its own line:

/kind bug

/kind feature

@kubernetes/sig-storage-bugs

What happened:

For fuse-backed volumes, fuse process may exit on error, the volume on the host will disappear. Kubelet will remount the volume later because it will find volume mount point does not exist anymore. But bind mount in container will still be the old one, processes in container will fail with error:

Transport endpoint is not connected

What you expected to happen:

Processes in container can access new volume mount point without restart if possible
Or containers are restarted with new bind mounts, then processes in container can access new volume mount point.
Or other way to recover in this scenario.

How to reproduce it (as minimally and precisely as possible):

Create a cephfs volume and enable fuse support
Create a pod to mount the volume
Kill ceph-fuse process on the host

Anything else we need to know?:

Should we restart container after remounting volumes of it?

Related issues

GlusterFS volumes don't remount into pods #46253

The text was updated successfully, but these errors were encountered:

fejta-bot · 2018-11-19T08:17:54Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

cofyc · 2018-11-19T08:19:34Z

/remove-lifecycle stale

fejta-bot · 2019-02-17T09:03:07Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

gnufied · 2019-02-17T16:55:36Z

/remove-lifecycle stale

fejta-bot · 2019-05-18T17:09:57Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

fejta-bot · 2019-06-17T17:56:07Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

cofyc · 2019-06-18T01:39:29Z

/lifecycle frozen

knightXun · 2019-07-26T19:18:11Z

that's really hit my s3fs-fuse. If we use s3fs mount a bucket into a host as a dir, then mount this dir into container. At this point if we restart s3fs process, the dir in the container can't be access. Always show me 'Transport endpoint is not connected'. Does the ceph-fuse has the same problem?

jingxu97 · 2019-07-26T23:50:57Z

@cofyc you mentioned "Kubelet will remount the volume later because it will find volume mount point does not exist anymore. ". But from my understanding, kubelet does not check whether mount still exist or not and remount it.

For this type problem, are the following steps necessary to recover? Is there any other way?

kubelet remount the volume,
container restarts with the new mount.

knightXun · 2019-07-27T05:43:12Z

I think the only way to fix this is to change some logic in cri and container-runtime.

cofyc · 2019-10-15T10:50:39Z

@jingxu97

I might make a mistake. I guess the userspace filesystem process (I was using ceph-fuse) was restarted by systemd.

yes, the kubelet can remount the volume and restarts the container to update the stale bind mounts in the container.

However, it's too complicated I guess. As a workaround solution, users can run the userspace filesystem process in a process supervisor (systemd), then the mount point will be remounted again after the process restarts. Second, the process in the pod can exit and wait for the kubelet to restart it again when it cannot access the filesystem or the mount point is found stale.

garenchan · 2022-09-28T08:37:42Z

Restarting container is a feasible approach. Is there a way to do this without restarting container?

k8s-ci-robot added sig/storage Categorizes an issue or PR as relevant to SIG Storage. kind/bug Categorizes issue or PR as related to a bug. labels Jul 24, 2018

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 19, 2018

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 19, 2018

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 17, 2019

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 17, 2019

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 18, 2019

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jun 17, 2019

k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. labels Jun 18, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bind mounts in container not updated after volume is remounted again #66567

Bind mounts in container not updated after volume is remounted again #66567

cofyc commented Jul 24, 2018 •

edited

fejta-bot commented Nov 19, 2018

cofyc commented Nov 19, 2018

fejta-bot commented Feb 17, 2019

gnufied commented Feb 17, 2019

fejta-bot commented May 18, 2019

fejta-bot commented Jun 17, 2019

cofyc commented Jun 18, 2019

knightXun commented Jul 26, 2019

jingxu97 commented Jul 26, 2019

knightXun commented Jul 27, 2019 •

edited

cofyc commented Oct 15, 2019

garenchan commented Sep 28, 2022

Bind mounts in container not updated after volume is remounted again #66567

Bind mounts in container not updated after volume is remounted again #66567

Comments

cofyc commented Jul 24, 2018 • edited

fejta-bot commented Nov 19, 2018

cofyc commented Nov 19, 2018

fejta-bot commented Feb 17, 2019

gnufied commented Feb 17, 2019

fejta-bot commented May 18, 2019

fejta-bot commented Jun 17, 2019

cofyc commented Jun 18, 2019

knightXun commented Jul 26, 2019

jingxu97 commented Jul 26, 2019

knightXun commented Jul 27, 2019 • edited

cofyc commented Oct 15, 2019

garenchan commented Sep 28, 2022

cofyc commented Jul 24, 2018 •

edited

knightXun commented Jul 27, 2019 •

edited