hostPath volumes used with subPath volume mounts don't support reconstruction #61446

msau42 · 2018-03-21T01:15:57Z

Is this a BUG REPORT or FEATURE REQUEST?:
@kubernetes/sig-storage-bugs

What happened:
Normally this isn't a problem, but if you use subpath with hostpath volumes, then that means the subpath mounts will not get cleaned up during the reconstruction window (a pod is force deleted while kubelet is down)

msau42 · 2018-03-21T01:16:31Z

Noticed this issue while working on #61373

liggitt · 2018-03-21T23:04:31Z

How is this different than #61372?

msau42 · 2018-03-21T23:49:03Z

#61372 affects reconstruction for PVCs

* Use of subPath module with hostPath volumes can cause issues during reconstruction ([#61446](kubernetes/kubernetes#61446)) and with containerized kubelets ([#61456](kubernetes/kubernetes#61456)). The workaround for this issue is to specify the complete path in the hostPath volume.

fejta-bot · 2018-08-06T06:28:27Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

msau42 · 2018-08-06T18:07:39Z

/remove-lifecycle stale

fejta-bot · 2018-11-04T18:34:32Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

msau42 · 2018-11-06T01:17:47Z

/remove-lifecycle stale

We intend to address this by redesigning the volume reconstruction feature to use kubelet checkpointing.

fejta-bot · 2019-02-04T02:18:36Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

fejta-bot · 2019-03-06T02:36:53Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

fejta-bot · 2019-04-05T03:21:54Z

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

k8s-ci-robot · 2019-04-05T03:22:06Z

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

msau42 · 2019-12-12T15:25:14Z

/reopen
/lifecycle frozen

k8s-ci-robot · 2019-12-12T15:25:17Z

@msau42: Reopened this issue.

In response to this:

/reopen
/lifecycle frozen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

neujie · 2019-12-20T01:56:45Z

Better practice:

kill or delete pod
clear the pod's all subpath
otherwise a lot of useless dir will be left

These tests were previously disabled to work around kubernetes#61446 and kubernetes#79980 kubernetes@f1e1f3a

dobsonj · 2022-03-25T00:12:14Z

/assign

dobsonj · 2022-03-29T17:30:59Z

The in-tree hostPath driver makes no attempt to unmount:

kubernetes/pkg/volume/hostpath/host_path.go

Lines 266 to 274 in 6c96ac0

    
           // TearDown does nothing. 
        
           func (c *hostPathUnmounter) TearDown() error { 
        
           	return nil 
        
           } 
        
           // TearDownAt does not make sense for host paths - probably programmer error. 
        
           func (c *hostPathUnmounter) TearDownAt(dir string) error { 
        
           	return fmt.Errorf("TearDownAt() does not make sense for host paths") 
        
           }

Once that is fixed, these skips need to be removed:

kubernetes/test/e2e/storage/testsuites/subpath.go

Lines 347 to 350 in 6c96ac0

    
           if strings.HasPrefix(driverName, "hostPath") { 
        
           	// TODO: This skip should be removed once #61446 is fixed 
        
           	e2eskipper.Skipf("Driver %s does not support reconstruction, skipping", driverName) 
        
           }

kubernetes/test/e2e/storage/testsuites/subpath.go

Lines 359 to 362 in 6c96ac0

    
           if strings.HasPrefix(driverName, "hostPath") { 
        
           	// TODO: This skip should be removed once #61446 is fixed 
        
           	e2eskipper.Skipf("Driver %s does not support reconstruction, skipping", driverName) 
        
           }

dobsonj · 2022-03-30T16:58:47Z

The in-tree hostPath driver makes no attempt to unmount

So it's not really the TearDown code I quoted above, the subpaths would normally be cleaned up here:

kubernetes/pkg/volume/util/operationexecutor/operation_generator.go

Lines 825 to 830 in 2e55595

    
           // Remove all bind-mounts for subPaths 
        
           podDir := filepath.Join(podsDir, string(volumeToUnmount.PodUID)) 
        
           if err := subpather.CleanSubPaths(podDir, volumeToUnmount.InnerVolumeSpecName); err != nil { 
        
           	eventErr, detailedErr := volumeToUnmount.GenerateError("error cleaning subPath mounts", err) 
        
           	return volumetypes.NewOperationContext(eventErr, detailedErr, migrated) 
        
           }

But we never make any attempt to call UnmountVolume in this scenario.

kubernetes/pkg/kubelet/volumemanager/reconciler/reconciler.go

Lines 185 to 187 in 2e55595

    
           klog.V(5).InfoS(mountedVolume.GenerateMsgDetailed("Starting operationExecutor.UnmountVolume", "")) 
        
           err := rc.operationExecutor.UnmountVolume( 
        
           	mountedVolume.MountedVolume, rc.actualStateOfWorld, rc.kubeletPodsDir)

unmountVolumes loops over rc.actualStateOfWorld.GetAllMountedVolumes(), but if we're just using a hostPath (type = Directory), there won't be a mount point, and it won't be listed as a mounted volume. But the subpaths still use bind mounts, and those never get unmounted.

For future reference it can be reproduced manually with:

kind: Pod
apiVersion: v1
metadata:
  name: my-intree-inline-app
spec:
  containers:
    - name: my-frontend
      image: busybox
      volumeMounts:
      - mountPath: "/data"
        name: my-inline-vol
      - mountPath: "/data/subpath1"
        name: my-inline-vol
        subPath: subpath1
      - mountPath: "/data/subpath2"
        name: my-inline-vol
        subPath: subpath2
      command: [ "sleep", "1000000" ]
  volumes:
    - name: my-inline-vol
      hostPath:
        path: /tmp/dir1
        type: Directory

create pod with spec above, check mountpoints
kill kubelet
force delete pod
start kubelet again
check mount points and kubelet.log

k8s-ci-robot added sig/storage Categorizes an issue or PR as relevant to SIG Storage. kind/bug Categorizes issue or PR as related to a bug. labels Mar 21, 2018

msau42 mentioned this issue Mar 22, 2018

WIP: Use containerized rootfs mount point when handling subpath #61489

Closed

liggitt mentioned this issue Mar 22, 2018

CVE-2017-1002101 - subpath volume mount handling allows arbitrary file access in host filesystem #60813

Closed

liggitt changed the title ~~Hostpath doesn't support reconstruction~~ hostPath volumes used with subPath volume mounts don't support reconstruction Mar 22, 2018

Bradamant3 mentioned this issue Mar 22, 2018

v1.10 known issues / FAQ accumulator #59764

Closed

liggitt mentioned this issue Mar 22, 2018

subPath volume mount umbrella issue #61563

Closed

8 tasks

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 6, 2018

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 6, 2018

msau42 mentioned this issue Oct 26, 2018

Make csi drivers and in-tree drivers share e2e tests #68025

Merged

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 4, 2018

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 6, 2018

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 4, 2019

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Mar 6, 2019

k8s-ci-robot closed this as completed Apr 5, 2019

k8s-ci-robot reopened this Dec 12, 2019

k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. labels Dec 12, 2019

neujie mentioned this issue Dec 20, 2019

when pod restart how to kill pause container #85261

Closed

tanjunchen mentioned this issue Jan 1, 2020

/test: resolve pending TODOs #86756

Closed

verult mentioned this issue Jul 26, 2021

[Failing Test] [sig-storage] subPath should unmount if pod is deleted while kubelet is down (ci-kubernetes-e2e-gci-gce-serial) #103651

Closed

pacoxu mentioned this issue Sep 17, 2021

Remove VolumeSubpath feature gate #105090

Merged

dobsonj added a commit to dobsonj/kubernetes that referenced this issue Mar 25, 2022

e2e: restore volume lifecycle checks for hostpath driver

caec4ad

These tests were previously disabled to work around kubernetes#61446 and kubernetes#79980 kubernetes@f1e1f3a

k8s-ci-robot assigned dobsonj Mar 25, 2022

dobsonj mentioned this issue Mar 25, 2022

Fix volume reconstruction for CSI ephemeral volumes #108997

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hostPath volumes used with subPath volume mounts don't support reconstruction #61446

hostPath volumes used with subPath volume mounts don't support reconstruction #61446

msau42 commented Mar 21, 2018

msau42 commented Mar 21, 2018

liggitt commented Mar 21, 2018

msau42 commented Mar 21, 2018

fejta-bot commented Aug 6, 2018

msau42 commented Aug 6, 2018

fejta-bot commented Nov 4, 2018

msau42 commented Nov 6, 2018

fejta-bot commented Feb 4, 2019

fejta-bot commented Mar 6, 2019

fejta-bot commented Apr 5, 2019

k8s-ci-robot commented Apr 5, 2019

msau42 commented Dec 12, 2019

k8s-ci-robot commented Dec 12, 2019

neujie commented Dec 20, 2019

dobsonj commented Mar 25, 2022

dobsonj commented Mar 29, 2022

dobsonj commented Mar 30, 2022

hostPath volumes used with subPath volume mounts don't support reconstruction #61446

hostPath volumes used with subPath volume mounts don't support reconstruction #61446

Comments

msau42 commented Mar 21, 2018

msau42 commented Mar 21, 2018

liggitt commented Mar 21, 2018

msau42 commented Mar 21, 2018

fejta-bot commented Aug 6, 2018

msau42 commented Aug 6, 2018

fejta-bot commented Nov 4, 2018

msau42 commented Nov 6, 2018

fejta-bot commented Feb 4, 2019

fejta-bot commented Mar 6, 2019

fejta-bot commented Apr 5, 2019

k8s-ci-robot commented Apr 5, 2019

msau42 commented Dec 12, 2019

k8s-ci-robot commented Dec 12, 2019

neujie commented Dec 20, 2019

dobsonj commented Mar 25, 2022

dobsonj commented Mar 29, 2022

dobsonj commented Mar 30, 2022