
OpenStack Cinder volumes not detached from downed VM when pod is rescheduled to another node #33288

Closed
Rotwang opened this issue Sep 22, 2016 · 2 comments · Fixed by #39055 · May be fixed by Rotwang/kubernetes#1
Labels
area/provider/openstack Issues or PRs related to openstack provider

Comments


Rotwang commented Sep 22, 2016

Kubernetes version (use kubectl version):
Server Version: version.Info{Major:"1", Minor:"5+", GitVersion:"v1.5.0-alpha.0.1062+1070a518301f0b", GitCommit:"1070a518301f0bfb6f6c8832feff0bba0c391a22", GitTreeState:"clean", BuildDate:"2016-09-20T11:34:18Z", GoVersion:"go1.6.3", Compiler:"gc", Platform:"linux/amd64"}
Also tried with 1.3.6

Environment:

What happened:
If a compute instance with an attached volume is downed (a node, in Kubernetes terms), Kubernetes never tries to detach that volume. The end result is that k8s keeps trying to attach the volume in a loop but never succeeds, because it is still attached to the downed node. I've waited more than an hour for the volume to be detached (it didn't help :c)

What you expected to happen:
I expect the controller-manager or kubelet to detach the volume before trying to attach it to a new compute instance.

How to reproduce it (as minimally and precisely as possible):
Bring up a cluster with two nodes on OpenStack. Schedule a pod with a PVC. Shut down (from the command line on the operating system) the node with the attached volume. The pod gets rescheduled to another node, but the volume stays attached to the downed node.

Anything else we need to know:
I've looked through reconcile.go in both the kubelet and the controller-manager; it seems to me that once a node is downed it is no longer on the list of managed nodes (nodesManaged in desired_state_of_world.go), so its volume is never going to be detached. I've also tried disabling --enable-controller-attach-detach on the kubelet, but it doesn't try to detach the volume either.
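
Roughly, the effect looks like the sketch below (a minimal, self-contained illustration with made-up names, not the actual reconciler code): once the downed node drops out of the set of managed nodes, its stale attachment is simply never considered for detach.

```go
package main

import "fmt"

// Minimal illustration of the behaviour described above.
// All names here are hypothetical stand-ins, not the real
// attach/detach reconciler types from desired_state_of_world.go.
type attachment struct {
	volume string
	node   string
}

func main() {
	// Actual state: the Cinder volume is still attached to node-1, which is downed.
	actual := []attachment{{volume: "cinder-vol-1", node: "node-1"}}

	// Desired state only tracks nodes that are still managed; node-1 has been dropped.
	managedNodes := map[string]bool{"node-2": true}

	// A reconcile pass that only considers managed nodes never revisits node-1,
	// so the stale attachment is never queued for detach.
	for _, att := range actual {
		if !managedNodes[att.node] {
			fmt.Printf("%s on %s is invisible to the reconciler and is never detached\n",
				att.volume, att.node)
			continue
		}
		fmt.Printf("would reconcile %s on %s\n", att.volume, att.node)
	}
}
```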


Rotwang commented Sep 23, 2016

Additionally, after bringing the compute instance (the one holding the required volume) back up, so that 'kubectl get nodes' lists it as 'Ready', the volume is still not detached (even after 3 hours). I tried with both the controller-manager and the kubelet (--enable-controller-attach-detach=False). After the volume is manually detached from the node (the node which doesn't run the pod claiming this volume), the controller-manager attaches it to the proper node.

dims added the area/provider/openstack label and removed the area/kubelet label on Nov 15, 2016
anguslees added a commit to anguslees/kubernetes that referenced this issue Dec 21, 2016
k8s-github-robot pushed a commit that referenced this issue Dec 28, 2016
Automatic merge from submit-queue (batch tested with PRs 39152, 39142, 39055)

openstack: Forcibly detach an attached cinder volume before attaching elsewhere

Fixes #33288



**What this PR does / why we need it**:
Without this fix, we can't preemptively reschedule pods with persistent volumes to other hosts (for rebalancing or hardware failure recovery).

**Which issue this PR fixes** *(optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close that issue when PR gets merged)*: fixes #33288

**Special notes for your reviewer**:
(This is a resurrection/cleanup of PR #33734, originally authored by @Rotwang)

**Release note**:
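
In rough terms, the change described in the commit message above makes the attach path detach a stale attachment before re-attaching the volume elsewhere. The sketch below illustrates that idea with made-up types; it is not the real OpenStack provider code from the PR.

```go
package main

import (
	"errors"
	"fmt"
)

// Hypothetical stand-in for the cloud provider; the type and method names are
// illustrative only, not the real Kubernetes OpenStack provider API from PR #39055.
type cloud struct {
	attachedTo map[string]string // volumeID -> instanceID
}

var errAttachedElsewhere = errors.New("volume is attached to another instance")

// AttachDisk detaches the volume from a stale instance (e.g. a downed node)
// before attaching it to the requested instance.
func (c *cloud) AttachDisk(instanceID, volumeID string) error {
	if owner, ok := c.attachedTo[volumeID]; ok && owner != instanceID {
		// Without this detach step the attach would fail forever,
		// which is the behaviour reported in this issue.
		if err := c.DetachDisk(owner, volumeID); err != nil {
			return err
		}
	}
	c.attachedTo[volumeID] = instanceID
	return nil
}

// DetachDisk removes the attachment record for the given instance.
func (c *cloud) DetachDisk(instanceID, volumeID string) error {
	if c.attachedTo[volumeID] != instanceID {
		return errAttachedElsewhere
	}
	delete(c.attachedTo, volumeID)
	return nil
}

func main() {
	c := &cloud{attachedTo: map[string]string{"cinder-vol-1": "downed-node"}}
	if err := c.AttachDisk("healthy-node", "cinder-vol-1"); err != nil {
		fmt.Println("attach failed:", err)
		return
	}
	fmt.Println("cinder-vol-1 is now attached to", c.attachedTo["cinder-vol-1"])
}
```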
DukeXar pushed a commit to DukeXar/kubernetes that referenced this issue Jan 7, 2017
@slawekww

In the scenario where a pod is force-deleted with the kubectl delete pod command, the pod is rescheduled to another node. Without the forcible-detach fix, that scenario ends in failure because the Cinder volume is still attached to another k8s node. It is very annoying, especially when the pod is hosting a database on a persistent volume.
Will this fix be part of the official Kubernetes 1.4 and/or 1.5 release?

berryjam pushed a commit to berryjam/kubernetes that referenced this issue Aug 18, 2017
dims pushed a commit to dims/kubernetes that referenced this issue Feb 8, 2018
dims pushed a commit to dims/kubernetes that referenced this issue Feb 8, 2018