crc start issues after forceful shutdown #325

cfergeau · 2019-07-18T12:39:37Z

I've seen this on hyperkit, when the VM is forcefully shutdown (for example if one interrupts the 3 minutes wait for the cluster to be up), then next start fails with

command : sudo podman run  --ip 10.88.0.8 --name dnsmasq -v /var/srv/dnsmasq.conf:/etc/dnsmasq.conf -p 53:53/udp --privileged -d quay.io/crcont/dnsmasq:latest
err     : exit status 125
output  : must provide image ID and image name to use an image: invalid argument

The dnsmasq local image is indeed in an odd state

$ sudo podman image inspect quay.io/crcont/dnsmasq:latest
error parsing image data "851bb0e5bf751cba2d649612a47651890a86eafe629308e1b3273c16b71b047e": readlink /var/lib/containers/storage/overlay/l/5BCUDYQEZFQFL56HPBCM376SLM: no such file or directory

The podman version on our image is old:

$ podman version
Version:       1.0.2-dev
Go Version:    go1.11.5
OS/Arch:       linux/amd64

This may or may not be related to containers/podman#3345 (comment)

The text was updated successfully, but these errors were encountered:

gbraad · 2019-07-18T12:50:02Z

@giuseppe any thoughts on this? We use an RHCOS image

giuseppe · 2019-07-19T08:41:00Z

@gbraad that version of podman is really old and a lot of changes/improvements went into. The issue you've linked is related to rootless containers, even if the error message looks similar.

Looks like the storage is corrupted (in this case missing symlinks), because of the forced shutdown. I'd suggest to remove that image and re-pull it again.

cfergeau · 2019-07-19T09:03:46Z

@giuseppe this is the version of podman which is shipped on the RH coreos image openshift uses (and this is also the default version of podman in rhel8.0).
Removing the image indeed helps, but this is happening close to 100% of the time when the VM is forcefully shutdown (ie unclean shutdown). Is this expected, or is this something that newer podman version are likely to be more robust against?

gbraad · 2019-07-21T14:01:48Z

Alternatively we have to stop the container BEFORE doing a stop of the VM, but that sounds like a workaround to an issue that can happen outside of CRC.

gbraad · 2019-07-21T14:04:06Z

repulling the image is not an option, as we need to ensure we can start from a disconnected state (no guarantee that we can pull an image from a remote reqistry on the internet, like quay). Alternatively we can export the image and place the archive inside the VM, so we can always re-import it. But again, this sounds like a workaround for an issue with the podman version delivered with RHCOS(?).

@ashcrow Are newer versions of podman considered or available for use with RHCOS?

ashcrow · 2019-07-22T19:03:03Z

@gbraad yes, newer versions are pulled in when requested. @lsm5 works with us when a new version is required to be pulled in.

lsm5 · 2019-07-22T19:45:15Z

soon, it'll be @jnovy

cfergeau · 2019-09-19T13:10:15Z

I can still reproduce with an image based off OpenShift 4.1.14 (podman-1.0.2-1.dev.git96ccc2e.el8.x86_64). @ashcrow, @jnovy, any plans to update podman to a newer version?

jnovy · 2019-09-19T13:35:12Z

@cfergeau @ashcrow @lsm5 I will provide you with an internal scratch build of podman-1.4.2-5.el8, if that looks good I will push to have it included in rhaos-4.1. Sounds like a plan?

ashcrow · 2019-09-19T14:16:01Z

Works for me! Thanks @jnovy.

cfergeau · 2019-10-11T11:14:25Z

Tested a 4.2.0-rc.2 image, and could not reproduce the issue, so it's probably fixed there by the upgrade to a newer podman version.

cfergeau · 2019-10-11T11:18:57Z

Closing this, we can reopen if the issue reoccurs

gbraad mentioned this issue Aug 26, 2019

crc start fails after stopped #443

Closed

cfergeau closed this as completed Oct 11, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

crc start issues after forceful shutdown #325

crc start issues after forceful shutdown #325

cfergeau commented Jul 18, 2019

gbraad commented Jul 18, 2019

giuseppe commented Jul 19, 2019

cfergeau commented Jul 19, 2019

gbraad commented Jul 21, 2019

gbraad commented Jul 21, 2019 •

edited

Loading

ashcrow commented Jul 22, 2019

lsm5 commented Jul 22, 2019

cfergeau commented Sep 19, 2019

jnovy commented Sep 19, 2019

ashcrow commented Sep 19, 2019

cfergeau commented Oct 11, 2019

cfergeau commented Oct 11, 2019

crc start issues after forceful shutdown #325

crc start issues after forceful shutdown #325

Comments

cfergeau commented Jul 18, 2019

gbraad commented Jul 18, 2019

giuseppe commented Jul 19, 2019

cfergeau commented Jul 19, 2019

gbraad commented Jul 21, 2019

gbraad commented Jul 21, 2019 • edited Loading

ashcrow commented Jul 22, 2019

lsm5 commented Jul 22, 2019

cfergeau commented Sep 19, 2019

jnovy commented Sep 19, 2019

ashcrow commented Sep 19, 2019

cfergeau commented Oct 11, 2019

cfergeau commented Oct 11, 2019

gbraad commented Jul 21, 2019 •

edited

Loading