
(HTTP code 409) conflict - conflict: unable to delete (cannot be forced) - image is being used by running container #841

Open
willswire opened this issue Dec 11, 2018 · 13 comments
Labels
Needs more investigation, type/bug

Comments

@willswire

willswire commented Dec 11, 2018

When implementing the delete-then-download application update strategy, devices fail to delete the existing image because it is still in use by the running container, producing the following error. Confirmed the supervisor version running is >= v2.5.1.

10.12.18 14:46:53 (-0500) Killing service 'main'
10.12.18 14:46:53 (-0500) Deleting image
10.12.18 14:46:53 (-0500) Failed to delete image due to '(HTTP code 409) conflict - conflict: unable to delete (cannot be forced) - image is being used by running container'

@willswire
Author

It is possible to bypass this issue by first stopping the container, then pushing the update. The supervisor, under the delete-then-download strategy, should do this itself rather than requiring the user to do so manually.
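
For illustration, the ordering being suggested (stop the container, wait for it to exit, then delete the image) looks roughly like this against the engine API. This is a minimal sketch assuming dockerode and the balenaEngine socket path, not the Supervisor's actual code:

```typescript
// Minimal sketch (assumption: dockerode against the balenaEngine socket) of
// the ordering suggested above: stop the old container and wait for it to
// exit before trying to delete its image. Not the Supervisor's actual code.
import Docker from 'dockerode';

const docker = new Docker({ socketPath: '/var/run/balena-engine.sock' });

async function stopThenDeleteImage(containerId: string, imageRef: string): Promise<void> {
  const container = docker.getContainer(containerId);

  await container.stop({ t: 10 }); // SIGTERM, then SIGKILL after 10 s
  await container.wait();          // block until the container has actually exited
  await container.remove();        // drop the stopped container

  // Only now will the engine accept the delete without a 409 "image in use".
  await docker.getImage(imageRef).remove();
}
```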

@willswire
Author

Same error on supervisor version 9.0.1:

17.01.19 12:12:52 (-0500) Killing service 'main sha256:08251b0aa8d6c66bd1b240ea4cf172963e81cbd8bac252fb144ba7eaaa0b41a0'
17.01.19 12:12:52 (-0500) Deleting image 'registry2.balena-cloud.com/v2/0f74bea475cd7a34f257da6be491baa8@sha256:d8b60a410da5ab912d725ed19eba5624e71359814dc09f9f83b8b71ba77fc98f'
17.01.19 12:12:52 (-0500) Failed to delete image 'registry2.balena-cloud.com/v2/0f74bea475cd7a34f257da6be491baa8@sha256:d8b60a410da5ab912d725ed19eba5624e71359814dc09f9f83b8b71ba77fc98f' due to '(HTTP code 409) conflict - conflict: unable to delete 08251b0aa8d6 (cannot be forced) - image is being used by running container c3bc0919b429 '

@CameronDiver CameronDiver added the type/bug, High priority, and Needs more investigation labels Jan 17, 2019
@CameronDiver
Contributor

Hey @willswire, thanks for the report. I'll try to reproduce this soon and see what we can do. I imagine it's something like the supervisor not giving the container time to exit, so that's where I'll start looking.

@willswire
Author

@CameronDiver thanks! If it helps at all, our current situation is:

  • Deploying a single container image (base image: balenalib/amd64-node:jessie) with a total payload of 955.08 MB
  • Device Type: WYSE Zx0 (AMD64 Architecture) with 2GB flash storage

The device will stay in a constant loop, reporting the same error.

@CameronDiver
Contributor

Thanks for the extra info.

Does the container catch and act upon signals, for example the SIGTERM that docker will send to ask a container to stop running?

I mean even if it does, this is still a bug, because the supervisor shouldn't be trying to remove the image until the container has stopped.
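
For reference, whether a container stops promptly depends on its main process handling SIGTERM. A minimal Node/TypeScript sketch of such a handler (illustrative only, not taken from willswire's project):

```typescript
// Illustrative only (not from the reporter's project): a minimal Node service
// that reacts to the SIGTERM Docker sends when asked to stop a container.
// If the process ignores SIGTERM, the engine falls back to SIGKILL after the
// stop timeout, and a hung process can leave the container "running".
import { createServer } from 'http';

const server = createServer((_req, res) => {
  res.end('ok\n');
});
server.listen(8080);

process.on('SIGTERM', () => {
  console.log('SIGTERM received, shutting down');
  // Stop accepting new connections, then exit once in-flight requests finish.
  server.close(() => process.exit(0));
});
```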

@willswire
Author

Any SIGTERM commands sent via the console, prior to initiating an update, are successful. Once the nightmarish update loop starts, however, there's no response to any 'restart', 'stop' or 'start' commands.

@CameronDiver
Contributor

CameronDiver commented Feb 5, 2019

Hey @willswire, sorry for the delay. I finally got some time to do some investigation here. I didn't manage to reproduce it, but a colleague of mine did find a potential problem in the way that the state engine handles the delete-then-download strategy.

If possible, would you be able to try a new supervisor image which should fix this, or alternatively provide me with the source code for your project (and I'll try to dig out a device of the same type)?

The changes are implemented in this PR: #893

CameronDiver pushed a commit that referenced this issue Feb 5, 2019
In the original implementation it was possible that the delete did not
wait for the kill step to be finished, so it would not be deleted.

We separate this process into two steps, to allow for the container to have stopped before proceeding.

Change-type: patch
Closes: #841
Signed-off-by: Cameron Diver <cameron@balena.io>
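
The commit message above describes splitting the work into an explicit kill step and a separate image-removal step. A simplified sketch of that ordering, with step names and shapes assumed for illustration rather than taken from the Supervisor's state engine:

```typescript
// Abstract sketch of the change described in the commit message above: the
// composite action is split into an ordered 'kill' step and a separate
// 'removeImage' step, and the second is only attempted once the first has
// fully completed. Step names and shapes are assumptions, not the
// Supervisor's actual state-engine code.
type Step =
  | { action: 'kill'; containerId: string }
  | { action: 'removeImage'; imageRef: string };

async function applySteps(
  steps: Step[],
  execute: (step: Step) => Promise<void>,
): Promise<void> {
  for (const step of steps) {
    // Awaiting each step before starting the next prevents the image removal
    // from racing with a container that is still shutting down.
    await execute(step);
  }
}

// The delete-then-download strategy now yields two ordered steps instead of
// one combined action (ids taken from the log above, for illustration).
const steps: Step[] = [
  { action: 'kill', containerId: 'c3bc0919b429' },
  { action: 'removeImage', imageRef: 'registry2.balena-cloud.com/v2/0f74bea475cd7a34f257da6be491baa8' },
];

void applySteps(steps, async (step) => {
  console.log('executing step:', step); // the corresponding engine calls would go here
});
```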
@ghost ghost assigned CameronDiver Feb 5, 2019
@ghost ghost added the flow/in-progress label Feb 5, 2019
@ghost ghost assigned balena-ci Feb 5, 2019
CameronDiver pushed a commit that referenced this issue Feb 6, 2019
@ghost ghost removed the flow/in-progress label Feb 6, 2019
@willswire
Author

@CameronDiver we can try the new supervisor image to test! How would we go about deploying the latest image to our machine?

@CameronDiver
Contributor

Thanks @willswire, I'm pretty sure it should fix your issue (hence the closing), but finding out before release is certainly better.

To do this, open a host OS terminal on your device and run update-resin-supervisor -t v9.7.1 -i balena/amd64-supervisor.

@willswire
Author

@CameronDiver thanks! The issue has been resolved.

@CameronDiver
Contributor

Really happy to hear :)

@jellyfish-bot

[cywang117] This issue has attached support thread https://jel.ly.fish/e74a1106-b0eb-4f02-8f46-b78732db1ef9

@cywang117 cywang117 removed the High priority label Feb 16, 2023
@cywang117
Contributor

For context, this error message originates from the Engine and is surfaced by the Supervisor during updates, when an image in the current release needs to be deleted in favor of an image in the target release. The Supervisor should wait for containers to stop before attempting to remove images, but if a container fails to stop even with a balena kill, then this error may still appear. Before commenting on or linking to this issue, please check whether any processes in a service fail to exit, even with a kill -9. If this error occurs in the absence of such zombie user container processes, then it is potentially a Supervisor bug.
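
For anyone hitting this, one way to check is to ask the engine which containers still reference the image the Supervisor is trying to delete. A small diagnostic sketch, assuming dockerode and the balenaEngine socket (adjust the socket path and image reference for your device):

```typescript
// Diagnostic sketch (assumptions: dockerode, balenaEngine socket): before
// filing a Supervisor bug, check whether some container that uses the image
// is genuinely still running, i.e. its process never exited even after a kill.
import Docker from 'dockerode';

const docker = new Docker({ socketPath: '/var/run/balena-engine.sock' });

async function containersUsingImage(imageRef: string) {
  const image = await docker.getImage(imageRef).inspect();
  const containers = await docker.listContainers({ all: true });
  return containers
    .filter((c) => c.ImageID === image.Id || c.Image === imageRef)
    .map((c) => ({ id: c.Id.slice(0, 12), names: c.Names, state: c.State }));
}

// Image reference taken from the log earlier in this thread, for illustration.
containersUsingImage('registry2.balena-cloud.com/v2/0f74bea475cd7a34f257da6be491baa8')
  .then((rows) => console.table(rows)) // any row with state "running" explains the 409
  .catch(console.error);
```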
