Interrupting SDK install step in CI breaks CI #1052

oleflb · 2024-06-09T15:09:37Z

When a CI step is interruped while it is installing a SDK, it may result in a partial SDK.
This is fixed by deleting the broken SDK and ensuring that the SDK install step runs through successfully.
Maybe we could add a seperate sdk install step in the CI that is not interruptable. Alternatively, we could disable fail-fast entirely for the build step

The text was updated successfully, but these errors were encountered:

knoellle · 2024-06-09T15:13:09Z

Another solution would be to make the SDK install atomic by installing to a temp directory first and then mving it into place like we do with the downloads since #747

oleflb · 2024-06-09T16:13:52Z

Great idea @knoellle, would also fix such issues outside the CI

knoellle · 2024-06-09T18:23:39Z

I tried implementing this approach but it doesn't work since the installation process bakes the installation path into many of the files.
Some alternative approaches:

Create an "hey, this didn't finish correctly" marker file which is removed after the sdk installation reports success.
Requires changing the "is sdk already installed" detection which may break if we forget to do so at some point but it is probably the best option.
Remove the installation directory on error.
This would still break in cases where pepsi dies at the same time as the installer as would likely be the case in an aborted CI job.
sed -i over the directory after installation to fix the paths.
Very hacky, does not spark joy.

What do you think?

schmidma · 2024-06-10T06:55:36Z

Is there anything against a pepsi sdk install action in the CI build jobs, that cannot be interrupted? If there is a released version of the SDK, it is a good idea to install it to the CI runners.

knoellle · 2024-06-10T08:16:50Z

Sure, that would (probably) fix the CI issues.
However, I had hoped to also fix this issue for people installing the SDK on their machines.
If you Ctrl+C a pepsi upload during sdk installation, you will likely be met with very cryptic error messages when you run the command again.

schmidma · 2024-06-10T09:55:31Z

We could also integrate such a feature to the SDK install script by patching poky

knoellle · 2024-06-10T10:43:00Z

How Aufwand would that be? Patches break more easily when updating versions.
Also, which of the suggested solutions?

The marker file would still have to be checked by pepsi before using the sdk and at that point pepsi might as well create/remove the marker file too.
Removing the partial installation on error isn't reliable because the cleanup code may never be executed depending on the kind of error.
The sed after mv I don't think we should do either way.

I'm favoring solution 1 implemented in pepsi.

oleflb added the tools:CI label Jun 9, 2024

oleflb changed the title ~~Aborting SDK install step in CI breaks CI~~ Interrupting SDK install step in CI breaks CI Jun 9, 2024

knoellle self-assigned this Jun 9, 2024

knoellle mentioned this issue Jun 10, 2024

Robustify SDK installation #1053

Merged

knoellle closed this as completed in #1053 Jun 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Interrupting SDK install step in CI breaks CI #1052

Interrupting SDK install step in CI breaks CI #1052

oleflb commented Jun 9, 2024

knoellle commented Jun 9, 2024

oleflb commented Jun 9, 2024

knoellle commented Jun 9, 2024 •

edited

Loading

schmidma commented Jun 10, 2024

knoellle commented Jun 10, 2024

schmidma commented Jun 10, 2024

knoellle commented Jun 10, 2024

Interrupting SDK install step in CI breaks CI #1052

Interrupting SDK install step in CI breaks CI #1052

Comments

oleflb commented Jun 9, 2024

knoellle commented Jun 9, 2024

oleflb commented Jun 9, 2024

knoellle commented Jun 9, 2024 • edited Loading

schmidma commented Jun 10, 2024

knoellle commented Jun 10, 2024

schmidma commented Jun 10, 2024

knoellle commented Jun 10, 2024

knoellle commented Jun 9, 2024 •

edited

Loading