CI step "Set toolchain version" can fail without stopping CI job #225

joshlf · 2023-08-03T18:32:43Z

In this CI job, it appears that a network error causes issues with the "Set toolchain version" step, but the GitHub runner does not fail at that point. Instead, later steps are executed with bad data (namely, the ZC_TOOLCHAIN environment variable is set to an empty string), causing confusing errors.

I suspect that what's happening is that the shell script executed for that step has a bug that results in errors inside of $(...) to not respect the set -e at the top of the script.

The text was updated successfully, but these errors were encountered:

zoo868e · 2023-08-25T16:41:40Z

Hi @joshlf , I would like to try it.

joshlf · 2023-08-25T16:44:21Z

Hey @zoo868e, sounds good, thanks! Let me know if you have any questions at any point!

zoo868e · 2023-08-25T20:57:47Z

Hello @joshlf,

It seems that the issue is arising intermittently, possibly due to a GitHub action facing difficulties in fetching dependencies during the execution of cargo metadata. After replicating the problem, I managed to execute all the tasks successfully, including a re-run of the previously failed job.

Therefore, I'm interested to know if there is a reliable method to verify whether the issue has been resolved, given that it doesn't occur consistently.

The approach that comes to mind for me to assess the issue's current status is to run the CI process multiple times and hope that the problem shows up.

Moreover, the failures are consistently caused by difficulties in fetching the syn package. Perhaps we could consider changing the source of the crate?

Thank you.

Fixes google#225 The reason `set -e` failed to interrupt the workflow is due to the output redirection to `jq`. Since `jq` returns a value of 0, indicating success, the script continues to execute. You can try running these scripts locally to gain insight into the issue. To illustrate: Script without interruption: ```sh set -e function pkg-meta { cargo metadata --format-version 1 | jq -r ".packages[] | select(.name\ == \"zerocopy\").$1" } ZC_TOOLCHAIN=\"$(pkg-meta 'metadata.ci.\"pinned-nightly\"')\" echo \"Discovered that the nightly toolchain is $ZC_TOOLCHAIN\" echo \"ZC_TOOLCHAIN=$ZC_TOOLCHAIN\" ``` Script with interruption: ```sh set -e function pkg-meta { cargo metadata --format-version 1" } ZC_TOOLCHAIN=\"$(pkg-meta 'metadata.ci.\"pinned-nightly\"')\" echo \"Discovered that the nightly toolchain is $ZC_TOOLCHAIN\" echo \"ZC_TOOLCHAIN=$ZC_TOOLCHAIN\" ``` Additionally, you can display the return value after the command execution. This output will confirm the successful execution of the command. ```sh set -e function pkg-meta { cargo metadata --format-version 1 | jq -r \".packages[] | select(\ .name == \"zerocopy\").$1\" echo echo \$? } ZC_TOOLCHAIN=\"$(pkg-meta 'metadata.ci.\"pinned-nightly\"')\" echo \"Discovered that the nightly toolchain is $ZC_TOOLCHAIN\" echo \"ZC_TOOLCHAIN=$ZC_TOOLCHAIN\" ``` Therefore, I've incorporated `cargo check` to validate the package retrieval. Subsequent to this check, there's no necessity to fetch the package again using `cargo metadata`. This modification should rectify the point of failure in the Git action.

To bolster reliability, incorporate the -o pipefail option, which designates a command pipeline's overall return status as failed if any individual command within it fails. Following this, integrating the -e option can empower the script with the ability to promptly halt execution. Fixes #225

joshlf added the compatibility-nonbreaking Changes that are (likely to be) non-breaking label Aug 12, 2023

joshlf mentioned this issue Aug 20, 2023

Roadmap #98

Open

zoo868e mentioned this issue Aug 25, 2023

Cargo check before get the metadata #289

Merged

joshlf closed this as completed in #289 Aug 27, 2023

joshlf mentioned this issue Aug 29, 2023

CI step "Set toolchain version" is flaky due to network timeouts #295

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CI step "Set toolchain version" can fail without stopping CI job #225

CI step "Set toolchain version" can fail without stopping CI job #225

joshlf commented Aug 3, 2023

zoo868e commented Aug 25, 2023

joshlf commented Aug 25, 2023

zoo868e commented Aug 25, 2023 •

edited

CI step "Set toolchain version" can fail without stopping CI job #225

CI step "Set toolchain version" can fail without stopping CI job #225

Comments

joshlf commented Aug 3, 2023

zoo868e commented Aug 25, 2023

joshlf commented Aug 25, 2023

zoo868e commented Aug 25, 2023 • edited

zoo868e commented Aug 25, 2023 •

edited