Skip to content

SCHED-300: Delete wait-for-checks-job if helm release failed#1708

Merged
rdjjke merged 1 commit intosoperator-release-1.22from
delete-wait-for-checks-job-hr-failed/0
Oct 30, 2025
Merged

SCHED-300: Delete wait-for-checks-job if helm release failed#1708
rdjjke merged 1 commit intosoperator-release-1.22from
delete-wait-for-checks-job-hr-failed/0

Conversation

@rdjjke
Copy link
Collaborator

@rdjjke rdjjke commented Oct 30, 2025

Problem

If soperator-activechecks helm release failed, a new attempt fails as well because K8s job "wait-for-checks-job" already exists.

Solution

Add hook-failed and before-hook-creation values to this K8s job's "helm.sh/hook-delete-policy" annotation.

Testing

To test:

  1. Create a new cluster with a wrong path to the image used by active checks.
  2. Wait until soperator-activechecks HR failed
  3. Redeploy this HR

Release Notes

Fixed a bug that caused repeated attempts of cluster creation to fail if the deployment of active checks failed during the first attempt.

@rdjjke rdjjke added the fix label Oct 30, 2025
@rdjjke rdjjke changed the title Delete wait-for-checks-job if helm release failed SCHED-300: Delete wait-for-checks-job if helm release failed Oct 30, 2025
@rdjjke rdjjke merged commit b1c8408 into soperator-release-1.22 Oct 30, 2025
4 of 5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants