Skip to content

WINC-1839: Fix Nutanix WinC jobs by adding missing OVERRIDE_RHCOS_IMAGE#77031

Open
rrasouli wants to merge 1 commit intoopenshift:mainfrom
rrasouli:fix-winc-nutanix-rhcos-override
Open

WINC-1839: Fix Nutanix WinC jobs by adding missing OVERRIDE_RHCOS_IMAGE#77031
rrasouli wants to merge 1 commit intoopenshift:mainfrom
rrasouli:fix-winc-nutanix-rhcos-override

Conversation

@rrasouli
Copy link
Copy Markdown
Contributor

@rrasouli rrasouli commented Mar 29, 2026

JIRA: https://redhat.atlassian.net/browse/WINC-1839

Problem

PR #76620 added debug-winc-nutanix-ipi presubmit jobs to main, release-4.22, and release-4.23 branches but omitted the critical OVERRIDE_RHCOS_IMAGE environment variable that was already present in release-4.21 (added by PR #73946).

Symptoms:

Root Cause

Nutanix IPI installations require explicit RHCOS image override to prevent version mismatches between:

  1. Installer binary's embedded RHCOS stream metadata
  2. Release payload's expected machine-os version

When these mismatch, the API server fails to start during bootstrap phase, causing the context step to timeout when trying to communicate with Nutanix Prism Central API.

Solution

Add OVERRIDE_RHCOS_IMAGE environment variable pointing to RHCOS 9.6.20260323-1 (same version used in working 4.21 jobs) to the env section of debug-winc-nutanix-ipi job definition in:

  • ci-operator/config/openshift/openshift-tests-private/openshift-openshift-tests-private-main.yaml
  • ci-operator/config/openshift/openshift-tests-private/openshift-openshift-tests-private-release-4.22.yaml
  • ci-operator/config/openshift/openshift-tests-private/openshift-openshift-tests-private-release-4.23.yaml

This matches the fix applied to release-4.21 in PR #73946.

Affected Jobs

  • pull-ci-openshift-openshift-tests-private-main-debug-winc-nutanix-ipi
  • pull-ci-openshift-openshift-tests-private-release-4.22-debug-winc-nutanix-ipi
  • pull-ci-openshift-openshift-tests-private-release-4.23-debug-winc-nutanix-ipi

References

PR openshift#76620 added debug-winc-nutanix-ipi jobs to main, release-4.22, and
release-4.23 but omitted the critical OVERRIDE_RHCOS_IMAGE environment
variable that was already present in release-4.21 (added by PR openshift#73946).

Without this override, Nutanix IPI installations fail during bootstrap
because of RHCOS version mismatches between the installer's embedded
metadata and the release payload's expected machine-os version. This
manifests as timeout errors (exit 28) in the ipi-conf-nutanix-context
step when trying to communicate with the Nutanix Prism Central API.

This fix adds OVERRIDE_RHCOS_IMAGE pointing to RHCOS 9.6.20260323-1
(same version used in working 4.21 jobs) to prevent API server startup
failures during cluster bootstrap.

Fixes: openshift#76620
@openshift-ci openshift-ci bot requested review from lihongan and memodi March 29, 2026 20:22
@openshift-ci
Copy link
Copy Markdown
Contributor

openshift-ci bot commented Mar 29, 2026

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: rrasouli
Once this PR has been reviewed and has the lgtm label, please assign jhuttana for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot
Copy link
Copy Markdown
Contributor

[REHEARSALNOTIFIER]
@rrasouli: the pj-rehearse plugin accommodates running rehearsal tests for the changes in this PR. Expand 'Interacting with pj-rehearse' for usage details. The following rehearsable tests have been affected by this change:

Test name Repo Type Reason
pull-ci-openshift-openshift-tests-private-release-4.23-debug-winc-nutanix-ipi openshift/openshift-tests-private presubmit Ci-operator config changed
pull-ci-openshift-openshift-tests-private-release-4.22-debug-winc-nutanix-ipi openshift/openshift-tests-private presubmit Ci-operator config changed
pull-ci-openshift-openshift-tests-private-main-debug-winc-nutanix-ipi openshift/openshift-tests-private presubmit Ci-operator config changed

Prior to this PR being merged, you will need to either run and acknowledge or opt to skip these rehearsals.

Interacting with pj-rehearse

Comment: /pj-rehearse to run up to 5 rehearsals
Comment: /pj-rehearse skip to opt-out of rehearsals
Comment: /pj-rehearse {test-name}, with each test separated by a space, to run one or more specific rehearsals
Comment: /pj-rehearse more to run up to 10 rehearsals
Comment: /pj-rehearse max to run up to 25 rehearsals
Comment: /pj-rehearse auto-ack to run up to 5 rehearsals, and add the rehearsals-ack label on success
Comment: /pj-rehearse list to get an up-to-date list of affected jobs
Comment: /pj-rehearse abort to abort all active rehearsals
Comment: /pj-rehearse network-access-allowed to allow rehearsals of tests that have the restrict_network_access field set to false. This must be executed by an openshift org member who is not the PR author

Once you are satisfied with the results of the rehearsals, comment: /pj-rehearse ack to unblock merge. When the rehearsals-ack label is present on your PR, merge will no longer be blocked by rehearsals.
If you would like the rehearsals-ack label removed, comment: /pj-rehearse reject to re-block merging.

@rrasouli
Copy link
Copy Markdown
Contributor Author

/pj-rehearse pull-ci-openshift-openshift-tests-private-release-4.22-debug-winc-nutanix-ipi

@openshift-ci-robot
Copy link
Copy Markdown
Contributor

@rrasouli: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@rrasouli
Copy link
Copy Markdown
Contributor Author

/jira-refresh

@rrasouli rrasouli changed the title Fix Nutanix WinC jobs by adding missing OVERRIDE_RHCOS_IMAGE WINC-1839: Fix Nutanix WinC jobs by adding missing OVERRIDE_RHCOS_IMAGE Mar 29, 2026
@openshift-ci-robot openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Mar 29, 2026
@openshift-ci-robot
Copy link
Copy Markdown
Contributor

openshift-ci-robot commented Mar 29, 2026

@rrasouli: This pull request references WINC-1839 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the bug to target the "4.22.0" version, but no target version was set.

Details

In response to this:

JIRA: https://redhat.atlassian.net/browse/WINC-1839

Problem

PR #76620 added debug-winc-nutanix-ipi presubmit jobs to main, release-4.22, and release-4.23 branches but omitted the critical OVERRIDE_RHCOS_IMAGE environment variable that was already present in release-4.21 (added by PR #73946).

Symptoms:

Root Cause

Nutanix IPI installations require explicit RHCOS image override to prevent version mismatches between:

  1. Installer binary's embedded RHCOS stream metadata
  2. Release payload's expected machine-os version

When these mismatch, the API server fails to start during bootstrap phase, causing the context step to timeout when trying to communicate with Nutanix Prism Central API.

Solution

Add OVERRIDE_RHCOS_IMAGE environment variable pointing to RHCOS 9.6.20260323-1 (same version used in working 4.21 jobs) to the env section of debug-winc-nutanix-ipi job definition in:

  • ci-operator/config/openshift/openshift-tests-private/openshift-openshift-tests-private-main.yaml
  • ci-operator/config/openshift/openshift-tests-private/openshift-openshift-tests-private-release-4.22.yaml
  • ci-operator/config/openshift/openshift-tests-private/openshift-openshift-tests-private-release-4.23.yaml

This matches the fix applied to release-4.21 in PR #73946.

Affected Jobs

  • pull-ci-openshift-openshift-tests-private-main-debug-winc-nutanix-ipi
  • pull-ci-openshift-openshift-tests-private-release-4.22-debug-winc-nutanix-ipi
  • pull-ci-openshift-openshift-tests-private-release-4.23-debug-winc-nutanix-ipi

References

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@openshift-ci
Copy link
Copy Markdown
Contributor

openshift-ci bot commented Mar 29, 2026

@rrasouli: all tests passed!

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

jira/valid-reference Indicates that this PR references a valid Jira ticket of any type.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants