WINC-1839: Fix Nutanix WinC jobs by adding missing OVERRIDE_RHCOS_IMAGE#77031
WINC-1839: Fix Nutanix WinC jobs by adding missing OVERRIDE_RHCOS_IMAGE#77031rrasouli wants to merge 1 commit intoopenshift:mainfrom
Conversation
PR openshift#76620 added debug-winc-nutanix-ipi jobs to main, release-4.22, and release-4.23 but omitted the critical OVERRIDE_RHCOS_IMAGE environment variable that was already present in release-4.21 (added by PR openshift#73946). Without this override, Nutanix IPI installations fail during bootstrap because of RHCOS version mismatches between the installer's embedded metadata and the release payload's expected machine-os version. This manifests as timeout errors (exit 28) in the ipi-conf-nutanix-context step when trying to communicate with the Nutanix Prism Central API. This fix adds OVERRIDE_RHCOS_IMAGE pointing to RHCOS 9.6.20260323-1 (same version used in working 4.21 jobs) to prevent API server startup failures during cluster bootstrap. Fixes: openshift#76620
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: rrasouli The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
[REHEARSALNOTIFIER]
Prior to this PR being merged, you will need to either run and acknowledge or opt to skip these rehearsals. Interacting with pj-rehearseComment: Once you are satisfied with the results of the rehearsals, comment: |
|
/pj-rehearse pull-ci-openshift-openshift-tests-private-release-4.22-debug-winc-nutanix-ipi |
|
@rrasouli: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel. |
|
/jira-refresh |
|
@rrasouli: This pull request references WINC-1839 which is a valid jira issue. Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the bug to target the "4.22.0" version, but no target version was set. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
@rrasouli: all tests passed! Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
JIRA: https://redhat.atlassian.net/browse/WINC-1839
Problem
PR #76620 added
debug-winc-nutanix-ipipresubmit jobs to main, release-4.22, and release-4.23 branches but omitted the criticalOVERRIDE_RHCOS_IMAGEenvironment variable that was already present in release-4.21 (added by PR #73946).Symptoms:
ipi-conf-nutanix-contextstep with exit status 28 (timeout)debug-winc-nutanix-ipijobs failing on main, 4.22, and 4.23Root Cause
Nutanix IPI installations require explicit RHCOS image override to prevent version mismatches between:
When these mismatch, the API server fails to start during bootstrap phase, causing the context step to timeout when trying to communicate with Nutanix Prism Central API.
Solution
Add
OVERRIDE_RHCOS_IMAGEenvironment variable pointing to RHCOS 9.6.20260323-1 (same version used in working 4.21 jobs) to the env section ofdebug-winc-nutanix-ipijob definition in:ci-operator/config/openshift/openshift-tests-private/openshift-openshift-tests-private-main.yamlci-operator/config/openshift/openshift-tests-private/openshift-openshift-tests-private-release-4.22.yamlci-operator/config/openshift/openshift-tests-private/openshift-openshift-tests-private-release-4.23.yamlThis matches the fix applied to release-4.21 in PR #73946.
Affected Jobs
pull-ci-openshift-openshift-tests-private-main-debug-winc-nutanix-ipipull-ci-openshift-openshift-tests-private-release-4.22-debug-winc-nutanix-ipipull-ci-openshift-openshift-tests-private-release-4.23-debug-winc-nutanix-ipiReferences