From b5989c7ab1efac1daa7eb37736ce608292538aa1 Mon Sep 17 00:00:00 2001 From: Alexander Maslennikov Date: Mon, 10 Nov 2025 10:25:21 +0100 Subject: [PATCH] fix: add a note on DGX's FW_RESET_AFTER_CONFIG_UPDATE env var feature gate Signed-off-by: Alexander Maslennikov --- docs/nic-conf-operator/nic-fw-configuration.rst | 8 ++++++++ 1 file changed, 8 insertions(+) diff --git a/docs/nic-conf-operator/nic-fw-configuration.rst b/docs/nic-conf-operator/nic-fw-configuration.rst index 95128192..9a8ac58e 100644 --- a/docs/nic-conf-operator/nic-fw-configuration.rst +++ b/docs/nic-conf-operator/nic-fw-configuration.rst @@ -83,6 +83,9 @@ Install the NIC Configuration Operator and observe NIC devices in the cluster After deploying the NFS server and NFS CSI driver, the `storage class `_ should become available in the cluster. The name of the storage class should then be passed when configuring the NIC Configuration Operator. To disable the Firmware upgrade and validation logic, do not define the ``nicFirmwareStorage`` section in the NicClusterPolicy CR. +.. note:: + On some DGX servers, the configuration update is not successfully applied after the warm reboot. In this case, it is recommended to explicitly reset the NIC's Firmware before the reboot and after updating its non-volatile configuration. This can be achieved by specifying the ``FW_RESET_AFTER_CONFIG_UPDATE`` environment variable in the NicClusterPolicy CR. Please see the commented section in the example below. + First install the Network Operator helm chart with the Maintenance Operator enabled and deploy a NIC Cluster Policy CRD with NIC Configuration Operator and DOCA-OFED Driver enabled: ``values.yaml``: @@ -111,6 +114,11 @@ First install the Network Operator helm chart with the Maintenance Operator enab image: nic-configuration-operator-daemon repository: |nic-configuration-operator-repository| version: |nic-configuration-operator-version| + # Uncomment to explicitely reset the NIC's Firmware before the reboot and after updating its non-volatile configuration. + # Might be required on DGX servers where configuration update is not successfully applied after the warm reboot. + # env: + # - name: "FW_RESET_AFTER_CONFIG_UPDATE" + # value: "true" nicFirmwareStorage: create: true pvcName: nic-fw-storage-pvc