Skip to content

Latest commit

 

History

History
33 lines (21 loc) · 1.82 KB

compute-node-not-shown-error-state.md

File metadata and controls

33 lines (21 loc) · 1.82 KB
title description ms.date ms.reviewer ms.service ms.custom
HPC compute nodes aren't shown or are in error state
Provides a solution for certificate and network errors that occur when you deploy an HPC Pack cluster on Azure.
09/14/2022
hclvteam, cargonz, v-weizhu
hpcpack
sap:HPC Pack

HPC compute nodes aren't shown or are in error state

This article provides a solution for an issue in which compute nodes aren't shown or are in an error state when you deploy a high-performance computing (HPC) cluster on Azure.

Symptoms

When you deploy an HPC cluster on Azure, the deployment succeeds but compute nodes aren't shown or are in an error state. You see certificate and network-related errors in the HpcNodeManager service logs on the compute node.

Here's a network error example:

System.Net.Sockets.SocketException: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond x.x.x.x:xxx

Cause

This issue occurs because of network configuration issues or some type of certificate mismatch.

Resolution

Check network and certificate errors in the HpcNodeManager service logs on the compute node.

Correct certificate mismatches if there are any. For more information, see Manage Certificates for HPC Pack 2019 Cluster.

To resolve network issues with the error above, configure the Network Security Group rules and firewall rules. For more information, see the "Configure Network Security Group for Azure virtual network" section in Burst to Azure IaaS VM from an HPC Pack Cluster.

[!INCLUDE Azure Help Support]