Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MGMT-15878: Ensure that hosts emit event showing why preparation failed. #5521

Merged
merged 1 commit into from Nov 6, 2023

Commits on Nov 6, 2023

  1. MGMT-15878: Ensure that hosts emit event showing why preparation failed.

    When preparation fails for a host, we do not implement any kind of a timeout for the host to indicate that this has occurred.
    
    This means that it is sometimes not possible for the user to determine the cause of a cluster timeout (which will inevitably be caused by the timeout of a host during preparation.)
    
    Presently, there are two ways in which a host may time out (no result received within the cluster timeout)
    
        An inconclusive result from the pulling of cluster images
        An inconclusive result from the disk speed check
    
    This PR introduces a timeout to detect these scenarios and report on them in a host timeout event so that the user may have a clue as to what has happened.
    
    This PR is in addition to MGMT-15814 which introduces a cluster condition to track a cluster timeout when there is a failure to configure the preparation of a cluster within a given time frame (for example if the assisted pod crashes)
    
    Together these PR's should improve the overall quality of error reporting.
    paul-maidment committed Nov 6, 2023
    Copy the full SHA
    92f3e38 View commit details
    Browse the repository at this point in the history