MHC Controller keeps issuing warning although the MachineDeployment does not have any unhealthy machines #10585

Levi080513 · 2024-05-10T03:21:34Z

What steps did you take and what happened?

Create a CAPI cluster.
Create an MHC and configure unhealthyRange like this:

spec:
  clusterName: hw-sks-test-logging
  maxUnhealthy: 100%
  nodeStartupTimeout: 5m0s
  selector:
    matchLabels:
      cluster.x-k8s.io/cluster-name: hw-sks-test-logging
      cluster.x-k8s.io/deployment-name: hw-sks-test-logging-logging
  unhealthyConditions:
  - status: Unknown
    timeout: 5m0s
    type: Ready
  - status: "False"
    timeout: 5m0s
    type: Ready
  unhealthyRange: '[1-3]'

The MHC controller keeps issuing warnings even though the cluster does not have any unhealthy machines. The warning events look like this:

3m52s       Warning   RemediationRestricted   machinehealthcheck/hw-sks-test-logging-logging     Remediation is not allowed, the number of not started or unhealthy machines does not fall within the range (total: 3, unhealthy: 0, unhealthyRange: [1-3])
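To make the numbers in the event concrete, here is a minimal sketch of an inclusive range check. The helpers `parseRange` and `inRange` are hypothetical names chosen for illustration and are not the Cluster API implementation; the sketch only assumes that a range such as `[1-3]` means "between 1 and 3 unhealthy machines, inclusive". Under that assumption, an unhealthy count of 0 falls outside the range, which is why remediation is reported as restricted even though nothing needs remediating.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// parseRange parses a string such as "[1-3]" into inclusive bounds.
// Hypothetical helper for illustration only.
func parseRange(s string) (int, int, error) {
	trimmed := strings.TrimSuffix(strings.TrimPrefix(s, "["), "]")
	parts := strings.Split(trimmed, "-")
	if len(parts) != 2 {
		return 0, 0, fmt.Errorf("malformed range %q", s)
	}
	lo, err := strconv.Atoi(parts[0])
	if err != nil {
		return 0, 0, fmt.Errorf("invalid lower bound in %q: %w", s, err)
	}
	hi, err := strconv.Atoi(parts[1])
	if err != nil {
		return 0, 0, fmt.Errorf("invalid upper bound in %q: %w", s, err)
	}
	if lo > hi {
		return 0, 0, fmt.Errorf("lower bound exceeds upper bound in %q", s)
	}
	return lo, hi, nil
}

// inRange reports whether the unhealthy count lies within [lo, hi].
func inRange(unhealthy, lo, hi int) bool {
	return unhealthy >= lo && unhealthy <= hi
}

func main() {
	lo, hi, err := parseRange("[1-3]")
	if err != nil {
		panic(err)
	}
	// Values taken from the event above: total 3, unhealthy 0.
	fmt.Println("remediation allowed:", inRange(0, lo, hi))
}
```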

What did you expect to happen?

Could we make a small optimization so that the MHC controller does not generate this warning when the MachineDeployment has no unhealthy machines? The warning is easily misleading.
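The requested behavior can be sketched as a small guard in front of the event emission. This is an illustrative assumption, not the controller's actual code: `shouldEmitRestrictedEvent` is a hypothetical function name, and the real controller's event-recording logic is not shown in this issue.

```go
package main

import "fmt"

// shouldEmitRestrictedEvent decides whether a RemediationRestricted warning
// is worth publishing. Hypothetical helper for illustration only.
func shouldEmitRestrictedEvent(unhealthy int, remediationAllowed bool) bool {
	if remediationAllowed {
		// Remediation can proceed, so there is nothing to warn about.
		return false
	}
	// With no unhealthy machines there is nothing to remediate,
	// so a "restricted" warning would only be noise.
	return unhealthy > 0
}

func main() {
	// The case from this issue: 0 unhealthy machines, range check fails.
	fmt.Println(shouldEmitRestrictedEvent(0, false))
	// A case where the warning is still useful: 4 unhealthy, range [1-3].
	fmt.Println(shouldEmitRestrictedEvent(4, false))
}
```

With this guard, the warning is kept for the case where unhealthy machines exist but remediation is blocked, and dropped when every machine is healthy.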

Cluster API version

v1.6.4

Kubernetes version

v1.25.16

Anything else you would like to add?

No response

Label(s) to be applied

/kind bug
One or more /area label. See https://github.com/kubernetes-sigs/cluster-api/labels?q=area for the list of labels.


sbueringer · 2024-05-10T16:00:40Z

Sounds good!

enxebre · 2024-05-13T07:16:33Z

/kind cleanup
/area machinehealthcheck
/triage accepted

fabriziopandini · 2024-05-22T13:01:11Z

/close
The PR has already been merged.

k8s-ci-robot · 2024-05-22T13:01:15Z

@fabriziopandini: Closing this issue.

In response to this:

/close
The PR has already been merged.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

k8s-ci-robot added the kind/bug and needs-triage labels May 10, 2024

Levi080513 mentioned this issue May 13, 2024

🐛 Skip publishing the RemediationRestricted event when there are no unhealthy target #10591

Merged

k8s-ci-robot closed this as completed May 22, 2024
