Skip to content

Conversation

@jerqi
Copy link
Collaborator

@jerqi jerqi commented Nov 12, 2021

What changes were proposed in this pull request?

Firestorm supports node's health check. There are some full disk nodes in our cluster, they influence the shuffle service. We should exclude them.Currently we only check disk usage, and exclude the unhealthy nodes.

Why are the changes needed?

Because we should have the mechanism to find the servers that can't serve, and screen them

Does this PR introduce any user-facing change?

No

How was this patch tested?

newly added UTs

@jerqi jerqi changed the title [Feature] Firestorm supports node's health check [WIP][Feature] Firestorm supports node's health check Nov 12, 2021
@jerqi jerqi requested review from colinmjj and duanmeng November 16, 2021 10:47
@jerqi jerqi marked this pull request as ready for review November 16, 2021 10:47
@jerqi jerqi changed the title [WIP][Feature] Firestorm supports node's health check [Feature] Firestorm supports node's health check Nov 16, 2021
@jerqi jerqi self-assigned this Nov 16, 2021
@jerqi
Copy link
Collaborator Author

jerqi commented Nov 17, 2021

@colinmjj comments are addressed.

@duanmeng
Copy link
Collaborator

Please write more details about this pr's goal maybe some background and design in the what section.

@duanmeng duanmeng added the enhancement New feature or request label Nov 18, 2021
@jerqi
Copy link
Collaborator Author

jerqi commented Nov 19, 2021

All comments are addressed.

@duanmeng
Copy link
Collaborator

LGTM

@colinmjj
Copy link
Collaborator

LGTM

@colinmjj colinmjj self-requested a review November 23, 2021 02:04
colinmjj
colinmjj previously approved these changes Nov 23, 2021
@jerqi jerqi merged commit cce8824 into Tencent:master Nov 23, 2021
@jerqi jerqi deleted the healthy_check branch April 22, 2022 11:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants