This repository has been archived by the owner on Feb 9, 2024. It is now read-only.

[7.0.x] Degrade system status in case of offline nodes #2131

Merged
bernardjkim merged 1 commit into version/7.0.x from bernard/7.0.x/status-fix on Sep 16, 2020

Conversation

bernardjkim
Contributor

@bernardjkim bernardjkim commented Sep 15, 2020

Description

This PR updates gravity status to report a degraded status if any nodes are offline. Prior to this change, planet status could report an active cluster status while failing to collect the status of some nodes. This was observed in #2100, where a serf node was partitioned off the cluster for too long and was eventually kicked out of the serf cluster.
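The rule described above can be illustrated with a minimal Go sketch. It is not the code from this PR: the type names, constants, and the systemState function are hypothetical, and it assumes the list of expected cluster nodes is known independently of the statuses that were actually collected. A node with no collected status is treated as offline, and any node that is not running degrades the overall status.

```go
package main

import "fmt"

// NodeState and SystemState are hypothetical types for this sketch;
// they are not the types used by gravity or planet.
type NodeState string

const (
	NodeRunning  NodeState = "running"
	NodeDegraded NodeState = "degraded"
	NodeOffline  NodeState = "offline"
)

type SystemState string

const (
	SystemRunning  SystemState = "running"
	SystemDegraded SystemState = "degraded"
)

// systemState returns SystemDegraded if any expected node is not running.
// A node that is missing from the collected statuses is treated as offline,
// so a failed status collection can no longer hide behind an active status.
func systemState(expected []string, collected map[string]NodeState) SystemState {
	for _, name := range expected {
		state, ok := collected[name]
		if !ok {
			// No status was collected for this node: assume it is offline.
			state = NodeOffline
		}
		if state != NodeRunning {
			return SystemDegraded
		}
	}
	return SystemRunning
}

func main() {
	expected := []string{"node-1", "node-2", "node-3"}
	// node-3 was removed from the serf cluster, so no status was collected for it.
	collected := map[string]NodeState{
		"node-1": NodeRunning,
		"node-2": NodeRunning,
	}
	fmt.Println(systemState(expected, collected)) // prints "degraded"
}
```

Iterating over the expected members rather than the collected statuses is the design point of the sketch: a node that silently drops out of the collected set still counts against the overall status.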

Type of change

  • Bug fix (non-breaking change which fixes an issue)

Linked tickets and other PRs

TODOs

  • Self-review the change
  • Perform manual testing
  • Address review feedback

Testing done

Verify cluster status is degraded after a member leaves the serf cluster

  • Install a 3-node cluster.
  • On one of the nodes, execute serf force-leave -prune <another-node>.
  • Run serf members and confirm the pruned node is no longer listed.
  • Run gravity status and confirm the cluster is reported as degraded.

@bernardjkim bernardjkim requested review from a team, r0mant and knisbet September 15, 2020 18:30
@bernardjkim bernardjkim merged commit 6a49920 into version/7.0.x Sep 16, 2020
@bernardjkim bernardjkim deleted the bernard/7.0.x/status-fix branch September 16, 2020 17:30