[server] Support Cluster Health API for safe rolling upgrades

### Search before asking

- [x] I searched in the [issues](https://github.com/apache/fluss/issues) and found nothing similar.


### Motivation

During Kubernetes StatefulSet rolling upgrades, the next `TabletServer` pod should not restart until all replicas from the previously restarted pod have fully recovered (leaders re-elected, ISR restored). Without this, cascading restarts can cause data unavailability or prolonged under-replication.

Currently there is no server-side API to determine whether the cluster has finished recovery. Operators rely on TCP-only readiness probes, which pass as soon as the process binds its port — long before replica recovery completes.

### Solution

  Add a `GetClusterHealth` RPC to the Coordinator that computes cluster health from in-memory
  state (CoordinatorContext). The API returns replica statistics and an overall health status:

  - **GREEN** — all replicas are in-sync and all leaders are active.
  - **YELLOW** — all leaders are active, but some replicas have not yet rejoined ISR.
  - **RED** — one or more leaders have not been confirmed active (election or KV recovery in progress).
  - **UNKNOWN** — health could not be determined.

  A readiness-probe shell script (`readiness-check.sh`) performs a two-step check:
  1. TCP port check (local liveness)
  2. Cluster Health API query (only pass on GREEN)

  This gates StatefulSet rolling upgrades: the next pod only restarts when the cluster is fully healthy.

### Anything else?

_No response_

### Willingness to contribute

- [x] I'm willing to submit a PR!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[server] Support Cluster Health API for safe rolling upgrades #3399

Search before asking

Motivation

Solution

Anything else?

Willingness to contribute

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

[server] Support Cluster Health API for safe rolling upgrades #3399

Description

Search before asking

Motivation

Solution

Anything else?

Willingness to contribute

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions