kvserver: pause replication activity in a cluster #81953
Labels
A-kv-decom-rolling-restart
Decommission and Rolling Restarts
A-kv-replication
Relating to Raft, consensus, and coordination.
C-enhancement
Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception)
P-3
Issues/test failures with no fix SLA
T-kv
KV Team
In #81935 we discuss the prioritization of replication activity at the store level. In the same vein, we should consider manual knobs to disable classes of replication/snapshot activity in a cluster. These should be controlled at the cluster level via one or more cluster settings. For example, an operator may start decommissioning a node, and the decommission may cause a latency impact or instability in the cluster. We have seen this happen before, for a variety of reasons and across numerous customers. In that situation it would be extremely helpful to have a single universal knob to pause all decommissioning across all nodes.
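To make the idea concrete, here is a minimal sketch of what such a knob could look like from the point of view of code that schedules replication work. All names in it (`ActivityClass`, `PauseSetting`, `Pause`, `Resume`, `Allowed`) are hypothetical and are not existing kvserver APIs. In a real implementation the paused set would presumably be backed by a cluster setting rather than an in-memory struct.

```go
package main

import (
	"fmt"
	"sync"
)

// ActivityClass is a hypothetical bucket of replication activity.
// The actual buckets are an open question in this issue.
type ActivityClass uint32

const (
	Decommissioning ActivityClass = 1 << iota
	Rebalancing
	UpReplication
)

// PauseSetting stands in for a cluster-wide setting that every node
// would observe. Here it is just a mutex-protected bitmask.
type PauseSetting struct {
	mu     sync.RWMutex
	paused ActivityClass
}

// Pause disables the given classes of activity.
func (p *PauseSetting) Pause(c ActivityClass) {
	p.mu.Lock()
	defer p.mu.Unlock()
	p.paused |= c
}

// Resume re-enables the given classes of activity.
func (p *PauseSetting) Resume(c ActivityClass) {
	p.mu.Lock()
	defer p.mu.Unlock()
	p.paused &^= c
}

// Allowed reports whether work of class c may proceed. A queue would
// consult this before processing a replica for that reason.
func (p *PauseSetting) Allowed(c ActivityClass) bool {
	p.mu.RLock()
	defer p.mu.RUnlock()
	return p.paused&c == 0
}

func main() {
	s := &PauseSetting{}
	s.Pause(Decommissioning)
	fmt.Println("decommissioning allowed:", s.Allowed(Decommissioning))
	fmt.Println("rebalancing allowed:", s.Allowed(Rebalancing))
}
```

Using one bit per class lets a single setting pause any combination of activity classes, while other classes keep running.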
We should consider having control over the following buckets of replication activity:
Jira issue: CRDB-16313