Allow to lock a replicaset's buckets from rebalancing #71

Gerold103 · 2018-02-13T20:09:05Z

Add a method like

vshard.storage.lock_buckets()

This method must protect the active buckets of this replicaset from rebalancing and must forbid the replicaset from receiving new buckets. In other words, the rebalancer must consider such a replicaset balanced, even if that contradicts the weights.

Gerold103 · 2018-02-14T18:14:07Z

Also allow pinning a specific bucket. For example, with:

vshard.storage.bucket_pin(bucket_id)

Gerold103 · 2018-03-06T17:58:52Z

Why?

A replicaset lock makes it possible, for example, to separate a replicaset used
for testing from the production replicasets, or to keep in place some
application metadata that must not be sharded for a while.
A bucket pin does the same, but in a smaller scope.

How does it work?

A replicaset lock is not the same as pinning all of its buckets. When a
replicaset is locked, it can neither receive new buckets nor send its own ones.
When a bucket is individually pinned, it cannot be sent, but its replicaset can
still receive new buckets.
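
The difference between the two restrictions can be sketched as a toy model. This is Python written only for illustration, not vshard code: the `Replicaset` class and its `can_receive` and `can_send` methods are hypothetical names that do not come from this issue.

```python
from dataclasses import dataclass, field


@dataclass
class Replicaset:
    """Toy model of a replicaset (hypothetical, not a vshard structure)."""
    name: str
    buckets: set = field(default_factory=set)
    pinned: set = field(default_factory=set)
    locked: bool = False

    def can_receive(self) -> bool:
        # Only a replicaset-level lock forbids receiving new buckets.
        return not self.locked

    def can_send(self, bucket_id: int) -> bool:
        # A bucket can leave only if it is stored here, the replicaset
        # is not locked, and the bucket itself is not pinned.
        return (bucket_id in self.buckets
                and not self.locked
                and bucket_id not in self.pinned)


locked_rs = Replicaset("rs1", buckets={1, 2}, locked=True)
pinned_rs = Replicaset("rs2", buckets={3, 4}, pinned={3})

print(locked_rs.can_receive(), locked_rs.can_send(1))
# False False
print(pinned_rs.can_receive(), pinned_rs.can_send(3), pinned_rs.can_send(4))
# True False True
```

In this model a lock blocks both directions for the whole replicaset, while a pin blocks only the outgoing direction and only for the pinned bucket.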

Replicaset lock and rebalancing

When a replicaset is locked, it does not participate in rebalancing. This means
that even if its ethalon (ideal) bucket count is not equal to the actual one,
this imbalance cannot be fixed because of the lock. When the rebalancer detects
that a replicaset is locked, it recalculates the ethalon bucket counts of the
non-locked replicasets as if the locked replicaset and its buckets did not
exist.

Bucket pin and rebalancing

This case is much more complex.

The rebalancer first calculates the ethalon bucket counts as if no buckets were
pinned. Then it looks at each replicaset and compares its new ethalon bucket
count against its pinned bucket count. If the pinned count does not exceed the
ethalon one, that is fine: a non-locked replicaset with pinned buckets can
receive new ones.
If the ethalon bucket count is less than the pinned one, the imbalance cannot
be fixed, because the rebalancer cannot move pinned buckets out of this
replicaset. In such a case the ethalon bucket count of the replicaset is set to
exactly the pinned count. Such replicasets are not considered by the rebalancer
any further, and their pinned counts are subtracted from the total bucket count.
The described procedure is restarted from step one with the new total bucket
count and with the replicasets whose ethalon bucket count >= pinned count,
until ethalon bucket count >= pinned count holds on all remaining replicasets.

Pseudocode:

function cluster_calculate_ethalon_balance(replicasets, bucket_count)
	-- spread buckets over replicasets using weights --
end;

cluster = <all of non-locked replicasets>;
bucket_count = <total bucket count on the non-locked replicasets>;
repeat
	cluster_calculate_ethalon_balance(cluster, bucket_count);
	next_cluster = <empty>;
	foreach replicaset in cluster do
		if replicaset.ethalon_bucket_count <
		   replicaset.pinned_bucket_count then
			-- pinned buckets can not leave: fix the count and
			-- exclude the replicaset from the next steps --
			bucket_count -= replicaset.pinned_bucket_count;
			replicaset.ethalon_bucket_count =
				replicaset.pinned_bucket_count;
		else
			next_cluster.add(replicaset);
		end;
	end;
	is_done = next_cluster == cluster;
	cluster = next_cluster;
until is_done;
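
The procedure above can be tried out with a small runnable model. This is a Python sketch under stated assumptions, not the vshard implementation: the dictionary keys (`weight`, `buckets`, `pinned`, `ethalon`), the function names, and the rounding rule (shares are rounded down and the remainder is handed out one bucket at a time) are all hypothetical. Weights are assumed to be positive integers, and the input list is assumed to contain only non-locked replicasets.

```python
def calculate_ethalon_balance(replicasets, bucket_count):
    """Spread bucket_count over replicasets proportionally to their weights.

    Assumption of this sketch: shares are rounded down and the remainder
    is handed out one bucket at a time in list order.
    """
    total_weight = sum(rs["weight"] for rs in replicasets)
    for rs in replicasets:
        rs["ethalon"] = bucket_count * rs["weight"] // total_weight
    rest = bucket_count - sum(rs["ethalon"] for rs in replicasets)
    for rs in replicasets[:rest]:
        rs["ethalon"] += 1


def balance_with_pins(replicasets):
    """Set rs["ethalon"] for each replicaset, respecting pinned buckets."""
    cluster = list(replicasets)
    bucket_count = sum(rs["buckets"] for rs in cluster)
    while True:
        calculate_ethalon_balance(cluster, bucket_count)
        next_cluster = []
        for rs in cluster:
            if rs["ethalon"] < rs["pinned"]:
                # Pinned buckets cannot leave: fix the count and exclude
                # the replicaset and its pinned buckets from later steps.
                bucket_count -= rs["pinned"]
                rs["ethalon"] = rs["pinned"]
            else:
                next_cluster.append(rs)
        if len(next_cluster) == len(cluster):
            return  # nothing was excluded, so the balance is final
        cluster = next_cluster


# Illustrative numbers only: 300 buckets, equal weights.
cluster = [
    {"name": "rs1", "weight": 1, "buckets": 200, "pinned": 200},
    {"name": "rs2", "weight": 1, "buckets": 80, "pinned": 60},
    {"name": "rs3", "weight": 1, "buckets": 20, "pinned": 0},
]
balance_with_pins(cluster)
print({rs["name"]: rs["ethalon"] for rs in cluster})
# {'rs1': 200, 'rs2': 60, 'rs3': 40}
```

The example needs three passes. The first pass gives 100 buckets to each replicaset and excludes rs1 (100 < 200 pinned). The second pass spreads the remaining 100 buckets as 50 and 50 and excludes rs2 (50 < 60 pinned). The third pass leaves 40 buckets for rs3 and changes nothing, so the loop stops.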

A locked replicaset can neither receive new buckets nor send its own ones. In effect, it and its buckets do not participate in rebalancing, and the non-locked replicasets are rebalanced independently.

For example, consider a cluster:

replicaset1: locked, weight = 1, bucket count = 1500
replicaset2: weight = 1, bucket count = 1500

When a replicaset3 is added, only rs3 and rs2 participate in rebalancing (respecting their weights):

replicaset1: locked, weight = 1, bucket count = 1500
replicaset2: weight = 1, bucket count = 500
replicaset3: weight = 2, bucket count = 1000

The lock is useful, for example, to hold some test data on a particular replicaset, or to keep on one replicaset special data that must be stored together.

Part of #71
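
The numbers in this example can be reproduced with a short calculation. The following is a Python sketch for illustration only, not vshard code: the function name `ethalon_counts` and the dictionary layout are hypothetical, replicaset3 is assumed to start with 0 buckets, weights are assumed to be positive integers, and any rounding remainder is ignored for brevity.

```python
def ethalon_counts(replicasets):
    """Return {name: ethalon bucket count}; locked replicasets keep theirs."""
    active = [rs for rs in replicasets.values() if not rs["locked"]]
    # Locked replicasets and their buckets are left out of the totals.
    total_buckets = sum(rs["buckets"] for rs in active)
    total_weight = sum(rs["weight"] for rs in active)
    counts = {}
    for name, rs in replicasets.items():
        if rs["locked"]:
            counts[name] = rs["buckets"]
        else:
            counts[name] = total_buckets * rs["weight"] // total_weight
    return counts


cluster = {
    "replicaset1": {"locked": True, "weight": 1, "buckets": 1500},
    "replicaset2": {"locked": False, "weight": 1, "buckets": 1500},
    "replicaset3": {"locked": False, "weight": 2, "buckets": 0},
}
print(ethalon_counts(cluster))
# {'replicaset1': 1500, 'replicaset2': 500, 'replicaset3': 1000}
```

Only the 1500 buckets of replicaset2 are redistributed, in a 1:2 ratio, while the 1500 buckets of the locked replicaset1 stay where they are.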

A pinned bucket is a bucket that cannot be sent out of its replicaset. Taking pinned buckets into account changes the rebalancer algorithm, since on some replicasets the perfect balance can no longer be reached.

An iterative algorithm is used to find the best balance in a cluster. On each step it calculates the perfect bucket count for each replicaset. If this count cannot be satisfied because of pinned buckets, the algorithm makes a best effort to get as close to the perfect balance as possible. It does so by ignoring the replicasets that are imbalanced because of pinning, together with their pinned buckets. After that a new balance is calculated, and it can happen that it cannot be satisfied either. This is possible because ignoring the pinned buckets of overpopulated replicasets decreases the perfect bucket count of the other replicasets, and the new values can become less than their pinned bucket counts.

Part of #71

Gerold103 added the feature and customer labels Feb 13, 2018

Gerold103 self-assigned this Feb 13, 2018

Gerold103 added prio2 and removed prio2 labels Feb 14, 2018

Gerold103 added a commit that referenced this issue Mar 26, 2018

storage: open public API to pin/unpin buckets

8454a8c

Closes #71

Gerold103 added a commit that referenced this issue Mar 26, 2018

storage: open public API to pin/unpin buckets

ed653b0

Closes #71

Gerold103 closed this as completed in 327c746 Mar 30, 2018
