ReadReplica role (former Non Promotable Clone) #1931
Conversation
…e, the node does not participate in the election process as a candidate (completed, but the ElectionService unit tests still need fixing)
```shell
// Quick commands (for linux add mono)
EventStore.ClusterNode.exe --int-ip 127.0.0.1 --ext-ip 127.0.0.1 --int-tcp-port=2111 --ext-tcp-port=2112 --int-http-port=2113 --ext-http-port=2114 --cluster-size=3 --discover-via-dns=false --gossip-seed=127.0.0.1:1113,127.0.0.1:3113 --structured-log=false
EventStore.ClusterNode.exe --int-ip 127.0.0.1 --ext-ip 127.0.0.1 --int-tcp-port=3111 --ext-tcp-port=3112 --int-http-port=3113 --ext-http-port=3114 --cluster-size=3 --discover-via-dns=false --gossip-seed=127.0.0.1:1113,127.0.0.1:2113 --structured-log=false
// Start a ReadReplica node
```
…ther nodes are back up
A scenario to consider is when a ReadReplica node is running and all other nodes in the cluster are down. The ReadReplica's writer checkpoint still moves forward even without client writes, as it is writing internal stats and data. For that reason, when the other nodes come back up and the ReadReplica rejoins the cluster, the elected master will subscribe to it at a less recent position, and an offline truncation will therefore be performed on the ReadReplica.
A ReadReplica node is like a Clone that can never be elected. When you attach a ReadReplica node to an existing cluster, you keep the same cluster size. In a cluster of 4 nodes where one of them is a ReadReplica, network partitions can still happen: when 2 nodes are running side by side with the other 2 nodes, you have a network partition. The difference is that, with a ReadReplica node, the partition it belongs to cannot form a quorum, and therefore clients can't write data to it. When the 2 partitions join back together, the nodes in the partition with less data will be automatically stopped, and their log will be truncated at the next start. Using a ReadReplica, you can be sure that what is truncated is only the tail of the log containing stats and internal data. To reproduce the network partition running the nodes locally:
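The original commands for this step are not in the thread, but a rough sketch of one way to simulate the partition locally is to suspend half of the node processes rather than killing them. The `pgrep` patterns below are illustrative and assume the nodes were started with the port flags from the quick commands earlier in this thread:

```shell
# Hypothetical sketch, not the commands from this PR: with four local nodes
# running (one of them the ReadReplica), suspend two non-replica nodes to
# split the cluster 2/2.
NODE3_PID=$(pgrep -f "int-tcp-port=3111")   # pattern assumes the quick commands above
NODE4_PID=$(pgrep -f "int-tcp-port=4111")   # illustrative fourth node

# Suspend two nodes: the ReadReplica's side now cannot form a quorum,
# so clients cannot write to it.
kill -STOP "$NODE3_PID" "$NODE4_PID"

# Resume the nodes to heal the partition and let the cluster reconcile.
kill -CONT "$NODE3_PID" "$NODE4_PID"
```

Suspending with `SIGSTOP`/`SIGCONT` keeps the processes alive while making them unreachable, which is closer to a real partition than a clean shutdown.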
During the next start, the tail of the log will be truncated. If you are using a Clone, there is a risk of losing client data (the split-brain problem: 2 masters accepting writes in the same cluster). If you are using a ReadReplica node, there is no risk of losing client data, as the quorum can't be formed.
Closing in favor of #1976
There is a new ReadReplica config setting (default false)
When it is set to true, the node does not participate in the election process as a candidate, and it does not allow the cluster to stay up if there are not enough (other) nodes to form a quorum
Added a new test for the ElectionService changes (more tests can be added... as usual)
To test this PR, run a cluster of 3 nodes. The nodes could be any previous version.
Once the cluster is up and running, run a node using this PR's code with the ReadReplica setting set to true. Remember to keep the cluster size at 3 and use the same gossip port as the other nodes in order to join the existing cluster.
You can verify from the logs that the node is recognized as a ReadReplica, and that it does not participate as a candidate in the election process when you shut down the other nodes for testing.
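The attach step above might look like the following, reusing the port layout from the quick commands earlier in this thread. The `--read-replica` flag name is an assumption derived from the new ReadReplica setting; check the PR diff for the actual option name:

```shell
// Hypothetical sketch: attach a ReadReplica node to the existing 3-node
// cluster. --read-replica is an assumed flag name, not confirmed by this PR.
// Note: --cluster-size stays at 3 and the gossip seeds point at the
// existing nodes, so the replica joins without changing the quorum.
EventStore.ClusterNode.exe --int-ip 127.0.0.1 --ext-ip 127.0.0.1 --int-tcp-port=4111 --ext-tcp-port=4112 --int-http-port=4113 --ext-http-port=4114 --cluster-size=3 --discover-via-dns=false --gossip-seed=127.0.0.1:1113,127.0.0.1:2113,127.0.0.1:3113 --structured-log=false --read-replica=true
```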
Closes #1751