Cache replicated entries & don't block main task w/ replication #88

thedodd · 2020-11-20T01:07:57Z

closes #12
closes #76
closes #87

With this change, we are also caching entries which come from the leader replication protocol. As entries come in, we append them to the log and then cache the entry. When it is safe to apply entries to the state machine, we will take them directly from the in-memory cache instead of going to disk. Moreover, and most importantly, we are not longer blocking the AppendEntries RPC handler with the logic of the state machine replication workflow. There is a small amount of async task juggling to ensure that we don't run into situations where we would have two writers attempting to write to the state machine at the same time. This is easily avoided in our algorithm. closes #12 closes #76

thedodd · 2020-11-20T01:11:03Z

@MarinPostma & @sunli829 I was wondering if you two would be interested in reviewing this PR. Pretty happy with how simple this turned out to be. There were some complexities to think through to ensure that we don't have any Raft "safety" violations as we transition from follower to leader, as cluster leadership changes, and to ensure the cache can be trusted. Fortunately, all quite simple at the end of the day.

MarinPostma · 2020-11-20T10:31:59Z

Hello @thedodd ! That was fast! I didn't know about OrderedFutures but that nicely does the trick! I will try it later today. Thank you!!

The log index provided to the log compaction interface was a bit misleading. When performing log compaction, the compaction can only cover the breadth of the log up to the last applied log (obvs) and under write load, this value may change quickly. As such, the expectations of the log compaction interface have been refined and clarified. Now, the only expectation is that the storage implementation will export/checkpoint/snapshot its state machine, and then use the value of that export's last applied log as the metadata indicating the breadth of the log covered by the snapshot.

thedodd added bug Something isn't working enhancement New feature or request replication Related to the replication system labels Nov 20, 2020

thedodd self-assigned this Nov 20, 2020

Prep for 0.5.6 release.

1441e7b

thedodd force-pushed the 12-cache-replicated-entries-and-dont-block branch from 6cb62ac to 1441e7b Compare November 20, 2020 02:26

thedodd merged commit 5835bf4 into master Nov 24, 2020

thedodd deleted the 12-cache-replicated-entries-and-dont-block branch November 24, 2020 15:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache replicated entries & don't block main task w/ replication #88

Cache replicated entries & don't block main task w/ replication #88

thedodd commented Nov 20, 2020

thedodd commented Nov 20, 2020

MarinPostma commented Nov 20, 2020

Cache replicated entries & don't block main task w/ replication #88

Cache replicated entries & don't block main task w/ replication #88

Conversation

thedodd commented Nov 20, 2020

thedodd commented Nov 20, 2020

MarinPostma commented Nov 20, 2020