Use phi accrual failure detectors for Raft elections and session timeouts #294

kuujo · 2017-11-01T23:52:41Z

This PR refactors how leadership elections and session expirations are handled in Raft.

It adds a phi accrual failure detector that determines when to start a new election. To avoid multiple servers starting an election at the same time, each server checks the current phi value on a randomized timer. The Raft election timeout remains as a fallback, so an election is never delayed past that timeout.
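To make the election check concrete, here is a minimal sketch of a phi accrual detector with an election-timeout fallback. It is not the `PhiAccrualFailureDetector` from this PR: the class name `SimplePhiDetector`, the window size, the phi threshold, and the `shouldStartElection` helper are all assumptions for illustration. It also uses the common exponential-distribution approximation, phi = t / (mean * ln 10), rather than the normal distribution described in the Hayashibara paper, and it leaves out the randomized check timers.

```java
import java.util.ArrayDeque;
import java.util.Deque;

/** Hypothetical, simplified phi accrual detector; not the class from this PR. */
final class SimplePhiDetector {
  // Converts a natural-log based suspicion level to log base 10.
  private static final double PHI_FACTOR = 1.0 / Math.log(10.0);

  private final int windowSize;
  private final Deque<Long> intervals = new ArrayDeque<>();
  private long sum;
  private long lastArrival = -1;

  SimplePhiDetector(int windowSize) {
    if (windowSize <= 0) {
      throw new IllegalArgumentException("windowSize must be positive");
    }
    this.windowSize = windowSize;
  }

  /** Records a heartbeat arrival and keeps a sliding window of inter-arrival times. */
  void report(long arrivalTime) {
    if (lastArrival >= 0) {
      long interval = arrivalTime - lastArrival;
      intervals.addLast(interval);
      sum += interval;
      if (intervals.size() > windowSize) {
        sum -= intervals.removeFirst();
      }
    }
    lastArrival = arrivalTime;
  }

  /** Suspicion level at time now; 0.0 until at least one interval has been observed. */
  double phi(long now) {
    if (intervals.isEmpty() || sum <= 0) {
      return 0.0;
    }
    double mean = (double) sum / intervals.size();
    return PHI_FACTOR * (now - lastArrival) / mean;
  }

  /**
   * True when phi has reached the threshold, or when the plain election
   * timeout has elapsed since the last heartbeat (the fallback).
   */
  boolean shouldStartElection(long now, double phiThreshold, long electionTimeoutMs) {
    boolean timedOut = lastArrival >= 0 && now - lastArrival >= electionTimeoutMs;
    return phi(now) >= phiThreshold || timedOut;
  }
}
```

With heartbeats arriving every 100 ms, phi grows linearly with the silence since the last heartbeat, so a follower on a fast, regular link suspects the leader sooner than the fixed election timeout alone would allow.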

Sessions are also expired using phi accrual failure detectors, driven by heartbeats that the leader sends to clients. New sessions are opened with a minimum timeout, and the leader sends heartbeats to clients at the rate of the minimum session timeout. Sending heartbeats from the leader to clients also ensures that clients resolve new leaders as soon as possible. To account for the period during which an old leader's crash was being detected and a new leader was being elected, nodes track the last heartbeat time and subtract that time from session timeouts. This means a session can be expired via the failure detector immediately after a leader change if the leader can't reach the client.
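One reading of the heartbeat scheduling described above can be sketched as follows: the leader derives a single heartbeat interval from the smallest minimum timeout among open sessions and sends one heartbeat per client rather than one per session. The `SessionHeartbeats` and `Session` types and their fields are hypothetical and are not taken from the PR.

```java
import java.util.List;
import java.util.OptionalLong;
import java.util.stream.Collectors;

/** Hypothetical session model; the names here are not taken from the PR. */
final class SessionHeartbeats {

  static final class Session {
    final String clientId;
    final long minTimeoutMs;

    Session(String clientId, long minTimeoutMs) {
      this.clientId = clientId;
      this.minTimeoutMs = minTimeoutMs;
    }
  }

  /** Heartbeat interval: the smallest minimum timeout among open sessions, if any exist. */
  static OptionalLong heartbeatInterval(List<Session> sessions) {
    return sessions.stream().mapToLong(s -> s.minTimeoutMs).min();
  }

  /** One heartbeat target per client, however many sessions that client holds. */
  static List<String> heartbeatTargets(List<Session> sessions) {
    return sessions.stream()
        .map(s -> s.clientId)
        .distinct()
        .collect(Collectors.toList());
  }
}
```

An empty `OptionalLong` means there are no open sessions, in which case no heartbeat timer needs to run at all.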

coveralls · 2017-11-02T00:00:22Z

Coverage increased (+0.7%) to 52.014% when pulling 9c1da2f on phi-accrual-failure-detectors into a6ff559 on master.

coveralls · 2017-11-02T00:56:21Z

Coverage increased (+1.008%) to 52.308% when pulling 0a7f888 on phi-accrual-failure-detectors into a6ff559 on master.

coveralls · 2017-11-02T00:58:26Z

Coverage increased (+1.06%) to 52.36% when pulling 0a7f888 on phi-accrual-failure-detectors into a6ff559 on master.

coveralls · 2017-11-02T06:52:42Z

Coverage increased (+0.6%) to 51.934% when pulling 556311d on phi-accrual-failure-detectors into a6ff559 on master.

kuujo · 2017-11-02T07:23:47Z

...ocols/failure-detection/src/main/java/io/atomix/protocols/phi/PhiAccrualFailureDetector.java

@@ -29,93 +25,69 @@
 * <p>
 * Based on a paper titled: "The φ Accrual Failure Detector" by Hayashibara, et al.
 */
-public class PhiAccrualFailureDetector<T extends Identifier> {
-  private final Map<T, History> states = Maps.newConcurrentMap();
+public class PhiAccrualFailureDetector {


We should add a builder for this class

kuujo · 2017-11-02T07:24:31Z

...ocols/failure-detection/src/main/java/io/atomix/protocols/phi/PhiAccrualFailureDetector.java

-  public void report(T nodeId, long arrivalTime) {
-    checkNotNull(nodeId, "NodeId must not be null");
+  public void report(long arrivalTime) {
+    checkNotNull("NodeId must not be null");


This assertion is checking a string literal
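A small sketch of why the check is a no-op: a null check on a string literal can never fail, and `arrivalTime` is a primitive `long` that cannot be null in the first place. The example uses `java.util.Objects.requireNonNull` from the standard library in place of Guava's `checkNotNull` so that it is self-contained. The `ReportCheckDemo` class and the suggested non-negative check are assumptions, not the fix that was applied in the PR.

```java
import java.util.Objects;

/** Hypothetical demo class; not part of the PR. */
final class ReportCheckDemo {

  /** The literal is the value being checked, so this check always passes. */
  static boolean literalCheckPasses() {
    Objects.requireNonNull("NodeId must not be null");
    return true;
  }

  /** A primitive long cannot be null; a range check is one possible replacement. */
  static void report(long arrivalTime) {
    if (arrivalTime < 0) {
      throw new IllegalArgumentException("arrivalTime must not be negative");
    }
    // ... record the arrival time ...
  }
}
```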

kuujo · 2017-11-02T07:27:39Z

protocols/raft/src/main/java/io/atomix/protocols/raft/roles/FollowerRole.java

    log.trace("Starting heartbeat timer");
+    AtomicLong lastHeartbeat = new AtomicLong();


This value needs to be updated, otherwise the last heartbeat time is always reported.

kuujo · 2017-11-02T07:29:33Z

protocols/raft/src/main/java/io/atomix/protocols/raft/roles/LeaderRole.java

+
+    Scheduled oldTimer;
+    if (minTimeout.isPresent()) {
+      Scheduled newTimer = raft.getThreadContext().schedule(


Not sure if we should be using a recurring timer or setting a timer recursively when responses are received.
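The two options can be sketched with a plain `ScheduledExecutorService`. This is only an illustration of the trade-off: the `HeartbeatTimers` class is hypothetical, the `Runnable` stands in for sending a heartbeat and handling its response, and the PR itself schedules through the Raft thread context rather than this API.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

/** Hypothetical comparison of the two timer styles; not code from the PR. */
final class HeartbeatTimers {

  /** Recurring: fires on a fixed period regardless of how the previous heartbeat fared. */
  static void recurring(ScheduledExecutorService scheduler, Runnable heartbeat, long intervalMs) {
    scheduler.scheduleAtFixedRate(heartbeat, intervalMs, intervalMs, TimeUnit.MILLISECONDS);
  }

  /** Recursive: the next heartbeat is scheduled only after the previous one has completed. */
  static void recursive(ScheduledExecutorService scheduler, Runnable heartbeat, long intervalMs) {
    scheduler.schedule(() -> {
      heartbeat.run(); // stands in for "send the heartbeat and handle the response"
      if (!scheduler.isShutdown()) {
        recursive(scheduler, heartbeat, intervalMs);
      }
    }, intervalMs, TimeUnit.MILLISECONDS);
  }

  /** Runs the recursive style until the given number of heartbeats has fired. */
  static boolean demo(int beats) {
    ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
    CountDownLatch latch = new CountDownLatch(beats);
    try {
      recursive(scheduler, latch::countDown, 5);
      return latch.await(5, TimeUnit.SECONDS);
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      return false;
    } finally {
      scheduler.shutdownNow();
    }
  }
}
```

The recurring style gives the failure detector evenly spaced samples, while the recursive style naturally backs off when responses are slow, at the cost of less regular intervals.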

kuujo · 2017-11-02T07:30:03Z

protocols/raft/src/main/java/io/atomix/protocols/raft/roles/LeaderRole.java

+            .filter(m -> m != null)
+            .collect(Collectors.toList()))
+        .build();
+    log.trace("Sending {}", request);


Add the MemberId to this log statement

kuujo · 2017-11-02T07:31:29Z

protocols/raft/src/main/java/io/atomix/protocols/raft/session/impl/RaftSessionContext.java

@@ -52,19 +53,23 @@
 * Raft session.
 */
 public class RaftSessionContext implements RaftSession {
+  private static final int PHI_FAILURE_THRESHOLD = 10;


We should be able to remove this threshold constant

coveralls · 2017-11-02T20:59:10Z

Coverage increased (+0.6%) to 51.929% when pulling a2f4e5d on phi-accrual-failure-detectors into a6ff559 on master.

coveralls · 2017-11-02T21:44:29Z

Coverage increased (+0.1%) to 51.836% when pulling dbbf32a on phi-accrual-failure-detectors into a4223bc on master.

kuujo · 2017-11-03T00:57:03Z

This implementation seems to have a problem with doing several rounds of election at startup. I haven't looked at the logs closely enough to figure out why yet.

kuujo · 2017-11-03T00:59:45Z

nvm... I think it was because nodes weren't resetting election timers after voting in an election. Should be fixed now.

coveralls · 2017-11-03T01:11:27Z

Coverage decreased (-0.3%) to 51.43% when pulling 1e3b6aa on phi-accrual-failure-detectors into a4223bc on master.

kuujo · 2017-11-03T01:16:25Z

Rebased with #295. This should pass consistently now.

coveralls · 2017-11-03T01:27:40Z

Coverage increased (+0.2%) to 51.628% when pulling 95af42b on phi-accrual-failure-detectors into 966f364 on master.

kuujo requested a review from jhall11 November 1, 2017 23:52

kuujo force-pushed the phi-accrual-failure-detectors branch from a2f4e5d to dbbf32a Compare November 2, 2017 21:34

kuujo force-pushed the phi-accrual-failure-detectors branch from dbbf32a to 1e3b6aa Compare November 3, 2017 01:00

kuujo added 7 commits November 2, 2017 18:15

Use phi accrual failure detection in Raft session timeouts.

7c5c29c

Implement failure detection for Raft leader elections.

336478f

Ensure one heartbeat is sent per client for all sessions.

a93a103

Handle zero session time when checking session timeout.

62a6481

Record timestamp *after* heartbeat response in leader heartbeats.

16c03cc

Ensure Raft heartbeat failure detector logs append times at semi-regular intervals.

f500100

Clean up failure detection codes.

95af42b

kuujo force-pushed the phi-accrual-failure-detectors branch from 1e3b6aa to 95af42b Compare November 3, 2017 01:16

kuujo merged commit 857152d into master Nov 3, 2017

kuujo deleted the phi-accrual-failure-detectors branch November 3, 2017 04:27
