The raftd server is a reference implementation for using the goraft library. This library provides a distributed consensus protocol based on the Raft protocol as described by Diego Ongaro and John Ousterhout in their paper, In Search of an Understandable Consensus Algorithm. This protocol is based on Paxos but is architected to be more understandable. It is similar to other log-based distributed consensus systems such as Google's Chubby or Heroku's doozerd.
This reference implementation is very simple. It is a key/value database with the following HTTP API:
# Set the value of a key.
$ curl -X POST http://localhost:4001/db/my_key -d 'FOO'
# Retrieve the value for a given key.
$ curl http://localhost:4001/db/my_key
FOO
All the values sent to the leader will be propagated to the other servers in the cluster. This reference implementation does not support command forwarding. If you try to send a change to a follower then it will simply be denied.
First, install raftd:
$ go get github.com/goraft/raftd
To start the first node in your cluster, simply specify a port and a directory where the data will be stored:
$ raftd -p 4001 ~/node.1
To add nodes to the cluster, you'll need to start on a different port and use a different data diretory. You'll also need to specify the host/port of the leader of the cluster to join:
$ raftd -p 4002 -join localhost:4001 ~/node.2
When you restart the node, it's already been joined to the cluster so you can remove the -join
argument.
Finally, you can add one more node:
$ raftd -p 4003 -join localhost:4001 ~/node.3
Now when you set values to the leader:
$ curl -XPOST localhost:4001/db/foo -d 'bar'
The values will be propagated to the followers:
$ curl localhost:4001/db/foo
bar
$ curl localhost:4002/db/foo
bar
$ curl localhost:4003/db/foo
bar
Killing the leader will automatically elect a new leader. If you kill and restart the first node and try to set a value you'll receive:
$ curl -XPOST localhost:4001/db/foo -d 'bar'
raft.Server: Not current leader
Leader forwarding is not implemented in the reference implementation.
If you want to see more detail then you can specify several options for logging:
-v Enables verbose raftd logging.
-debug Enables debug-level raft logging.
-trace Enables trace-level raft logging.
If you're having an issue getting raftd
running, the -debug
and -trace
options can be really useful.
One issue with running a 2-node distributed consensus protocol is that we need both servers operational to make a quorum and to perform an actions on the server. So if we kill one of the servers at this point, we will will not be able to update the system (since we can't replicate to a majority). You will need to add additional nodes to allow failures to not affect the system. For example, with 3 nodes you can have 1 node fail. With 5 nodes you can have 2 nodes fail.