WORK IN PROGRESS
Package providing a membership service with a decentralized monitoring topology and consistent propagation of cluster configuration changes.
The monitoring topology is built from K graphs, so that every node monitors K nodes and is itself monitored by K nodes. According to the Rapid technical report, this topology guarantees almost-everywhere detection. When a majority of the cluster detects a change, it takes 2 network delays for the cluster to converge on the new configuration. If there are conflicts, it takes 3 more network delays to reach an agreement.
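As an illustration of the K-graph idea only (not this package's actual construction), the sketch below builds K rings over the member list, each ring a different pseudo-random permutation; a node's subjects are its successors in the rings and its observers are its predecessors, so every node ends up with exactly K of each. All names here are hypothetical.

import "math/rand"

// buildRings sketches a K-graph monitoring topology as K independent rings,
// each ring being a pseudo-random permutation of the member list. In every
// ring a node monitors its successor and is monitored by its predecessor,
// giving it K subjects and K observers overall.
func buildRings(members []string, k int) [][]string {
	rings := make([][]string, k)
	for r := 0; r < k; r++ {
		ring := append([]string(nil), members...)
		rnd := rand.New(rand.NewSource(int64(r) + 1))
		rnd.Shuffle(len(ring), func(i, j int) { ring[i], ring[j] = ring[j], ring[i] })
		rings[r] = ring
	}
	return rings
}

// subjectsOf returns the K nodes that node monitors: its successor in every ring.
func subjectsOf(rings [][]string, node string) []string {
	subjects := make([]string, 0, len(rings))
	for _, ring := range rings {
		for i, member := range ring {
			if member == node {
				subjects = append(subjects, ring[(i+1)%len(ring)])
				break
			}
		}
	}
	return subjects
}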
The service uses gRPC for all network communication.
type Config struct {
	// Expected network delay, used as the tick interval.
	NetworkDelay StringDuration

	// Paxos
	// Timeouts are expressed in numbers of network delays.
	// If a replica doesn't receive a heartbeat for ElectionTimeout ticks,
	// it will start a new election by sending Prepare to the other replicas.
	ElectionTimeout int
	// HeartbeatTimeout must be lower than ElectionTimeout.
	// The leader re-sends its last Accept message to every replica as a heartbeat.
	HeartbeatTimeout int

	// Monitoring
	// Timeouts are expressed in numbers of network delays.
	// Each observer of the same subject reinforces the other observers' votes
	// after ReinforceTimeout.
	ReinforceTimeout int
	// RetransmitTimeout is used to re-broadcast observed alerts.
	RetransmitTimeout int
	// Connectivity is the K parameter used for the monitoring topology construction.
	// Each node will have K observers and K subjects.
	Connectivity int
	// LowWatermark and HighWatermark are the alert thresholds used by cut detection.
	LowWatermark  int
	HighWatermark int

	// IP and Port of the seed.
	Seed *types.Node
	// IP and Port of the instance.
	IP   string
	Port uint64

	DialTimeout, SendTimeout time.Duration
}
API may change during development.
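For orientation, here is a sketch of how such a configuration might be filled in. Every value is illustrative, the types.Node field names are inferred from the comments above, and NetworkDelay (a StringDuration) is left out because its construction is package-specific.

conf := rapid.Config{
	// NetworkDelay must also be set; the StringDuration value is omitted here.
	ElectionTimeout:   10, // ticks without a heartbeat before a new election
	HeartbeatTimeout:  3,  // must stay below ElectionTimeout
	ReinforceTimeout:  2,
	RetransmitTimeout: 4,
	Connectivity:      3, // K observers and K subjects per node
	LowWatermark:      1,
	HighWatermark:     3,
	Seed:              &types.Node{IP: "10.0.0.1", Port: 4001}, // illustrative seed address
	IP:                "10.0.0.2",
	Port:              4001,
	DialTimeout:       time.Second,
	SendTimeout:       time.Second,
}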
A Rapid instance needs to be initialized with a logger, a configuration, and a failure detector. In the example I am using a simple prober that dials the node once every period; if it fails to reach the node a configured number of consecutive times, it exits with a notification. This can be replaced with an application-based failure detector, a phi-accrual failure detector, etc.
The failure detector interface is:
type FailureDetector interface {
	// Monitor watches the given node and returns an error once the node is
	// considered failed or the context is cancelled.
	Monitor(context.Context, *types.Node) error
}
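As a sketch of plugging in a different detector, the type below implements this interface with plain TCP probes. It assumes types.Node exposes IP and Port fields (as suggested by the Config comments above); all other names are hypothetical.

import (
	"context"
	"fmt"
	"net"
	"time"
	// the package defining types.Node is imported as "types" here
)

// tcpDetector is a hypothetical FailureDetector: it probes the subject with
// plain TCP dials and reports a failure after three consecutive misses.
type tcpDetector struct {
	period  time.Duration
	timeout time.Duration
}

func (d tcpDetector) Monitor(ctx context.Context, node *types.Node) error {
	addr := fmt.Sprintf("%s:%d", node.IP, node.Port) // assumes Node exposes IP and Port
	misses := 0
	ticker := time.NewTicker(d.period)
	defer ticker.Stop()
	for {
		select {
		case <-ctx.Done():
			return ctx.Err()
		case <-ticker.C:
			conn, err := net.DialTimeout("tcp", addr, d.timeout)
			if err != nil {
				misses++
				if misses >= 3 {
					return fmt.Errorf("node %s unreachable: %w", addr, err)
				}
				continue
			}
			conn.Close()
			misses = 0
		}
	}
}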
To bootstrap a cluster, start a single seed node and connect every other node to this seed:
// conf is a rapid.Config populated as described above.
ctx := context.Background()
logger := zap.NewNop()
fd := prober.New(logger.Sugar(), 2*time.Second, 2*time.Second, 3)
// Run delivers every new cluster configuration to the updates channel.
updates := make(chan *types.Configuration, 1)
if err := rapid.New(logger, conf, fd).Run(ctx, updates); err != nil {
	panic(err)
}
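The updates channel receives every new cluster configuration. Assuming Run blocks for the lifetime of the instance, a consumer would be started before calling it; a minimal sketch:

// Drain configuration changes in the background (started before Run).
go func() {
	for update := range updates {
		logger.Sugar().Infof("cluster configuration changed: %+v", update)
	}
}()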
Latest specs can be found at specs.
MIT © Dmitry Shulyak