#14

Incident response and management

  • Opened by mmcgrana Oct 7, 2013
  • 5 comments
#11

Paper: Raft

Raft is an attempt at making a consensus algorithm that is easily understandable(compared with Paxos). https://ramcloud.stanford.edu/wiki/download/attachments/11370504/raft.pdf

  • Opened by edmellum Oct 7, 2013
  • 2 comments
#10

Dynamo database paper

Dynamo is easily understandable and a good intro to distributed eventually consisted databases. http://www.read.seas.harvard.edu/~kohler/class/cs239-w08/decandia07dynamo.pdf

#8

Book: Web Operations

Web Operations (Allspaw and Robbins)

  • Opened by mmcgrana Oct 6, 2013
  • 1 comment
#5

Book: Resilience Engineering in Practice

Resilience Engineering in Practice (Hollnagel et al.)

  • Opened by mmcgrana Oct 5, 2013
  • 2 comments
#4

Book: Human Error

Human Error (Reason)

  • Opened by mmcgrana Oct 5, 2013
  • 1 comment
#1

Paper: Online, Asynchronous Schema Change in F1

Online, Asynchronous Schema Changes in F1 (Rae et al.)

  • Opened by mmcgrana Oct 5, 2013
  • 1 comment