Raft is the algorithm implemented on etcd and by extension CoreOS. I think it's a great addition to this list. Nice list BTW !
Book: Service Availability Principles (by the SA Forum)
http://www.amazon.co.uk/Service-Availability-Principles-Maria-Toeroe-ebook/dp/B007KGE02G
https://zvzzt.wordpress.com/2012/08/16/a-note-on-uptime/ (candidate resource)
http://odbms.org/download/dean-keynote-ladis2009.pdf (Design, Lessons, and Advice from Building Distributed Systems at Google) leads to 404 page.
http://en.wikipedia.org/wiki/CAP_theorem I don't have any recommendations for specific papers currently, but I think it's an important concept for engineers to learn!
Could be a good survey / practical resource on operable apps. I think I've read this but it was a while ago, so I need to review.
Add links to papers, posts, presentations and conferences related to engineering distributed systems
Self-explanatory.
General paper- or book-length resources on security engineering practices.