"While pursuing success in a dynamic, complex environment with limited resources and multiple goal conflicts, a succession of small, everyday decisions eventually produced breakdowns on a massive ...
Post: Distributed Systems Design
https://www.bluebox.net/insight/blog-article/distributed-systems-design-part-1-4 https://www.bluebox.net/insight/blog-article/distributed-systems-design-part-2-4 https://www.bluebox.net/insight/blog-article/distributed-systems-design-part-3-4 ...
Book: Failure is Not an Option
http://www.amazon.com/Failure-Is-Not-Option-Mission/dp/1439148813 A great book about how the operations team worked and grew from the beginning of NASA through all of the major Apollo missions. Although ...
Post / Website: Lambda Architecture
http://lambda-architecture.net Nathan Marz came up with the term Lambda Architecture (LA) for a generic, scalable and fault-tolerant data processing architecture, based on his experience working on distributed ...
Paper: B4: Experience with a Globally Deployed Software Defined WAN
Add this excellent paper on how Google built a globe-spanning WAN for their business. B4: Experience with a Globally Deployed Software Defined WAN Sushant Jain, Alok Kumar, Subhasree Mandal, Joon Ong, ...
Presentation: Using Logs To Build a Solid Data Infrastructure (Martin Kleppmann)
"Using Logs To Build a Solid Data Infrastructure" by Martin Kleppmann would be a nice complement for logs: http://www.ustream.tv/recorded/61479591
This repo seems to be missing a license. You could use https://github.com/sindresorhus/awesome#license as inspiration.
http://www.gluecon.com/ The front page does not describe the conference very well; the session descriptions are more useful: https://docs.google.com/spreadsheets/d/1A6RoScrHsRn96u74o7uwSOa5VuJ7mfLKS_cgaVhyCq0/edit#gid=0 ...