Presentation: Practicalities of Productionizing Distributed Systems
Video is here, need to find the slides though.
Basic material on capacity planning. Best suggestion so far is 'The Art of Capacity Planning' as discussed in #21.
The Spark work is really interesting and there are some good papers on it: http://people.csail.mit.edu/matei/papers/2010/hotcloud_spark.pdf https://www.usenix.org/system/files/conference/nsdi12/nsdi12-final138.pdf ...
Some good URLs around this that I know of: Kafka: A Distributed Messaging System for Log Processing (Kreps et al.) The Log: What every software engineer should know about real-time data's unifying ...
http://odbms.org/download/dean-keynote-ladis2009.pdf (Design, Lessons, and Advice from Building Distributed Systems at Google) leads to 404 page.