#21

Add 'The Art of Capacity Planning'

This was recommended to me by @chooper and is a tremendous help getting started with capacity planning.

  • Opened by naaman Oct 18, 2013
  • 3 comments

#22

Book: Managing the Unexpected

Managing the Unexpected (Weick and Sutcliffe)

#23

Post: How to lose $172,222 a second for 45 minutes

Hello, here is an educational article and post-mortem document: http://pythonsweetness.tumblr.com/post/64740079543/how-to-lose-172-222-a-second-for-45-minutes

#24

Book: Mature Optimization

http://carlos.bueno.org/optimization/mature-optimization.pdf Internal performance optimization manual for Facebook.

#25

Added Micro Service presentation

This presentation about designing and building micro services is fantastic. One of the best explanations I've come across.

  • Opened by kyleboon Dec 15, 2013
  • 1 comment

#26

Paper/Chapter: Appendix F - Personal observations on the reliability of the Shuttle by Richard Feynman

I'm not exactly sure if this fits in here, but Appendix F from the Challenger explosion investigation was a goldmine of engineering principles and of how things can go wrong that I could learn from ...

  • Opened by ferd Dec 29, 2013
  • 1 comment

#27

Presentation: Practicalities of Productionizing Distributed Systems

Video is here, need to find the slides though.

  • Opened by mmcgrana Dec 29, 2013
  • 1 comment

#28

How to know where to start? Knowledge map?

How well does this content fit a knowledge map structure? The list is great but it's also large-ish and growing. Having a logical starting point (perhaps per high-level topic) might be interesting. ...

  • Opened by bjeanes Dec 30, 2013
  • 1 comment

#29

Book: The Field Guide to Understanding Human Error

Recommended in this reading list. Maybe this turns out to be a better fit than #4.

#30

Book: Effective Monitoring and Alerting

Recommended in this reading list.