diff --git a/README.md b/README.md index 8b9f9c0..b6be058 100644 --- a/README.md +++ b/README.md @@ -34,6 +34,8 @@ We welcome [suggestions](CONTRIBUTING.md). * [Resident Distributed Datasets: a Fault-Tolerant Abstraction for In-Memory Cluster Computing](https://www.usenix.org/system/files/conference/nsdi12/nsdi12-final138.pdf) (Zahari et al.) * [The Human Side of Postmortems](https://docs.google.com/file/d/0Byl4UKRYLErDVlJMNDNjaThiR2M/edit) (Zwieback) * [Crew Resource Management: a Positive Change for the Fire Service](http://www.iaff.org/06news/NearMissKit/6.%20Crew%20Resource%20Management/CRM.pdf) +* [Architecture of a Database System](http://db.cs.berkeley.edu/papers/fntdb07-architecture.pdf) (Hellerstein et al.) +* [The Art of the Propagator](http://web.mit.edu/~axch/www/art.pdf) (Radul and Sussman) #### Posts diff --git a/resources.md b/resources.md new file mode 100644 index 0000000..c7e996d --- /dev/null +++ b/resources.md @@ -0,0 +1,12 @@ +#### Patterns + +* http://martinfowler.com/bliki/CircuitBreaker.html + +#### Metrics & profiling + +* http://metrics.codahale.com/ +* https://github.com/riemann/riemann-jvm-profiler + +#### Fault tolerance + +* https://github.com/Netflix/Hystrix diff --git a/schedule.md b/schedule.md new file mode 100644 index 0000000..0c5c4f4 --- /dev/null +++ b/schedule.md @@ -0,0 +1,165 @@ +# Schedule for 8th Light reading group + +## 2015-11-04 +* Sections 6-C [The Art of the Propagator](http://web.mit.edu/~axch/www/art.pdf) (Radul and Sussman) + +## 2015-10-28 +* Sections 1-5 [The Art of the Propagator](http://web.mit.edu/~axch/www/art.pdf) (Radul and Sussman) + +## 2015-08-12 +* [Dremel: Interactive Analysis of Web-Scale Datasets](http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/pubs/archive/36632.pdf) (Melnik et al.) + +## 2015-07-15 +* [Microservice Trade-Offs](http://martinfowler.com/articles/microservice-trade-offs.html) (Fowler) + +## 2015-07-08 +* [Resilient Distributed Datasets: A Fault-Tolerant Abstraction for +In-Memory Cluster Computing](https://www.usenix.org/system/files/conference/nsdi12/nsdi12-final138.pdf) (Zahari et al.) + +## 2015-06-24 +* [Please stop calling databases CP or AP](https://martin.kleppmann.com/2015/05/11/please-stop-calling-databases-cp-or-ap.html) (Kleppmann) + +## 2015-06-17 +* [Call me maybe: final thoughs](https://aphyr.com/posts/286-call-me-maybe-final-thoughts) (Kingsbury) + +## 2015-06-10 +* [Distributed Systems Are a UX Problem](http://bravenewgeek.com/distributed-systems-are-a-ux-problem/) (Brave New Geek) + +## 2015-06-03 +* Sections 5-8 of [Architecture of a Database System](http://db.cs.berkeley.edu/papers/fntdb07-architecture.pdf) (Hellerstein et al.) + +## 2015-05-27 +* Section 4 of [Architecture of a Database System](http://db.cs.berkeley.edu/papers/fntdb07-architecture.pdf) (Hellerstein et al.) + +## 2015-05-20 +* Sections 1-3 of [Architecture of a Database System](http://db.cs.berkeley.edu/papers/fntdb07-architecture.pdf) (Hellerstein et al.) + +## 2015-05-13 +* [The Unified Logging Infrastructure for Data Analytics at Twitter](http://vldb.org/pvldb/vol5/p1771_georgelee_vldb2012.pdf) (Lee et al.) + +## 2015-05-06 +* [Kafka: A Distributed Messaging System for Log Processing](http://research.microsoft.com/en-us/UM/people/srikanth/netdb11/netdb11papers/netdb11-final12.pdf) (Kreps et al.) + +## 2015-04-29 +* [The Log: What every software engineer should know about real-time data's unifying abstraction](http://engineering.linkedin.com/distributed-systems/log-what-every-software-engineer-should-know-about-real-time-datas-unifying) (Kreps) + +## 2015-04-22 +* Chapter 7-8 of [The Field Guide To Understanding Human Error](http://www.amazon.com/Field-Guide-Understanding-Human-Error/dp/1472439058) (Dekker) + +## 2015-04-15 +* Chapter 5-6 of [The Field Guide To Understanding Human Error](http://www.amazon.com/Field-Guide-Understanding-Human-Error/dp/1472439058) (Dekker) + +## 2015-04-08 +* Chapter 3-4 of [The Field Guide To Understanding Human Error](http://www.amazon.com/Field-Guide-Understanding-Human-Error/dp/1472439058) (Dekker) + +## 2015-04-01 +* Chapter 1-2 of [The Field Guide To Understanding Human Error](http://www.amazon.com/Field-Guide-Understanding-Human-Error/dp/1472439058) (Dekker) + +## 2015-03-25 +* [Learning to Embrace Failure](http://best.dtu.dk/SC13/p20-casestudy.pdf) (Limoncelli et al.) +* [Some Rules for Engineering and Operations](http://blog.b3k.us/2012/01/24/some-rules.html) (Black) + +## 2015-03-18 +* [Out of the Tar Pit](http://shaffner.us/cs/papers/tarpit.pdf) (Moseley and Marks) + +## 2015-03-11 +* [Spanner: Google’s Globally-Distributed Database](http://research.google.com/archive/spanner-osdi2012.pdf) (Corbett et al.) +* [The Trouble with Timestamps](http://aphyr.com/posts/299-the-trouble-with-timestamps) (Kingsbury) + +## 2015-03-04 +* Chapter 16-18 [The Tangled Web: A Guide to Securing Modern Web Applications](http://www.amazon.com/The-Tangled-Web-Securing-Applications/dp/1593273886) (Zalewski) + +## 2015-02-25 +* Chapter 13-15 [The Tangled Web: A Guide to Securing Modern Web Applications](http://www.amazon.com/The-Tangled-Web-Securing-Applications/dp/1593273886) (Zalewski) + +## 2015-02-18 +* Chapter 10-12 [The Tangled Web: A Guide to Securing Modern Web Applications](http://www.amazon.com/The-Tangled-Web-Securing-Applications/dp/1593273886) (Zalewski) + +## 2015-02-11 +* Chapter 7-9 [The Tangled Web: A Guide to Securing Modern Web Applications](http://www.amazon.com/The-Tangled-Web-Securing-Applications/dp/1593273886) (Zalewski) + +## 2015-02-04 +* Chapter 4-6 [The Tangled Web: A Guide to Securing Modern Web Applications](http://www.amazon.com/The-Tangled-Web-Securing-Applications/dp/1593273886) (Zalewski) + +## 2015-01-28 +* Chapters 1-3 [The Tangled Web: A Guide to Securing Modern Web Applications](http://www.amazon.com/The-Tangled-Web-Securing-Applications/dp/1593273886) (Zalewski) + +## 2015-01-21 +* [In Search of an Understandable Consensus Algorithm] (https://ramcloud.stanford.edu/raft.pdf) (Ongaro-Ousterhout) +* [Paxos Made Simple] (http://research.microsoft.com/en-us/um/people/lamport/pubs/paxos-simple.pdf) (Lamport) + +## 2015-01-14 +* [The Chubby Lock Service for Loosely Coupled Distributed Systems](http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/archive/chubby-osdi06.pdf) (Burrows) + +## 2015-01-07 +* Chapters 9-18 of [High Performance Browser Networking](http://chimera.labs.oreilly.com/books/1230000000545/index.html) (Grigorik) + +## 2014-12-31 +* [no meeting] + +## 2014-12-24 +* [no meeting] + +## 2014-12-17 +* [no meeting] + +## 2014-12-10 +* Chapters 5-8 of [High Performance Browser Networking](http://chimera.labs.oreilly.com/books/1230000000545/index.html) (Grigorik) + +## 2014-12-03 +* Chapters 1-4 of [High Performance Browser Networking](http://chimera.labs.oreilly.com/books/1230000000545/index.html) (Grigorik) + +## 2014-11-26 +* [no meeting] + +## 2014-11-19 +* [The Google File System](http://research.google.com/archive/gfs-sosp2003.pdf) (Ghemawat et al.) + +## 2014-11-12 +* [On Designing and Deploying Internet Scale Services](http://mvdirona.com/jrh/talksAndPapers/JamesRH_Lisa.pdf) (Hamilton) + +## 2014-11-05 +* Chapters 5-8 of [Effective Monitoring and Alerting](http://www.amazon.com/Effective-Monitoring-Alerting-For-Operations/dp/1449333524) (Ligus) + +## 2014-10-29 +* Chapters 3-4 of [Effective Monitoring and Alerting](http://www.amazon.com/Effective-Monitoring-Alerting-For-Operations/dp/1449333524) (Ligus) + +## 2014-10-22 +* Chapters 1-2 of [Effective Monitoring and Alerting](http://www.amazon.com/Effective-Monitoring-Alerting-For-Operations/dp/1449333524) (Ligus) + +## 2014-10-15 +* [Resilience Engineering: Part I](http://www.kitchensoap.com/2011/04/07/resilience-engineering-part-i/) +* [Resilience Engineering: Part II](http://www.kitchensoap.com/2012/06/18/resilience-engineering-part-ii-lenses/) (Allspaw) + +## 2014-10-08 +* [Getting Real About Distributed Systems Reliability](http://blog.empathybox.com/post/19574936361/getting-real-about-distributed-system-reliability) (Kreps) +* [My Philosophy on Alerting](https://docs.google.com/document/d/199PqyG3UsyXlwieHaqbGiWVa8eMWi8zzAn0YfcApr8Q/edit#heading=h.whsaboyw21nk) (Ewaschuk) + +## 2014-10-01 +* [Incident Response at Heroku](https://blog.heroku.com/archives/2014/5/9/incident-response-at-heroku) (McGranaghan) +* [Observability at Twitter](https://blog.twitter.com/2013/observability-at-twitter) (Watson) + +## 2014-09-24 +* Chapter 16-18 of [Release It!: Design and Deploy Production-Ready Software](http://www.amazon.com/Release-It-Production-Ready-Pragmatic-Programmers/dp/0978739213) (Nygard) + +## 2014-09-17 +* Chapter 11-15 of [Release It!: Design and Deploy Production-Ready Software](http://www.amazon.com/Release-It-Production-Ready-Pragmatic-Programmers/dp/0978739213) (Nygard) + +## 2014-09-10 +* Chapter 6-10 of [Release It!: Design and Deploy Production-Ready Software](http://www.amazon.com/Release-It-Production-Ready-Pragmatic-Programmers/dp/0978739213) (Nygard) + +## 2014-09-03 +* Chapter 5 (Stability Patterns) of [Release It!: Design and Deploy Production-Ready Software](http://www.amazon.com/Release-It-Production-Ready-Pragmatic-Programmers/dp/0978739213) (Nygard) + +## 2014-08-27 +* Chapter 4 (Stability Antipatterns) of [Release It!: Design and Deploy Production-Ready Software](http://www.amazon.com/Release-It-Production-Ready-Pragmatic-Programmers/dp/0978739213) (Nygard) + +## 2014-08-20 +* Chapters 1-3 of [Release It!: Design and Deploy Production-Ready Software](http://www.amazon.com/Release-It-Production-Ready-Pragmatic-Programmers/dp/0978739213) (Nygard) + +## 2014-08-13 +* [Fallacies of Distributed Computing Explained](http://www.rgoarchitects.com/Files/fallacies.pdf) (Rotem-Gal-Oz) +* [You Can’t Sacrifice Partition Tolerance](http://codahale.com/you-cant-sacrifice-partition-tolerance/) (Hale) + +## 2014-08-06 +* [Notes on Distributed Systems for Young Bloods](http://www.somethingsimilar.com/2013/01/14/notes-on-distributed-systems-for-young-bloods/) (Hodges)