#50

Papers: Twitter Data Pipeline

Scaling Big Data Mining Infrastructure: The Twitter Experience (Lin and Rayboy) The Unified Logging Infrastructure for Data Analytics at Twitter (Lee et al.)

  • Opened by mmcgrana May 22, 2014
  • 1 comment
#49

add twttr research

added a link to the twitter research site

  • Opened by softprops May 21, 2014
  • 2 comments
#45

Post: A Note on Uptime

https://zvzzt.wordpress.com/2012/08/16/a-note-on-uptime/ (candidate resource)

#44

Post link leads to 404

http://odbms.org/download/dean-keynote-ladis2009.pdf (Design, Lessons, and Advice from Building Distributed Systems at Google) leads to 404 page.

  • Opened by bndr May 16, 2014
  • 1 comment
#42

CAP theorem

http://en.wikipedia.org/wiki/CAP_theorem I don't have any recommendations for specific papers currently, but I think it's an important concept for engineers to learn!

  • Opened by neoice May 14, 2014
  • 1 comment
#41

Release It

Could be a good survey / practical resource on operable apps. I think I've read this but it was a while ago, so I need to review.

  • Opened by mmcgrana May 14, 2014
  • 5 comments
#37

Kafka, The Log

Some good URLs around this that I know of: Kafka: A Distributed Messaging System for Log Processing (Kreps et al.) The Log: What every software engineer should know about real-time data's unifying ...

#36

Spark

The Spark work is really interesting and there are some good papers on it: http://people.csail.mit.edu/matei/papers/2010/hotcloud_spark.pdf https://www.usenix.org/system/files/conference/nsdi12/nsdi12-final138.pdf ...

  • Opened by mmcgrana May 10, 2014
  • 1 comment
#35

Courseware: Computer and Network Security

https://engineering.purdue.edu/kak/compsec/Lectures.html

  • Opened by mmcgrana Apr 15, 2014
  • 1 comment