- Agile_Data_Code 139 Chapter-by-chapter code for Agile Data, the O'Reilly book
- Collecting-Data 27 A HOWTO for collecting data in Ruby and Python applications and sending it to S3 via Kafka
- Cloud-Stenography 16 Main Repo
- enron-python-flask-cassandra-pig 15 Hortonworks demo of Enron emails with Pig, Cassandra, Python and Flask
- enron-node-mongo 14 Building a simple Node application with Pig, MongoDB, Node.js and the Enron emails