- Agile_Data_Code 139 Chapter-wise code for Agile Data the O'Reilly book
- Collecting-Data 27 This is a HOWTO for collecting data in Ruby and Python applications and sending it to S3 via Kafka.
- Cloud-Stenography 16 Main Repo
- enron-python-flask-cassandra-pig 15 Hortonworks demo of Enron emails with Pig, Cassandra, Python and Flask
- pig-to-json 14 A Pig to JSON UDF for Pig that converts tuples and bags to JSON strings