Create your own GitHub profile
Sign up for your own profile on GitHub, the best place to host code, manage projects, and build software alongside 28 million developers.Sign up
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Chapter-wise code for Agile Data the O'Reilly book
This is a HOWTO for collecting data in Ruby and Python applications and sending it to S3 via Kafka.
Hortonworks demo of Enron emails with Pig, Cassandra, Python and Flask
Code for creating and querying an Avro encoded repository of the UC Berkeley Enron email archive