Dumbo IP count written in C.
A fast Python module for dealing with so called "typed bytes".
Python module that allows one to easily write and run Hadoop programs.
Official Python low-level client for Elasticsearch.
Java classes that can be useful for Dumbo programs that run on Hadoop Streaming.
Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
Stand-alone language identification system
Simple Python module for logistic regression
An interactive SSL-capable intercepting HTTP proxy for penetration testers and software developers
A collection of Hadoop Pig utilities.
Scripts used to setup a Spark cluster on EC2
SPark on AWS
A Python module for dealing with so called "typed bytes".