Create your own GitHub profile
Sign up for your own profile on GitHub, the best place to host code, manage projects, and build software alongside 28 million developers.
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster. See https://github.com/Cascading/cascading for the release repos…
Extensions for Cascading’s local/streaming execution mode
Simple bash functions for manipulating Amazon Elastic MapReduce clusters
A simple script for initializing an EC2 command line environment
Random notes on distributed computing and stuff.
in the last year
Recursive descent copy will duplicate array values:
COMPILER.nested( "/**" ).copy( from, into );
Press h to open a hovercard with more details.