GitHub is home to over 31 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Pig Visualization framework
Build a pipeline for Redshift Data Warehouse http://mortardata.com
Mortar Project with examples for several different public data sets and data types/formats
Mortar's JSON tools for Apache Pig
An example of how to run a Mortar Pig script through Luigi
A customizable recommendation engine for Hadoop and Pig by Mortar Data.
Runnable examples and templates for connecting to MongoDB data with Mortar Hadoop.
Visualize your mortar-recsys recommendation engine results
Tool for generating config files using a template and environment variables.
MongoDB adapter for Hadoop
An Apache Pig storage function for DynamoDB by Mortar Data.
Template Project for Creating new Java Loaders and UDFs for Pig
Example Mortar project for working with the million song dataset
Mirror of Apache Sqoop
Recommendation Engine for Github
pygments syntax highlighting in ruby
Collection of Pig scripts using data in MongoDB
John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm
Python Secure FTP module (Forked from Google Code Project with a couple of new methods)
Description and terms for the Netflix Cloud Prize, which runs from March-September 2013. Read the rules, fork to your GitHub account to create a Submission, then send us your email address.
Mortar plugin for retrieving pig job outputs from S3
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
Simple syntax highlighting for writing Pig scripts (http://hadoop.apache.org/pig) in Textmate.
A fork of heroku-buildpack-python that includes some additional build dependencies: libffi
Pig load functions for reading in logs from papertrail on Mortar.
Mirror of Apache Avro