  • luigi

    Forked from spotify/luigi

    Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.

    Python 12 1,181 Updated Sep 13, 2016
  • Pig Visualization framework

    JavaScript 2 112 Updated Apr 7, 2016
  • Mortar Project with examples for several different public data sets and data types/formats

    PigLatin 38 32 Updated Oct 29, 2015
  • Mortar's JSON tools for Apache Pig

    Java 10 4 Updated Apr 2, 2015
  • An example of how to run a Mortar Pig script through Luigi

    Shell 2 5 Updated Feb 5, 2015
  • A customizable recommendation engine for Hadoop and Pig by Mortar Data.

    PigLatin 822 111 Updated Dec 22, 2014
  • Build a pipeline for Redshift Data Warehouse http://mortardata.com

    Python 15 11 Updated Dec 18, 2014
  • Runnable examples and templates for connecting to MongoDB data with Mortar Hadoop.

    PigLatin 2 6 Updated Oct 24, 2014
  • Visualize your mortar-recsys recommendation engine results

    JavaScript 1 2 Updated Oct 22, 2014
  • Tool for generating config files using a template and environment variables.

    Python 2 Updated Oct 4, 2014
  • MongoDB adapter for Hadoop

    Java 7 514 Updated Aug 11, 2014
  • An Apache Pig storage function for DynamoDB by Mortar Data.

    Java 8 10 Updated Jul 28, 2014
  • Template Project for Creating new Java Loaders and UDFs for Pig

    Java 15 18 Updated Jul 23, 2014
  • Example Mortar project for working with the Million Song Dataset

    Python 12 31 Updated Jul 2, 2014
  • sqoop

    Forked from apache/sqoop

    Mirror of Apache Sqoop

    Java 239 Updated May 7, 2014
  • Recommendation Engine for GitHub

    Java 20 2 Updated Apr 23, 2014
  • Pygments syntax highlighting in Ruby

    C 133 Updated Mar 15, 2014
  • Ruby 3 5 Updated Feb 13, 2014
  • Collection of Pig scripts using data in MongoDB

    Python 6 11 Updated Jan 19, 2014
  • John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm

    C++ 1,120 Updated Dec 20, 2013
  • Pysftp

    Forked from andybak/Pysftp

    Python Secure FTP module (Forked from Google Code Project with a couple of new methods)

    Python 10 Updated Dec 7, 2013
  • Java 3 2 Updated Oct 11, 2013
  • Description and terms for the Netflix Cloud Prize, which runs from March-September 2013. Read the rules, fork to your GitHub account to create a Submission, then send us your email address.

    520 Updated Sep 14, 2013
  • Ruby 2 Updated Sep 11, 2013
  • Mortar plugin for retrieving Pig job outputs from S3

    Ruby 2 Updated Jun 29, 2013
  • Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.

    Java 369 Updated Jun 22, 2013
  • Simple syntax highlighting for writing Pig scripts (http://hadoop.apache.org/pig) in TextMate.

    12 Updated Jun 19, 2013
  • A fork of heroku-buildpack-python that includes some additional build dependencies: libffi

    Python 37 Updated Jun 16, 2013
  • Pig load functions for reading in logs from Papertrail on Mortar.

    Java 2 Updated Apr 25, 2013
  • avro

    Forked from apache/avro

    Mirror of Apache Avro

    Java 467 Updated Apr 18, 2013