Data and example code for Programming Pig, by Alan F. Gates


This repository contains the example data and Pig Latin scripts for the book
_Programming Pig_ by Alan F. Gates, published by O'Reilly.  All data used in
the examples is in the public domain.  All Pig Latin scripts and associated
user-defined functions (UDFs) are released under the Apache 2.0 license.

In this repository, the setup directory contains the code used to take the
data from its source and prepare it for the examples.  The data directory
contains the cleansed data, ready for use in the examples.  The examples
directory contains the example Pig Latin scripts, divided by chapter.  The
udfs directory contains the UDFs used in the examples.