This repository contains the example data and Pig Latin scripts for the book _Programming Pig_ by Alan F. Gates, published by O'Reilly. All data used in the examples is in the public domain. All Pig Latin scripts and associated user-defined functions (UDFs) are released under the Apache 2.0 license. The setup directory contains the code used to take the data from its source and prepare it for the examples. The data directory contains the cleansed data, ready for use in the examples. The examples directory contains the example Pig Latin scripts, divided by chapter. The udfs directory contains the UDFs used in the examples.