Skip to content
This repository


Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

Mirror of Apache Pig

This branch is 0 commits ahead and 0 commits behind spork

.eclipse.templates PIG-2432 Eclipse .classpath file is out of date
bin PIG-2766: Pig-HCat Usability
conf [PIG-2632] Create a SchemaTuple which generates efficient Tuples via …
contrib PIG-2732: Let's get rid of the deprecated Tuple methods (PIG-2732-fix…
ivy Fix guava version to match Spark's
lib-src PIG-1304: Fail underlying M/R jobs when concatenated gzip and bz2 fil…
lib Updated to Spark 0.6.0
license PIG-965: PERFORMANCE: optimize common case in matches (PORegex) (anki…
shims PIG-2702: Make Pig local mode (and tests) faster by working around th…
src Merge pull request #35 from julienledem/spork
test Merge pull request #35 from julienledem/spork
tutorial PIG-2342: Pig tutorial documentation needs to update about building t…
.gitignore [PIG-2632] Create a SchemaTuple which generates efficient Tuples via …
CHANGES.txt PIG-2807: TestParser TestPigStorage TestNewPlanOperatorPlan broken by…
KEYS PIG-1728: doc updates (chandec via olgan)
LICENSE.txt PIG-692 When running a job from a script, use that script name as the…
NOTICE.txt PIG-2129: NOTICE file needs updates
README.txt -
RELEASE_NOTES.txt updated external reference to point to hadoop's new common dir
SPARK_README.txt Updated to Spark 0.6.0
autocomplete PIG-692 When running a job from a script, use that script name as the…
build.xml Updated to Spark 0.6.0
doap_Pig.rdf Added doap file. This will be used in listing Pig on Apache's index o…
ivy.xml Updated to Spark 0.6.0
Apache Pig
Pig is a dataflow programming environment for processing very large files. Pig's
language is called Pig Latin. A Pig Latin program consists of a directed
acyclic graph where each node represents an operation that transforms data.
Operations are of two flavors: (1) relational-algebra style operations such as
join, filter, project; (2) functional-programming style operators such as map,

Pig compiles these dataflow programs into (sequences of) map-reduce jobs and
executes them using Hadoop. It is also possible to execute Pig Latin programs
in a "local" mode (without Hadoop cluster), in which case all processing takes
place in a single local JVM. 

General Info

For the latest information about Pig, please visit our website at:

and our wiki, at:

Getting Started
1. To learn about Pig, try
2. To build and run Pig, try and
3. To check out the function library, try

Contributing to the Project

We welcome all contributions. For the details, please, visit

Something went wrong with that request. Please try again.