Mirror of Apache Pig
Java Perl GAP PigLatin Shell Python Other
Clone or download
Latest commit d84817a Jul 2, 2018
Permalink
Failed to load latest commit information.
.eclipse.templates PIG-5287: bump jython to 2.7.1 (dbist13 via rohini) Aug 11, 2017
bin PIG-5246: Modify bin/pig about SPARK_HOME, SPARK_ASSEMBLY_JAR after u… Jul 25, 2017
conf PIG-4948: Pig on Tez AM use too much memory on a small cluster Jul 19, 2016
contrib PIG-5317: Upgrade old dependencies: commons-lang, hsqldb, commons-log… Jan 2, 2018
dev-support/docker PIG-4526: Make setting up the build environment easier (nielsbasjes v… Apr 27, 2016
ivy PIG-5344: Update Apache HTTPD LogParser to latest version (nielsbasje… Jul 2, 2018
lib-src/bzip2/org/apache PIG-4496: Fix CBZip2InputStream to close underlying stream May 12, 2015
license PIG-4324: Remove jsch-LICENSE.txt Nov 11, 2014
shims PIG-4923 Removing empty directories Jan 9, 2017
src PIG-5341: PigStorage with -tagFile/-tagPath produces incorrect result… Jun 5, 2018
test PIG-5341: PigStorage with -tagFile/-tagPath produces incorrect result… Jun 5, 2018
tutorial PIG-5282: Upgade to Java 8 (satishsaley via rohini) Aug 21, 2017
.gitignore PIG-4923: Drop Hadoop 1.x support in Pig 0.17 (szita via rohini) Jan 7, 2017
BUILDING.md PIG-4923: Drop Hadoop 1.x support in Pig 0.17 (szita via rohini) Jan 7, 2017
CHANGES.txt PIG-5344: Update Apache HTTPD LogParser to latest version (nielsbasje… Jul 2, 2018
KEYS Adding PGP public key for szita Jun 2, 2017
LICENSE.txt PIG-692 When running a job from a script, use that script name as the… Mar 5, 2009
NOTICE.txt PIG-4324: Remove jsch from NOTICE.txt Nov 15, 2017
README.txt PIG-4519: Correct link to Contribute page Apr 24, 2015
RELEASE_NOTES.txt updated external reference to point to hadoop's new common dir Jun 22, 2009
autocomplete PIG-692 When running a job from a script, use that script name as the… Mar 5, 2009
build.xml PIG-5317: Upgrade old dependencies: commons-lang, hsqldb, commons-log… Jan 2, 2018
doap_Pig.rdf Added doap file. This will be used in listing Pig on Apache's index o… May 16, 2011
ivy.xml PIG-5344: Update Apache HTTPD LogParser to latest version (nielsbasje… Jul 2, 2018
start-build-env.sh PIG-4526: Make setting up the build environment easier (nielsbasjes v… Apr 27, 2016

README.txt

Apache Pig
===========
Pig is a dataflow programming environment for processing very large files. Pig's
language is called Pig Latin. A Pig Latin program consists of a directed
acyclic graph where each node represents an operation that transforms data.
Operations are of two flavors: (1) relational-algebra style operations such as
join, filter, project; (2) functional-programming style operators such as map,
reduce. 

Pig compiles these dataflow programs into (sequences of) map-reduce or Apache Tez
jobs and executes them using Hadoop. It is also possible to execute Pig Latin
programs in a "local" mode (without Hadoop cluster), in which case all 
processing takes place in a single local JVM. 

General Info
===============

For the latest information about Pig, please visit our website at:

   http://pig.apache.org/

and our wiki, at:

   http://wiki.apache.org/pig/

Getting Started
===============
1. To learn about Pig, try http://wiki.apache.org/pig/PigTutorial
2. To build and run Pig, try http://wiki.apache.org/pig/BuildPig and
http://wiki.apache.org/pig/RunPig
3. To check out the function library, try http://wiki.apache.org/pig/PiggyBank


Contributing to the Project
===========================

We welcome all contributions. For the details, please, visit
https://cwiki.apache.org/confluence/display/PIG/HowToContribute