Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP
Mirror of Apache Pig
Java Perl GAP PigLatin Shell Python Other

PIG-4657: [Pig on Tez] Optimize GroupBy and Distinct key comparison (…

…rohini)

git-svn-id: https://svn.apache.org/repos/asf/pig/trunk@1696491 13f79535-47bb-0310-9956-ffa450edef68
latest commit 797a64705e
Rohini Palaniswamy authored
Failed to load latest commit information.
.eclipse.templates PIG-3522: Remove shock from pig
bin PIG-4403: Combining -Dpig.additional.jars.uris with -useHCatalog brea…
conf PIG-4649: [Pig on Tez] Union followed by HCatStorer misses some data …
contrib Rollback PIG-4623: Fixed the 'new line' character inside double-quote…
ivy PIG-4639: Add better parser for Apache HTTPD access log
lib-src/bzip2/org/apache PIG-4496: Fix CBZip2InputStream to close underlying stream
license PIG-4324: Remove jsch-LICENSE.txt
shims PIG-4530: StackOverflow in TestMultiQueryLocal running under hadoop20…
src PIG-4657: [Pig on Tez] Optimize GroupBy and Distinct key comparison (…
test PIG-4657: [Pig on Tez] Optimize GroupBy and Distinct key comparison (…
tutorial PIG-4047: Break up pig withouthadoop and fat jar
.gitignore PIG-4330: Regression test for PIG-3584 - AvroStorage does not correct…
CHANGES.txt PIG-4657: [Pig on Tez] Optimize GroupBy and Distinct key comparison (…
KEYS Adding prkommireddi public key to KEYS
LICENSE.txt PIG-692 When running a job from a script, use that script name as the…
NOTICE.txt Update copyright year in NOTICE
README.txt PIG-4519: Correct link to Contribute page
RELEASE_NOTES.txt updated external reference to point to hadoop's new common dir
autocomplete PIG-692 When running a job from a script, use that script name as the…
build.xml PIG-4650: ant mvn-deploy target is broken
doap_Pig.rdf Added doap file. This will be used in listing Pig on Apache's index o…
ivy.xml PIG-4639: Add better parser for Apache HTTPD access log

README.txt

Apache Pig
===========
Pig is a dataflow programming environment for processing very large files. Pig's
language is called Pig Latin. A Pig Latin program consists of a directed
acyclic graph where each node represents an operation that transforms data.
Operations are of two flavors: (1) relational-algebra style operations such as
join, filter, project; (2) functional-programming style operators such as map,
reduce. 

Pig compiles these dataflow programs into (sequences of) map-reduce or Apache Tez
jobs and executes them using Hadoop. It is also possible to execute Pig Latin
programs in a "local" mode (without Hadoop cluster), in which case all 
processing takes place in a single local JVM. 

General Info
===============

For the latest information about Pig, please visit our website at:

   http://pig.apache.org/

and our wiki, at:

   http://wiki.apache.org/pig/

Getting Started
===============
1. To learn about Pig, try http://wiki.apache.org/pig/PigTutorial
2. To build and run Pig, try http://wiki.apache.org/pig/BuildPig and
http://wiki.apache.org/pig/RunPig
3. To check out the function library, try http://wiki.apache.org/pig/PiggyBank


Contributing to the Project
===========================

We welcome all contributions. For the details, please, visit
https://cwiki.apache.org/confluence/display/PIG/HowToContribute
Something went wrong with that request. Please try again.