Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Mirror of Apache Pig
Fetching latest commit…
Cannot retrieve the latest commit at this time.
|Failed to load latest commit information.|
Apache Pig =========== Pig is a dataflow programming environment for processing very large files. Pig's language is called Pig Latin. A Pig Latin program consists of a directed acyclic graph where each node represents an operation that transforms data. Operations are of two flavors: (1) relational-algebra style operations such as join, filter, project; (2) functional-programming style operators such as map, reduce. Pig compiles these dataflow programs into (sequences of) map-reduce jobs and executes them using Hadoop. It is also possible to execute Pig Latin programs in a "local" mode (without Hadoop cluster), in which case all processing takes place in a single local JVM. General Info =============== For the latest information about Pig, please visit our website at: http://pig.apache.org/ and our wiki, at: http://wiki.apache.org/pig/ Getting Started =============== 1. To learn about Pig, try http://wiki.apache.org/pig/PigTutorial 2. To build and run Pig, try http://wiki.apache.org/pig/BuildPig and http://wiki.apache.org/pig/RunPig 3. To check out the function library, try http://wiki.apache.org/pig/PiggyBank Contributing to the Project =========================== We welcome all contributions. For the details, please, visit http://wiki.apache.org/pig/HowToContribute.