Mirror of Apache Crunch (Incubating)

Welcome to Apache Crunch!

Apache Crunch is a Java library for writing, testing, and running Hadoop
MapReduce pipelines, based on Google's FlumeJava. Its goal is to make
pipelines that are composed of many user-defined functions simple to write,
easy to test, and efficient to run.

For more information please see the website:

Building the Source Code

We recommend Maven 3 and JDK 6 for building Crunch. To build the project,
run the following Maven command:

  mvn package

To run the integration test suite and to install the created JARs in your
local Maven cache:

  mvn install

Crunch has experimental support for Hadoop 2 through the "hadoop-2" build
profile (add -Phadoop-2 to enable it). To use HBase support on Hadoop 2,
you first have to build HBase 0.94.3 from source using the following
command:

  mvn clean install -Dhadoop.profile=2.0
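
As an illustration of the profile flag described above, a Hadoop 2 build of
Crunch would presumably add -Phadoop-2 to either of the build commands. This
only combines the flag and the commands given in this README and is a sketch,
not a verified recipe:

  mvn package -Phadoop-2

or, to also run the integration tests and install the JARs locally:

  mvn install -Phadoop-2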