Skip to content


Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
Kettle plugin that provides support for interacting within many "big data" projects including Hadoop, Hive, HBase, Cassandra, MongoDB, and others.

Merge pull request #331 from e-cuellar/BACKLOG-2413

[BACKLOG-2413] Changes in "Open File" dialog to conform to MapRFS.
latest commit 8ebb6f36a2
@mdamour1976 mdamour1976 authored
Failed to load latest commit information.
build-res Revert "[ENGOPS-999] Fix junit availbility check.
package-res [ENGOPS-512] pack oss license file via ivy resolve
res [PDI-7843] - adding an icon for the Oozie Job Executor
samples/jobs/hadoop [BACKLOG-353] - loading the driver class list after a jar is selected…
src [BACKLOG-2413] Changes in "Open File" dialog to conform to MapRFS.
test-res [PDI-8044] Introduction of Hadoop Configuration (shim layer) to allow…
test-src [BACKLOG-2100] Remove Hadoop Clusters from Kettle
.classpath Fixed plugin project dependencies to mapred classes. Shims now publis…
.gitignore Now ignoring test artifacts (pdi-*)
.project BAD-38: Moved shims into their own project (pentaho-hadoop-shims)
LICENSE.txt [BAD-8] Updated license to Apache License, Version 2.0.
README.markdown Added project README [ENGOPS-1009] Add cdh53 version property
build.xml [BACKLOG-2052] big-data-plugin assembly includes metastore jar
ivy.xml [ENGOPS-1009] Mismatched dependency config
ivysettings.xml [ENGOPS-503] shared file update from distribute.groovy
package-ivy.xml [PDI-4601] Fixing artifacts
package-samples-ivy.xml [build] updated package-samples-ivy.xml to correctly add -samples to …
pentaho-big-data-plugin.iml [PDI-7843] - "Basic Options" mode functional. more extraction of re-u…


Pentaho Big Data Plugin

The Pentaho Big Data Plugin Project provides support for an ever-expanding Big Data community within the Pentaho ecosystem. It is a plugin for the Pentaho Kettle engine which can be used within Pentaho Data Integration (Kettle), Pentaho Reporting, and the Pentaho BI Platform.


The Pentaho Big Data Plugin is built with Apache Ant and uses Apache Ivy for dependency management. All you'll need to get started is Ant 1.7.0 or newer to build the project. The build scripts will download Ivy if you do not already have it installed.

$ git clone git://
$ cd big-data-plugin
$ ant

This will produce a plugin archive in dist/pentaho-big-data-plugin-${project.revision}.tar.gz (and .zip). This archive can then be extracted into your Pentaho Data Integration plugin directory.

Further Reading

Additional documentation is available on the Community wiki: Big Data Plugin for Java Developers


Licensed under the Apache License, Version 2.0. See LICENSE.txt for more information.

Something went wrong with that request. Please try again.