Fork of Hadoop JobControl classes, working on some changes I'd like to see
Java
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
src/kdp
.gitignore
README
build.xml

README

JobControl fork

Extending Hadoop JobControl to handle more complex tasks.

Hadoop Enhanced JobControl
==========================

Extend Hadoop JobControl to handle more complex tasks.

- Allow file system operations (move, delete) as dependencies.
- Restart failed workflows.
- Allow composition of workflows (i.e a workflow as a dependency).
- Expose a well-defined extension point for adding Pig, Cascading, etc.
- Simplify job creation using a Scala-based DSL.

Examples
--------

Sample project (three stage extraction of a representative keyword
from a piece of text) to use JobControl.