Latest commit 67cc7db Sep 22, 2014 @andreybratus Update README.md

RefineOnSpark

RefineOnSpark is a driver program that runs OpenRefine jobs on a Spark cluster.

1. Prerequisites on the cluster

  • An instance of OpenRefine is running and bound to the default address, localhost:3333.
  • Input files are served via HDFS. Local files are also accepted, but they must be located under the same path on all worker nodes.
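
The two prerequisites above can be checked before submitting a job. The following is a minimal sketch and not part of RefineOnSpark itself: the class name `PrerequisiteCheck`, its helper methods, and the example input location `hdfs://namenode:8020/data/input.csv` are all hypothetical. It only tests that something accepts TCP connections on localhost:3333 and whether an input location uses the `hdfs` scheme. It does not verify that the listener is really OpenRefine, nor that a local file exists on every worker.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;
import java.net.URI;

// Hypothetical helper, not part of the RefineOnSpark code base.
class PrerequisiteCheck {

    // Default OpenRefine address named in the prerequisites.
    static final String REFINE_HOST = "localhost";
    static final int REFINE_PORT = 3333;

    /** Returns true if a TCP connection to host:port succeeds within timeoutMs. */
    static boolean isReachable(String host, int port, int timeoutMs) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMs);
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    /** Returns true if the location is a URI with the hdfs scheme. */
    static boolean isHdfsPath(String location) {
        // A plain local path such as /data/input.csv has no scheme (null).
        String scheme = URI.create(location).getScheme();
        return "hdfs".equalsIgnoreCase(scheme);
    }

    public static void main(String[] args) {
        // Example location is an assumption; pass a real one as the first argument.
        String input = args.length > 0 ? args[0] : "hdfs://namenode:8020/data/input.csv";

        if (!isReachable(REFINE_HOST, REFINE_PORT, 2000)) {
            System.out.println("Nothing is listening on " + REFINE_HOST + ":" + REFINE_PORT);
        } else {
            System.out.println("A service is listening on " + REFINE_HOST + ":" + REFINE_PORT);
        }

        if (isHdfsPath(input)) {
            System.out.println("Input is served via HDFS: " + input);
        } else {
            System.out.println("Input looks local; it must exist under the same path on all worker nodes: " + input);
        }
    }
}
```

Because the check runs against localhost, it would have to be run on each worker node to cover the whole cluster.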

2. Application taxonomy

TODO