Skip to content
Scripting Languages on Hadoop: Jaql vs. Pig Latin (MapReduce stuff)
Java Shell Ruby
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.
benchmarks readded gnuplot files Sep 1, 2009
java/markov moved #2 Aug 12, 2009
paper final changes Aug 31, 2009
pig Final cleanup Sep 1, 2009
presentation Final cleanup Sep 1, 2009
ruby more fixes Aug 31, 2009
.gitignore ignore eps Aug 31, 2009
readme.txt Final cleanup Sep 1, 2009 no pig Aug 12, 2009


This code has been written by Johan Uhle and Konstantin Haase during the summer term 2009 at Hasso Plattner Institute, Universität Potsdam, D-14482 Potsdam, Germany, in the seminar “Map/Reduce Algorithms on Hadoop” supervised by Alexander Albrecht and Prof. Felix Naumann.

Since this is not an eclipse (nor a java) project, we did not provide an Ant file.

For running wordcount benchmarks:
Make sure you have setup hadoop, jaql and pig.

For runnning markov benchmarks:
Make sure you have setup hadoop and pig.
Execute ./

For generating the pdf:
Make sure you have latex, gnuplot and ruby installed.
Execute rake

For generating the Pig UDF:
cd pig/splitsuc
javac -cp pig.jar
cd ..
jar -cf splitsuc.jar splitsuc
Something went wrong with that request. Please try again.