Skip to content
This repository

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

Scripting Languages on Hadoop: Jaql vs. Pig Latin (MapReduce stuff)

branch: master

Fetching latest commit…

Octocat-spinner-32-eaf2f5

Cannot retrieve the latest commit at this time

Octocat-spinner-32 benchmarks
Octocat-spinner-32 jaql
Octocat-spinner-32 java
Octocat-spinner-32 paper
Octocat-spinner-32 pig
Octocat-spinner-32 presentation
Octocat-spinner-32 ruby
Octocat-spinner-32 .gitignore
Octocat-spinner-32 LICENSE
Octocat-spinner-32 Rakefile
Octocat-spinner-32 markov.sh
Octocat-spinner-32 readme.txt
Octocat-spinner-32 wordcount.sh
readme.txt
This code has been written by Johan Uhle and Konstantin Haase during the summer term 2009 at Hasso Plattner Institute, Universität Potsdam, D-14482 Potsdam, Germany, in the seminar “Map/Reduce Algorithms on Hadoop” supervised by Alexander Albrecht and Prof. Felix Naumann.

Since this is not an eclipse (nor a java) project, we did not provide an Ant file.

For running wordcount benchmarks:
Make sure you have setup hadoop, jaql and pig.
  ./wordcount.sh

For runnning markov benchmarks:
Make sure you have setup hadoop and pig.
Execute ./markov.sh

For generating the pdf:
Make sure you have latex, gnuplot and ruby installed.
Execute rake

For generating the Pig UDF:
cd pig/splitsuc
javac -cp pig.jar SPLITSUC.java STORESQL.java
cd ..
jar -cf splitsuc.jar splitsuc
Something went wrong with that request. Please try again.