Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
How to use Wukong to run R scripts in Hadoop as well as locally
R Ruby
branch: master

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.
README.rdoc
mapper.R
reducer.R
word_count.rb

README.rdoc

Writing a Hadoop Job in R using Wukong

First, install everything (following works on Ubuntu 11.10):

$ sudo apt-get install r-core littler
$ sudo gem install wukong

Run the provided scripts locally using

$ ruby word_count.rb --run=local --map_command='r mapper.R' --reduce_command='r reducer.R' INPUT_FILE OUTPUT_FILE

And on Hadoop with

$ ruby word_count.rb --run=hadoop --map_command='r mapper.R' --reduce_command='r reducer.R' INPUT_FILE OUTPUT_FILE
Something went wrong with that request. Please try again.