RHadoop is a collection of three R packages that allow users to manage and analyze data with Hadoop. The packages have been implemented and tested with Cloudera's distribution of Hadoop (CDH3) and R 2.13.0. They have also been tested with Revolution R 4.3 and 5.0.
RHadoop consists of the following packages:
rmr - functions providing Hadoop MapReduce functionality in R
rhdfs - functions providing file management of the HDFS from within R
rhbase - functions providing database management for the HBase distributed database from within R
Overview of RHadoop, from the Revolution Analytics blog.
Slides and replay of a 30-minute presentation about RHadoop, "Leveraging R in Hadoop Environments".