ggplot2.SparkR is an R package for scalable visualization of big data represented in Spark DataFrame. It is an extension to the original ggplot2 package and can seamlessly handle both R data.frame and Spark DataFrame with no modifications to the original API.
ggplot2.SparkR requires no additional training for existing R users who are already familiar with ggplot2 and allows them to benefit from powerful distributed processing capabilities of Spark for efficient visualization of big data.
Until now, 6 graph types (bar, bin2d, boxplot, freqpoly, histogram, stat-sum graphs) and 15 options are supported. We plans to further extend it in the future.
Find out more at http://skku-skt.github.io/ggplot2.SparkR/
Get the development version from github:
# install.packages("devtools") devtools::install_github("SKKU-SKT/ggplot2.SparkR")