Skip to content

GwonChun/ggplot2.SparkR

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ggplot2.SparkR

ggplot2.SparkR is an R package for scalable visualization of big data represented in Spark DataFrame. It is an extension to the original ggplot2 package and can seamlessly handle both R data.frame and Spark DataFrame with no modifications to the original API.

ggplot2.SparkR requires no additional training for existing R users who are already familiar with ggplot2 and allows them to benefit from powerful distributed processing capabilities of Spark for efficient visualization of big data.

Until now, 6 graph types (bar, bin2d, boxplot, freqpoly, histogram, stat-sum graphs) and 15 options are supported. We plans to further extend it in the future.

Find out more at http://skku-skt.github.io/ggplot2.SparkR/

Installation

Get the development version from github:

# install.packages("devtools")
devtools::install_github("SKKU-SKT/ggplot2.SparkR")

Mailing list

Your are welcome to ask ggplot2.SparkR questions or bugs on ggplot2.SparkR or send an email to ggplot2-sparkr@googlegroups.com. Anyone can read the archived discussion that you post messages.

Other Resources

  • ggplot2: Plotting system for R by Hadley Wickham
  • Apache Spark: Large-scale data processing engine.

About

Rebooting ggplot2 for scalable big data visualization

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • R 100.0%