PivotalR is a package that enables users of R, the most popular open source statistical programming language and environment, to interact with Greenplum Database and PostgreSQL for big data analytics. It does so by providing an interface to operations on tables and views in the database. These operations are almost the same as those of data.frame, and only a minimal amount of data is transferred between R and the database. Thus R users do not need to learn SQL to operate on objects in the database. PivotalR also lets the user run the functions of the open source machine learning package Apache MADlib directly from R.
An Introduction to PivotalR
vignette("pivotalr") # execute in R console to view the PDF file
To install PivotalR:
Get the latest stable version from CRAN by running
install.packages("PivotalR")
Or try out the latest development version from GitHub by running the following code (requires R >= 3.0.2):
## install.packages("devtools") # the 'devtools' package is only available for R >= 3.0.2
devtools::install_github("greenplum-db/PivotalR")
Or download the source tarball directly from here, and then install it by running
install.packages("greenplum-db-PivotalR-xxxx.tar.gz", repos = NULL, type = "source")
where "greenplum-db-PivotalR-xxxx.tar.gz" is the name of the package that you have downloaded.
To get started: