spark-install

Cross-platform installer for Apache Spark.

This project provides a cross-platform installer for Apache Spark, designed to use system resources efficiently under a common API. This initial version supports R and Python and arose from a collaboration between RStudio and Microsoft.

R

# install from GitHub
devtools::install_github(repo = "rstudio/spark-install", subdir = "R")
library(sparkinstall)

# lists the versions available to install
spark_available_versions()

# installs a specific version
spark_install(version = "1.6.2")

# uninstalls a specific version
spark_uninstall(version = "1.6.2", hadoop_version = "2.6")

Python

# load the installer from GitHub
try:
    from urllib.request import urlopen  # Python 3.X
except ImportError:
    from urllib import urlopen          # Python 2.X
# run the downloaded script so its functions are defined in the current session
exec(urlopen("https://raw.githubusercontent.com/rstudio/spark-install/master/Python/spark_install.py").read(), globals())

# lists the versions available to install
spark_versions()

# installs a specific version
spark_install(spark_version = "1.6.2")

# uninstalls a specific version
spark_uninstall(spark_version = "1.6.2", hadoop_version = "cdh4")
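
The Python steps above load the installer script straight into the session's globals. If you would rather keep its names separate from your own, one option is to run the script in a dedicated dictionary. The following is a minimal sketch, not part of this project: `load_spark_install` and its `fetch` parameter are hypothetical names introduced here for illustration, and the only thing assumed about the script is that it defines the functions shown above.

```python
# Hypothetical helper (not provided by spark-install): downloads the
# installer script and runs it in its own namespace instead of globals().
SPARK_INSTALL_URL = (
    "https://raw.githubusercontent.com/rstudio/spark-install/"
    "master/Python/spark_install.py"
)


def load_spark_install(url=SPARK_INSTALL_URL, fetch=None):
    """Return a dict holding the names defined by the installer script.

    `fetch` is an assumed hook: a callable that takes a URL and returns
    the script source. It defaults to a plain urlopen download.
    """
    if fetch is None:
        try:
            from urllib.request import urlopen  # Python 3.X
        except ImportError:
            from urllib import urlopen          # Python 2.X

        def fetch(target):
            return urlopen(target).read()

    namespace = {}
    # Execute the downloaded source; its definitions land in `namespace`.
    exec(fetch(url), namespace)
    return namespace
```

With this helper, a call such as `installer = load_spark_install()` followed by `installer["spark_install"](spark_version = "1.6.2")` would behave like the direct calls above, assuming the script defines `spark_install` as shown. As with the original approach, this executes remote code, so only use it with a URL you trust.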
