
sparklyr_test

Testing the sparklyr package for data wrangling & basic modelling.

https://aurora-mareviv.github.io/sparklyr_test

Contents

Jupyter Notebooks

sparklyr_test is a Jupyter notebook for interactive data exploration & modelling.

sparklyr_test2 is a second Jupyter notebook for interactive data exploration & modelling, this time with a much bigger dataset. View it on nbviewer.jupyter.org.

R script

sparklyr_script is an .R script that runs as a standalone Spark application, launched with spark-submit:

$ spark-submit --deploy-mode client sparklyr_script.R
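
To give an idea of what a script of this kind could contain, here is a minimal sketch of a sparklyr workflow covering data wrangling and basic modelling. It is not the repository's actual sparklyr_script.R: the local master, the built-in mtcars data set, and the names cars_tbl, summary_tbl and fit are all assumptions made for illustration.

```r
# Hypothetical sketch; not the contents of the repository's sparklyr_script.R.
library(sparklyr)
library(dplyr)

# Assumption: a local Spark installation is available (e.g. via spark_install()).
sc <- spark_connect(master = "local")

# Copy a small built-in data set into Spark.
cars_tbl <- copy_to(sc, mtcars, "mtcars", overwrite = TRUE)

# Data wrangling: mean mpg per cylinder count, computed in Spark,
# then collected back into a local data frame.
summary_tbl <- cars_tbl %>%
  group_by(cyl) %>%
  summarise(mean_mpg = mean(mpg, na.rm = TRUE), n = n()) %>%
  collect()
print(summary_tbl)

# Basic modelling: linear regression fitted with Spark MLlib.
fit <- ml_linear_regression(cars_tbl, mpg ~ wt + cyl)
print(fit)

# Close the connection when done.
spark_disconnect(sc)
```

With a real cluster, the master passed to spark_connect() and the data source would need to match your own setup.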

Clone this repo

$ git clone https://github.com/aurora-mareviv/sparklyr_test

Tutorial

Materials to start using Spark in R (sparklyr package).

References

Sparklyr

PySpark

Jupyter