WNE SparkR Workshop - set of scripts and notebooks (updated Jun 12, 2019; R)
Apache Spark is an open-source, distributed, general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Kaggle - Outbrain Click Prediction (Oct-2016 - Jan-2017)
Linear Model Interaction Terms Optimizer
Projects created using R
Some notes for R users on using R on a Databricks Spark cluster.
Mirror of https://gitlab.com/zero323/dlt
Access The Spark Catalog API Via 'sparklyr'
Alternative read and write methods for sparklyr
A simple script that reads a web log file into a Spark cluster and counts the frequency of each type of HTTP reply
Created by Matei Zaharia
Released May 26, 2014