Apache Spark
Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Here are 53 public repositories matching this topic...
Kaggle - Outbrain Click Prediction (Oct-2016 - Jan-2017)
-
Updated
Apr 21, 2017 - R
Comparison between the implementations of the Lasso algorithm between the Spark MLib library and the R glmnet package.
-
Updated
Jun 21, 2017 - R
Word Prediction using Markov Chains.
-
Updated
Jul 8, 2017 - R
A simple script that reads in a web log file into a Spark cluster and determines frequency count for different types of HTTP reply
-
Updated
Oct 25, 2017 - R
RsparkleR provides an R interface for launching virtual machines and deploying Sparkler
-
Updated
Jan 2, 2018 - R
Projects created using R
-
Updated
Mar 19, 2018 - R
RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
-
Updated
Jul 19, 2018 - R
Machine Learning and Deep Learning Course
-
Updated
Feb 18, 2019 - R
Sparklyr extension package providing geospatial analytics capabilities
-
Updated
Mar 8, 2019 - R
Interpretable Machine Learning with rsparkling
-
Updated
Jun 9, 2019 - R
WNE SparkR Workshop - set of scripts and notebooks
-
Updated
Jun 12, 2019 - R
💨 R Package for analyzing Discharge Abstract Database (CIHI) using Apache Spark
-
Updated
Jul 23, 2019 - R
Created by Matei Zaharia
Released May 26, 2014
- Followers
- 416 followers
- Repository
- apache/spark
- Website
- spark.apache.org
- Wikipedia
- Wikipedia