Apache Spark
Apache Spark is an open-source, distributed, general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
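The cluster-programming model described above is usually exercised through Spark's `spark-submit` launcher. A minimal sketch, assuming a Spark distribution unpacked in the current directory (the example jar path follows the standard distribution layout; the wildcard avoids pinning a version):

```shell
# Run the bundled SparkPi example on 4 local cores. Spark splits the
# 100 sampling tasks across the cores automatically -- this is the
# "implicit data parallelism" the framework provides.
./bin/spark-submit \
  --class org.apache.spark.examples.SparkPi \
  --master "local[4]" \
  examples/jars/spark-examples_*.jar 100
```

Swapping `--master "local[4]"` for a `spark://host:7077` or YARN master runs the same program on a cluster; tasks lost to executor failure are re-executed elsewhere, which is the fault-tolerance half of the model.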
316 public repositories match this topic.
ETH Zurich - Web Scale Data Processing and Mining Project - Runs (Shell, updated Sep 10, 2014)
End-to-End, Real-time, Advanced Analytics Big Data Reference Pipeline using Spark, Spark SQL, Spark ML, GraphX, Spark Streaming, Kafka, NiFi, Cassandra, ElasticSearch, Redis, Tachyon, HDFS, Zeppelin, iPython/Jupyter Notebook, Tableau, Twitter Algebird. See https://github.com/fluxcapacitor/pipeline/wiki for setup instructions. (Shell, updated Feb 20, 2016)
Predictive analysis using Big Data platforms and machine learning libraries (Shell, updated Aug 1, 2016)
Spark Standalone Cluster With Zookeeper (Shell, updated Jan 18, 2017)
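A standalone Spark cluster gets master fail-over by pointing its masters at a ZooKeeper ensemble. A minimal `conf/spark-env.sh` sketch, assuming a ZooKeeper quorum reachable at `zk1:2181` and `zk2:2181` (hostnames are placeholders); the recovery properties are Spark's documented standalone-mode settings:

```shell
# conf/spark-env.sh on every master node: enable ZooKeeper-based recovery.
# If the active master dies, a standby is elected and running applications
# reattach to it without losing state.
export SPARK_DAEMON_JAVA_OPTS="-Dspark.deploy.recoveryMode=ZOOKEEPER \
  -Dspark.deploy.zookeeper.url=zk1:2181,zk2:2181 \
  -Dspark.deploy.zookeeper.dir=/spark"
```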
In this repository you will find many instructions and exercises about big data, implemented with Hadoop and Spark (Shell, updated Feb 4, 2017)
Run Spark in Docker containers (Shell, updated Mar 4, 2017)
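Running Spark in a container can be as small as a single `docker run`. A sketch assuming the official `apache/spark` image from Docker Hub, whose Spark home is `/opt/spark`:

```shell
# Start an interactive local-mode Spark shell inside a throwaway container.
docker run -it --rm apache/spark \
  /opt/spark/bin/spark-shell --master "local[2]"
```

A multi-node setup would instead start one container as a standalone master and several as workers on a shared Docker network, which is what repositories like this one script.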
Local Kubernetes-based ML setup (Shell, updated Mar 5, 2017)
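For a Kubernetes-based setup, `spark-submit` can target the cluster's API server directly. A hedged sketch, assuming an API server at `https://localhost:6443` and a Spark container image tagged `my-spark:latest` (both placeholders), using Spark's documented Kubernetes properties:

```shell
# Submit the SparkPi example to Kubernetes: the driver and its two
# executors each run as pods. The local:// scheme means the jar is
# already inside the container image.
./bin/spark-submit \
  --master k8s://https://localhost:6443 \
  --deploy-mode cluster \
  --name spark-pi \
  --class org.apache.spark.examples.SparkPi \
  --conf spark.executor.instances=2 \
  --conf spark.kubernetes.container.image=my-spark:latest \
  local:///opt/spark/examples/jars/spark-examples.jar
```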
Based on the hadoop-on-docker image just built. (Shell, updated Mar 14, 2017)
Created by Matei Zaharia
Released May 26, 2014
Followers: 416
Repository: apache/spark
Website: spark.apache.org
Wikipedia