Companion to the Learning Hadoop and Learning Spark courses on LinkedIn Learning
Updated Mar 9, 2024 · HTML
Apache Spark is an open-source, general-purpose distributed cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
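The core of that interface is applying the same operation to every partition of a dataset and combining the partial results. Spark itself is not importable here, so the sketch below is only a single-machine, thread-pool analogue in plain Python of that map-over-partitions-then-combine shape; the function name and partitioning scheme are illustrative, not Spark's API.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce

def parallel_sum_of_squares(data, n_partitions=4):
    # Spark's model in miniature: split the data into partitions, apply
    # the same function to each partition in parallel, then combine the
    # partial results. Spark does this across machines, with fault
    # tolerance; this thread-pool analogue shows only the data-parallel shape.
    partitions = [data[i::n_partitions] for i in range(n_partitions)]
    with ThreadPoolExecutor() as pool:
        partials = list(pool.map(lambda part: sum(x * x for x in part),
                                 partitions))
    return reduce(lambda a, b: a + b, partials, 0)
```

In Spark the equivalent would be a `map` followed by a `reduce` on a distributed dataset; the parallelism is implicit because the user never schedules the per-partition work by hand.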
Apache Spark™ and Scala Workshops
Teaching materials for distributed statistical computing (in Chinese: "big data distributed computing teaching materials")
Big Data workshop (in Spanish)
Spark library for generalized K-means clustering. Supports general Bregman divergences; suitable for clustering probabilistic data, time series, high-dimensional data, and very large datasets.
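"Generalized" here means Lloyd's iteration with a Bregman divergence (such as KL divergence) replacing squared Euclidean distance; a useful property of Bregman divergences is that the divergence-minimizing center of a cluster is still its arithmetic mean. The repo's actual API is not shown here; this is a minimal single-machine sketch of the idea, with function names of my own choosing, using KL divergence on probability vectors.

```python
import math

def kl_divergence(p, q):
    # Kullback-Leibler divergence: the Bregman divergence generated by
    # negative entropy, defined on strictly positive probability vectors.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def bregman_kmeans(points, k, iters=20):
    # Lloyd-style iteration with a Bregman divergence in place of squared
    # Euclidean distance. The cluster update is unchanged: for any Bregman
    # divergence, the optimal center is the arithmetic mean of the members.
    centers = [list(p) for p in points[:k]]  # simple deterministic init
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k),
                          key=lambda i: kl_divergence(p, centers[i]))
            clusters[nearest].append(p)
        for i, members in enumerate(clusters):
            if members:
                centers[i] = [sum(xs) / len(members)
                              for xs in zip(*members)]
    return centers
```

For example, clustering the probability vectors `[0.9, 0.1]`, `[0.1, 0.9]`, `[0.85, 0.15]`, `[0.15, 0.85]` with `k=2` recovers one center near each of the two obvious groups.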
Toolkit for Apache Spark ML covering feature clean-up, feature-importance calculation, information-gain selection, distributed SMOTE, model selection and training, hyperparameter optimization and selection, and model interpretability.
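Of the techniques listed, SMOTE is the most self-contained: each synthetic minority sample lies on the line segment between a real minority sample and one of its k nearest minority-class neighbours. The distributed version in the toolkit is not reproduced here; the following is a plain-Python sketch of that core interpolation step, with a hypothetical function name.

```python
import math
import random

def smote_sketch(minority, n_new, k=2, seed=0):
    # SMOTE's core idea: a synthetic minority sample is a random point on
    # the segment between a real minority sample and one of its k nearest
    # minority-class neighbours. Distributed SMOTE parallelizes the
    # neighbour search; this sketch is brute force on one machine.
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        p = rng.choice(minority)
        neighbours = sorted(
            (q for q in minority if q != p),
            key=lambda q: math.dist(p, q),
        )[:k]
        q = rng.choice(neighbours)
        t = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(tuple(pi + t * (qi - pi)
                               for pi, qi in zip(p, q)))
    return synthetic
```

Because every synthetic point is a convex combination of two real minority points, the new samples stay inside the convex hull of the minority class.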
A concise resource repository for machine learning
Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra
SparkR workshop for the Jornadas de Usuarios de R (Spanish R users conference)
Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline
Spark algorithms for building k-NN graphs
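A k-NN graph connects each point to its k closest other points; the value of a Spark implementation is avoiding the O(n²) all-pairs comparison across a cluster. The repo's algorithms are not shown here; this is a brute-force single-machine sketch (function name mine) of the structure being computed.

```python
import math

def knn_graph(points, k):
    # Brute-force k-nearest-neighbour graph: node i gets a directed edge
    # to each of its k closest other points by Euclidean distance.
    # Distributed builders approximate or partition this all-pairs search;
    # the output structure is the same.
    graph = {}
    for i, p in enumerate(points):
        by_distance = sorted(
            (j for j in range(len(points)) if j != i),
            key=lambda j: math.dist(p, points[j]),
        )
        graph[i] = by_distance[:k]
    return graph
```

For two well-separated pairs of points, each node's single nearest neighbour is its partner in the pair.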
Kaggle's Predict Future Sales competition project (TOP 15 solution as of March 2020)
Lecture: Big Data
Ansible roles to deploy Kubernetes, JupyterHub, Jupyter Enterprise Gateway, and Spark on a Kubernetes cluster
Adds a notification panel to your Laravel Spark Kiosk, allowing you to send notifications to users.
A Spark-based movie recommendation system
Created by Matei Zaharia
Released May 26, 2014