Apache Spark

Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Apache Spark

Here are 237 public repositories matching this topic...

derrickburns / generalized-kmeans-clustering

jaceklaskowski / spark-workshop

databrickslabs / automl-toolkit

lynnlangit / learning-hadoop-and-spark

innat / ML-Resource

feng-li / Distributed-Statistical-Computing

rogaha / data-processing-pipeline

tikal-fuseday / delta-architecture

tdebatty / spark-knn-graphs

lresende / ansible-kubernetes-cluster

jukiewiczm / kaggle-predict-future-sales

vmitchell85 / spark-kiosk-notify

arjones / bigdata-workshop-es

netease-bigdata / ne-spark-courseware

shaojunying / Software-engineering-discipline-online-learning-platform-based-on-knowledge-map

korolmi / dataeng

marcelmittelstaedt / BigData

rvilla87 / TwitterTrends

getyourguide / DDataFlow

manuparra / taller_SparkR

Related Topics