Building a best-practice Apache Spark working environment for robust data pipelines
Updated Apr 1, 2023 - Python
Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
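The data-parallel model Spark provides can be illustrated without a cluster. The sketch below is a minimal pure-Python analogue of a map/reduce word count; the input list and variable names are hypothetical, and in real Spark the lines would live in an RDD or DataFrame partitioned across workers.

```python
from collections import Counter
from functools import reduce

# Hypothetical input: in Spark these lines would be an RDD partitioned
# across the cluster; plain Python shows the same map/reduce flow.
lines = ["spark makes pipelines", "pipelines need spark"]

# "map" step: count words per line (done per-partition in Spark).
mapped = [Counter(line.split()) for line in lines]

# "reduce" step: merge the partial counts (a shuffle plus reduce in Spark).
word_counts = reduce(lambda a, b: a + b, mapped)

print(word_counts["spark"])  # total occurrences across all lines
```

Because each map step touches only its own line and the reduce is associative, Spark can run the same pattern on many machines with fault tolerance.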
Real-time analysis pipeline
A forecasting project based on Apache Spark and implemented with a Naive Bayes classifier.
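The project's own code isn't shown here; as a rough sketch of the model it names, below is a from-scratch Naive Bayes classifier with add-one smoothing. The toy weather-to-demand data and function names are illustrative assumptions, not the project's API.

```python
from collections import defaultdict
import math

def train_nb(samples):
    """Fit class priors and per-class feature counts from (features, label) pairs."""
    class_counts = defaultdict(int)
    feature_counts = defaultdict(lambda: defaultdict(int))
    vocab = set()
    for features, label in samples:
        class_counts[label] += 1
        for f in features:
            feature_counts[label][f] += 1
            vocab.add(f)
    return class_counts, feature_counts, vocab, sum(class_counts.values())

def predict_nb(model, features):
    """Pick the label maximizing log P(label) + sum of log P(feature | label)."""
    class_counts, feature_counts, vocab, total = model
    best_label, best_score = None, float("-inf")
    for label, count in class_counts.items():
        score = math.log(count / total)
        denom = sum(feature_counts[label].values()) + len(vocab)  # Laplace smoothing
        for f in features:
            score += math.log((feature_counts[label][f] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Hypothetical toy data: weather features -> demand label.
data = [(["sunny", "warm"], "high"), (["rainy", "cold"], "low"),
        (["sunny", "cold"], "high"), (["rainy", "warm"], "low")]
model = train_nb(data)
print(predict_nb(model, ["sunny", "warm"]))  # -> "high"
```

At Spark scale the same counting step is a distributed aggregation, which is why Naive Bayes parallelizes well.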
Scalable Book Recommender System - Apache Spark ML Lib
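Spark MLlib's recommender is typically ALS matrix factorization; as a compact stand-in for the collaborative-filtering idea, here is a pure-Python user-based cosine-similarity sketch. The rating dictionary and book titles are made-up examples, not data from the project.

```python
import math

# Hypothetical user -> book -> rating data; MLlib would fit ALS on a DataFrame.
ratings = {
    "alice": {"dune": 5, "hobbit": 4},
    "bob":   {"dune": 4, "hobbit": 5, "neuromancer": 3},
    "carol": {"hobbit": 2, "neuromancer": 5},
}

def cosine(u, v):
    """Cosine similarity between two users' rating vectors."""
    common = set(u) & set(v)
    if not common:
        return 0.0
    dot = sum(u[b] * v[b] for b in common)
    nu = math.sqrt(sum(x * x for x in u.values()))
    nv = math.sqrt(sum(x * x for x in v.values()))
    return dot / (nu * nv)

def recommend(user, k=1):
    """Score unseen books by similarity-weighted ratings from other users."""
    scores = {}
    for other, their in ratings.items():
        if other == user:
            continue
        sim = cosine(ratings[user], their)
        for book, r in their.items():
            if book not in ratings[user]:
                scores[book] = scores.get(book, 0.0) + sim * r
    return sorted(scores, key=scores.get, reverse=True)[:k]

print(recommend("alice"))  # alice's top unseen book
```

ALS scales the same intuition to millions of users by learning low-rank user and item factors instead of comparing raw rating vectors.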
Heart disease classification with data mining (Zeppelin notebook)
Data Lake with Spark
ML model deployment app I contributed to via MLH Fellowship
Data Engineering Capstone Project: ETL pipelines and data warehouse development, including a business challenge that requires building a data platform for retailer data analytics.
Real-time user streaming data pipeline
Pinterest's experiment analytics data pipeline, which runs thousands of experiments per day and crunches billions of data points to provide valuable insights to improve the product.
My Apache Spark images for a wide set of applications
Example project implementing best practices and testing for PySpark data pipelines.
Graph coloring example using GraphFrames of Apache Spark framework
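GraphFrames itself isn't reproduced here; as a sketch of the underlying problem, below is a plain-Python greedy graph coloring over an edge list. The edge list and function name are illustrative assumptions, and greedy coloring is a heuristic rather than GraphFrames' distributed approach.

```python
def greedy_coloring(edges):
    """Give each vertex the smallest color unused by its already-colored neighbors."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    colors = {}
    for vertex in sorted(adj):  # fixed visiting order keeps the result deterministic
        used = {colors[n] for n in adj[vertex] if n in colors}
        color = 0
        while color in used:
            color += 1
        colors[vertex] = color
    return colors

# A 4-cycle is 2-colorable: opposite corners can share a color.
print(greedy_coloring([("a", "b"), ("b", "c"), ("c", "d"), ("d", "a")]))
```

A distributed version has to resolve coloring conflicts between partitions, which is where Spark's iterative message passing comes in.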
This project demonstrates the workflow of a Data Engineer. It utilizes the Google Cloud Platform and Google Colab as the main tools.
Big Data technologies are software tools for analyzing, processing, and extracting value from data sets so large and complex that traditional data-management tools cannot handle them.
Created by Matei Zaharia
Released May 26, 2014