Apache Spark
Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Here are 122 public repositories matching this topic...
spark cluster images optimized for Kubernetes deployed in AWS and Jupyter Dockerstacks images
-
Updated
Sep 12, 2018 - Dockerfile
Personal Dockerfile for Machine Learning and BigData processing
-
Updated
Oct 29, 2018 - Dockerfile
Docker Compose setup for PySpark
-
Updated
Dec 1, 2018 - Dockerfile
Spark Dockerfile setup with native support for Nomad as a scheduler
-
Updated
Jan 25, 2019 - Dockerfile
Lightweight Docker image for Apache Spark based on Alpine Linux.
-
Updated
Feb 1, 2019 - Dockerfile
This Repo is in connection to the IMPRO-Spark-Docker repo developed keep that in mind
-
Updated
Feb 13, 2019 - Dockerfile
Customized PySpark Docker image with R support
-
Updated
Mar 8, 2019 - Dockerfile
docker spark standalone
-
Updated
Jul 8, 2019 - Dockerfile
Kubernetes friendly Spark images with dotnet for working with GoogleCloudPlatform/spark-on-k8s-operator
-
Updated
Sep 8, 2019 - Dockerfile
Created by Matei Zaharia
Released May 26, 2014
- Followers
- 416 followers
- Repository
- apache/spark
- Website
- spark.apache.org
- Wikipedia
- Wikipedia