pyspark
Here are 33 public repositories matching this topic...
Learn Apache Spark Python (PySpark) Create your own Cluster and Insert data into the PostgreSQL Database. ✨
-
Updated
Mar 14, 2024 - Dockerfile
Docker Compose environment for big data research and machine learning development
-
Updated
Feb 15, 2024 - Dockerfile
Dockerized Environment for developing Geospatial applications in Python using Apache Spark, Apache Sedona and Delta Lake.
-
Updated
Nov 17, 2023 - Dockerfile
Default Docker image used to run experiments on csquare.run.
-
Updated
Mar 6, 2023 - Dockerfile
A framework to automate and manage Spark jobs on Kubernetes in a Google Cloud Platform environment.
-
Updated
Dec 8, 2022 - Dockerfile
Container-based inner loop development environment for Databricks
-
Updated
Nov 11, 2022 - Dockerfile
Using python3.6 alpine base image adds java,pandas, numpy,pyspark and spark as rundeps. This image can be used as container image when you run spark-submit on k8.
-
Updated
Nov 11, 2022 - Dockerfile
PySpark in Docker Containers
-
Updated
Jun 22, 2022 - Dockerfile
-
Updated
Feb 27, 2022 - Dockerfile
Docker images for spark on kubernetes
-
Updated
Oct 24, 2021 - Dockerfile
Improve this page
Add a description, image, and links to the pyspark topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the pyspark topic, visit your repo's landing page and select "manage topics."