hadoop
Here are 75 public repositories matching this topic...
Spyrk-cluster is a data mini-lab, considering the main technologies used these days. It's useful to either understand how to configure a cluster, or just to take it for granted to use for testing with submit or interactive jobs.
-
Updated
Apr 7, 2021 - Dockerfile
Helm chart for Apache Hadoop using multi-arch docker images
-
Updated
Mar 21, 2022 - Dockerfile
Standalone Spark setup with Hadoop and Hive leveraged on docker containers.
-
Updated
Dec 16, 2021 - Dockerfile
Base hadoop/spark/bigdata image with advanced config loading scripts.
-
Updated
Nov 3, 2020 - Dockerfile
Apache Hive Metastore in Standalone Mode With Docker
-
Updated
Jul 22, 2024 - Dockerfile
Dockerfile for running Apache Knox (http://knox.apache.org/) in Docker
-
Updated
Mar 21, 2022 - Dockerfile
Docker Compose environment for big data research and machine learning development
-
Updated
Feb 15, 2024 - Dockerfile
Scalable Mahout Docker image with built-in Hadoop works at Docker and Kubernetes
-
Updated
Nov 4, 2020 - Dockerfile
A simple Big data stack with Docker
-
Updated
Apr 24, 2021 - Dockerfile
A Docker image containing necessary tools for Big Data
-
Updated
Apr 2, 2024 - Dockerfile
Improve this page
Add a description, image, and links to the hadoop topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hadoop topic, visit your repo's landing page and select "manage topics."