hadoop-cluster
Here are 45 public repositories matching this topic...
Hadoop cluster on Docker (single host)
Updated Aug 3, 2018 - Shell
Updated Aug 25, 2019 - Shell
hadoop-tool-dockerfile
Updated May 7, 2019 - Shell
This is a self-documented record of learning distributed data storage, parallel processing, and Linux using Apache Hadoop, Apache Spark, and Raspbian OS. In this project, a 3-node cluster is set up on Raspberry Pi 4 boards, HDFS is installed, and Spark processing jobs are run via YARN.
Updated Jul 13, 2024 - Shell
Ansible scripts for setting up Multi-Cluster Hadoop
Updated Aug 19, 2022 - Shell
This project simulates and configures a distributed file system using Hadoop HDFS. Three machines were created for it: 1 master node and 2 worker nodes.
Updated Jun 17, 2024 - Shell
A Bash script to set up Apache Hadoop and Apache Hive with a Derby database on Debian GNU/Linux
Updated Dec 7, 2022 - Shell
Python Fabric automation framework for Hadoop 2.7.3 cluster deployment on Ubuntu Linux
Updated Aug 9, 2017 - Shell
Batch data processing on a dockerized Hadoop cluster
Updated Jul 9, 2021 - Shell
Guide to installing Hadoop and Spark on an Oracle virtual machine.
Updated Mar 20, 2024 - Shell
Hadoop cluster with docker-compose
Updated Sep 2, 2017 - Shell
I built my own mini analysis lab to practice and learn something new: an environment that I can use remotely from any device (with RealVNC), plus an install script to rebuild it quickly at any time. I hope it will be useful for others who want to build their own.
Updated Jan 8, 2022 - Shell