A lightweight, single-node Hadoop 3.4.2 cluster. This image includes NameNode and DataNode services.
To build the Docker image and start the cluster:
chmod +x build-hadoop-docker.sh
./build-hadoop-docker.sh
./start-hadoop.sh
The Hadoop cluster will be running once the start script completes.
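
To confirm the cluster actually came up, something like the following helper may be useful. It is only a sketch: `hadoop_status` is a hypothetical function name, and the container name `hadoop-cluster` is assumed from the `docker exec` command in this README rather than read from the scripts themselves.

```shell
# Sketch: show the container status and its most recent log lines.
# Assumption: the container is named hadoop-cluster.
hadoop_status() {
  # List matching containers with their current status
  docker ps --filter "name=hadoop-cluster" --format '{{.Names}}: {{.Status}}'
  # Print the last 20 log lines to spot startup errors
  docker logs --tail 20 hadoop-cluster
}
```

Run `hadoop_status` after the start script returns. If the first command prints nothing, the container is either not running or uses a different name.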
A docker-compose-hadoop.yml file is also provided:
docker-compose -f docker-compose-hadoop.yml up -d --build

HDFS data is persisted using a Docker volume named hadoop_data. The container
stores data at /hadoop.
- NameNode data: /hadoop/namenode
- DataNode data: /hadoop/datanode
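
The storage layout above can be checked from the host with a small helper along these lines. This is a sketch under assumptions: `check_hadoop_storage` is a hypothetical name, the container is assumed to be called `hadoop-cluster`, and the volume is matched by a name filter because Docker Compose may prefix volume names with the project name.

```shell
# Sketch: confirm the volume exists and the HDFS directories are present.
# Assumptions: container name hadoop-cluster; volume name contains hadoop_data.
check_hadoop_storage() {
  # The name filter matches substrings, so a Compose-prefixed volume also shows up
  docker volume ls --filter "name=hadoop_data"
  # List the NameNode and DataNode directories inside the container
  docker exec hadoop-cluster ls -ld /hadoop/namenode /hadoop/datanode
}
```

Calling `check_hadoop_storage` should list one volume and both directories. An empty volume list suggests the data is not being persisted outside the container.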
To open a shell in the running container:
docker exec -it hadoop-cluster bash
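
For a quick non-interactive check that HDFS accepts reads and writes, a smoke test along these lines may help. It is a sketch, not part of the provided scripts: `hdfs_smoke_test` and the `/tmp/smoke` path are hypothetical, and it assumes the `hdfs` command is on the `PATH` for `docker exec` sessions in this image.

```shell
# Sketch: write a small file to HDFS and read it back.
# Assumptions: container name hadoop-cluster; hdfs is on PATH inside the container.
hdfs_smoke_test() {
  # Report cluster capacity and live DataNodes
  docker exec hadoop-cluster hdfs dfsadmin -report &&
  # Create a scratch directory in HDFS (no error if it already exists)
  docker exec hadoop-cluster hdfs dfs -mkdir -p /tmp/smoke &&
  # Create a local file in the container and upload it, overwriting any old copy
  docker exec hadoop-cluster sh -c 'echo hello > /tmp/hello.txt && hdfs dfs -put -f /tmp/hello.txt /tmp/smoke/' &&
  # Read the file back from HDFS
  docker exec hadoop-cluster hdfs dfs -cat /tmp/smoke/hello.txt
}
```

The steps are chained with `&&` so the function stops at the first failing command, which makes it easier to see whether the problem is the NameNode, the DataNode, or the write path.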