This repository automatically creates a multi-node Hadoop cluster inside Docker containers.
- First, you create a Docker image that contains an installed Hadoop and its supporting environment.
- Next, you create a master container that holds the master configuration.
- Finally, you create as many slave containers as the number you specify.
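As a rough illustration, the three steps above might map to Docker commands like the ones printed below. This is only a sketch and not the contents of run.sh: the image name `hadoop-base` and the container names `hadoop-master` and `hadoop-slaveN` are hypothetical, and the function only prints the commands instead of executing them.

```shell
# Sketch only: prints the Docker commands that the three steps could map to.
# "hadoop-base", "hadoop-master" and "hadoop-slaveN" are hypothetical names,
# not taken from this repository.
plan_cluster() {
  slave_count="$1"
  # Step 1: build the image that contains Hadoop
  echo "docker build -t hadoop-base ."
  # Step 2: start the master container
  echo "docker run -d --name hadoop-master -h hadoop-master hadoop-base"
  # Step 3: start the requested number of slave containers
  i=1
  while [ "$i" -le "$slave_count" ]; do
    echo "docker run -d --name hadoop-slave$i -h hadoop-slave$i hadoop-base"
    i=$((i + 1))
  done
}

plan_cluster 2
```

In practice run.sh performs these steps for you, so you do not need to run them by hand.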
The easiest way to install this Docker image is to clone the latest version from the Git repository and run the commands below:
$ git clone https://github.com/lvntyldz/docker-multinode-hadoop.git
$ cd docker-multinode-hadoop
$ ./run.sh <SLAVE_CONTAINER_COUNT>
For example, to create one master container and 2 slave containers attached to that master, run the commands below:
$ git clone https://github.com/lvntyldz/docker-multinode-hadoop.git
$ cd docker-multinode-hadoop
$ ./run.sh 2
Follow the steps below to run the word count example, the "hello world" of the Hadoop community.
$ cd codeSamples/
This script counts the words in a text file that contains a summary of A.SANCAR's life.
$ ./src/run/wc/run.sh
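To get a feel for what a word count job computes, the same result can be approximated locally with standard Unix tools. This is a minimal sketch and not the script shipped in this repository: `count_words` is a hypothetical helper, and the pipeline only mimics the map (split into words), shuffle (sort) and reduce (count) phases on a single machine.

```shell
# Local illustration only, not the Hadoop job itself.
# Reads text on stdin and prints "word<TAB>count", one word per line.
count_words() {
  tr -s '[:space:]' '\n' |          # "map": one word per line
    grep -v '^$' |                  # drop empty lines
    sort |                          # "shuffle": group identical words
    uniq -c |                       # "reduce": count each group
    awk '{print $2 "\t" $1}'        # reorder to word, then count
}

printf 'hello hadoop\nhello docker\n' | count_words
```

The Hadoop version distributes the same idea across the slave containers, which only pays off for inputs much larger than this sample.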
Visit these URLs in a local browser to inspect the cluster:
- http://localhost:50070/dfshealth.html#tab-datanode (HDFS DataNodes)
- http://localhost:8088/cluster/nodes (YARN nodes)
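The two web pages can also be probed from the command line, which is handy on a machine without a browser. The sketch below assumes `curl` is installed; `check_ui` is a hypothetical helper that reports only whether the page answered successfully, not whether every node is healthy.

```shell
# Sketch only: prints "up" if the URL answers with a successful HTTP status,
# "down" otherwise. Assumes curl is available on the host.
check_ui() {
  # -s: silent, -f: treat HTTP errors as failure,
  # --max-time: do not hang if nothing is listening
  if curl -sf -o /dev/null --max-time 5 "$1"; then
    echo "up"
  else
    echo "down"
  fi
}

check_ui "http://localhost:50070/dfshealth.html"
check_ui "http://localhost:8088/cluster/nodes"
```

If either line prints "down" shortly after starting the cluster, the services may still be starting, so it is worth retrying after a short wait.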