No description, website, or topics provided.
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.

A simple Apache Hadoop 2.7.3 Docker image

This is a very simple and not optimized Docker image containing Apache Hadoop.

The main purpose of this container is to help who wants to start using Apache Hadoop and does not want to spend too much time for the initial installation and configuration process. As you will notice, the container is not optimized. However, we provide the Dockerfile and strongly encourage whoever wants to improve this Docker image.

Build the image

You can build your own image, using the Dockerfile. Just run the following command:

docker build  -t matnar/hadoop:2.7.3 .

Pull the image

This image is also released as an official Docker image from Docker's automated build repository - you can always pull or refer the image when launching containers.

docker pull matnar/hadoop

Start a container

In order to use the Docker image you have just build or pulled use:

docker run -t -i -p 50070:50070 --name=master matnar/hadoop 

Create an isolated network with several datanodes

docker network create --driver bridge hadoop_network

docker run -t -i -p 50075:50075 -d --network=hadoop_network --name=slave1 matnar/hadoop 
docker run -t -i -p 50076:50075 -d --network=hadoop_network --name=slave2 matnar/hadoop 
docker run -t -i -p 50077:50075 -d --network=hadoop_network --name=slave3 matnar/hadoop 
docker run -t -i -p 50070:50070 --network=hadoop_network --name=master matnar/hadoop