Skip to content

Hadoop cluster setup and related helper files for learning purpose.

Notifications You must be signed in to change notification settings

srinivasanHadoop/hadoop-env

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

52 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Hadoop cluster for experiments

Easy way to setup the hadoop in cluster mode and playwith the different tools over the hadoop ecosystem.

Currently docker based cluster setup is added.

Start hadoop cluster

Launch 3 node hadoop cluster with two datanode and one namenode. The Resource manager will be running on nodemanager docker container itself. And the node manager will be running on all the datanode containers.

Bellow shell script make use of the docker commands to start and manage the configurations of hadoop cluster.

./bin/start-hadoop.sh

Start spark with Yarn

./bin/start-spark.sh

Cleanup entire cluster.

Once you are done with the testing on the cluster, just destroy it by running this command.

bash ./bin/clean-all.sh

This command will clean all the containers started along with the hadoop cluster and also other hadoop related servicse like spark, drill etc.

About

Hadoop cluster setup and related helper files for learning purpose.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Shell 91.5%
  • Dockerfile 8.5%