GitHub - chuxi/fabcluster: fabric deploy hadoop, hbase, zookeeper, kafka and spark

fabcluster is a python program for spark (including hadoop, hbase, kafka, zookeeper) cluster deploying and basic management.

It uses fabric package as the deploy and manage tool. You can use it to setup basic configuration in a cluster, including hosts, disable firewall, create a new user and set all the following deploying and management under the new-user-role's control.

Before you start reading and using my program, I advice you reading some tutorials about python and fabric. ( Before I wrote this program, actually I know nothing about python and fabric. = =| )

So here is two tutorials I think which are helpful for you to understand my program. I ever wrote some samples for my studying and tried some easy commands in python console, so I just suggest you writing something when you are learning.

Liao Xuefeng's tutorial (Chinese~~) : http://www.liaoxuefeng.com/wiki

Fabric Tutorial : http://docs.fabfile.org

Preparation:

Set up your your nodes ip in clusters.

 clusters = ['10.214.208.11', '10.214.208.12', '10.214.208.13', '10.214.208.14']

Set your common user name and password (which user should have the sudo right)
```
 env.user = 'king'       
 env.password = 'your password'
```
the env.keywords coordinates the files in tars, finally the structure should like following pic. You can remove or add your own keywords and implements the installation by yourself. It is not difficult because I have hoed out most bash shell problems in my code.
```
 env.files = os.listdir('./tars')
 env.keywords = [filter(str.isalpha, i.split('.')[0]) for i in env.files]
```

One of the most important thing is to set up your sub-clusters of each role in env.roledefs. You can change some settings here.

 env.roledefs = {
     'clusters': clusters,
     'hadoop_master': clusters[:1],
     'hadoop_smaster': clusters[1:2],
     'hadoop_slaves': clusters,
     'zookeeper': clusters,
     'hbase': clusters,
     'hbase_master': clusters[0:1],
     'hbase_slaves': clusters[1:],
     'kafka': clusters,
     'spark': clusters,
     'spark_master': clusters[:1],
     'spark_slaves': clusters
 }

Set your new user's name, group, password. I believe it is reasonable to use one user for the cluster's every working module.
```
 newgroup = 'hadoop'
 newuser = 'hadoop'
 newpasswd = 'hadoop'
```

Then I record some words about the usage of my program. It is very easy if you understand the fabric principles.

    fab installs

The command upside will run the whole installation. It starts from basic settings, including hosts, new user, ssh no passwords, disable Firewall, set NTP. Then completes JDK installation, Hadoop, Zookeeper, HBase, Kafka, Spark.

    fab installs:hbase

It will install HBase, including all basic installations, and Hadoop, Zookeeper. Because HBase is running on HDFS and Zookeeper.

    fab installs:kafka
    fab installs:spark

This two just install singular kafka and spark.

And I set up some interfaces about starts, stops and cleans work.

    fab starts
    fab starts:hadoop
    fab starts:hbase
    
    fab stops
    
    fab cleans

In fact, I think you will write your own deploy program. May my program help.

Now you must run commands separately, like installs:hadoop, installs:zookeeper, installs:hbase, installs:kafka, installs:spark. And start all parts one by one: starts:hadoop (Be careful reformat and clean function), starts:zookeeper, starts:hbase, starts:kafka, starts:spark

Thank you.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
test		test
.gitignore		.gitignore
fabfile.py		fabfile.py
readme.md		readme.md
view.png		view.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

chuxi/fabcluster

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages