Skip to content

alokjani/bigdata-vagrant-devlab

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Big Data Development Lab

Introduction

If you are looking to setup Hadoop for local development, you are in the right place :)

This repository will help you create AIO Hadoop VM on you local machine with entire Hadoop Ecosystem.

  • Hadoop 2.7.2
  • Hive 1.2.1
  • Pig 0.17.0
  • Flume 1.7.0
  • Sqoop 1.4.6
  • Zeppelin 0.8.0

Installation

The setup has been tested with Vagrant & Cent7

vagrant box add centos/7
vagrant plugin install vagrant-vbguest
vagrant up
vagrant ssh

Hadoop

To setup hadoop (HDFS & YARN), run the following as root

/vagrant/scripts/setup_hadoop.sh

To start/stop hadoop

/vagrant/scripts/start-hadoop.sh
/vagrant/scripts/stop-hadoop.sh

To access the HDFS CLI

/usr/local/hadoop/bin/hdfs dfs -ls /

Hive

To setup Hive, run the following as root

/vagrant/scripts/setup_hive.sh

To start hive

/vagrant/scripts/start-hive.sh

To access the hive CLI

/usr/local/hive/bin/hive

Pig

To setup Pig, run the following as root

/vagrant/scripts/setup_pig.sh ⏎

To access the Pig CLI

/usr/local/pig/bin/pig ⏎

Flume

To setup Flume, run the following as root

/vagrant/scripts/setup_flume.sh

Sqoop

To setup Sqoop, run the following as root

/vagrant/scripts/setup_sqoop.sh   
/vagrant/scripts/setup_mysql.sh    

Zeppelin

To setup Zeppelin, run the following as root

/vagrant/scripts/setup_zepplin.sh

To start Zeppelin,

/root/zeppelin/bin/zeppelin-daemon.sh start

Zeppelin interpreters can be added into /root/zeppelin/interpreter directory. To add MySQL interpreter, go to UI > create interpreter (like "mysql") and select the “jdbc” interpreter group. From here you will be able to configure your MySQL JDBC connection.

Here are needed settings:

default.driver = com.mysql.jdbc.Driver default.user = (username used to login) default.pw = (password used to login) default.url = jdbc:mysql://localhost:3306/ (host and port used to connect to MySQL)

In the “Dependencies” section, you must specify the artifact of the MySQL Connector JAR that we previously downloaded. We downloaded version 5.1.40 into /root/zeppelin/interpreter/mysql,so the artifact is mysql:mysql-connector-java:5.1.40.

Web Interfaces

TODO

  • Solr & Lucene
  • Mahout
  • Storm
  • HBase
  • Spark
  • Drill
  • Tez
  • Oozie

References

This lab based on the following excellent works:

Releases

No releases published

Packages

No packages published

Languages