To start an HDFS/Spark Workbench:
./start-hadoop-spark-workbench-with-Hive.sh
This will start the following services:

- namenode
- datanode1
- datanode2
- hive-metastore-postgresql (the PostgreSQL database backing the Hive metastore)
- hive-server
- hive-metastore
- hue (shut this down if not required)
- spark-master
- spark-worker1 to spark-worker4
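To confirm that everything came up, you can list the container states with docker-compose's standard `ps` command (a minimal check, assuming the same compose file used below):

```
docker-compose -f docker-compose-hive.yml ps
```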
Alternatively, start the services step by step, running each of the following lines in its own terminal:
docker-compose -f docker-compose-hive.yml up namenode hive-metastore-postgresql
docker-compose -f docker-compose-hive.yml up datanode hive-metastore
docker-compose -f docker-compose-hive.yml up hive-server
docker-compose -f docker-compose-hive.yml up spark-master spark-worker1 spark-worker2 spark-worker3 spark-worker4 hue
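If you would rather not keep a terminal open per service group, the same services can be started detached and their logs followed on demand; this is a sketch using standard docker-compose flags:

```
# Start everything in the background.
docker-compose -f docker-compose-hive.yml up -d

# Follow the logs of a single service, e.g. the namenode.
docker-compose -f docker-compose-hive.yml logs -f namenode
```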
Once the services are up, the following web interfaces are exposed:

- Namenode: http://localhost:50070
- Datanode: http://localhost:50075
- Spark-master: http://localhost:8080
- Hue (HDFS Filebrowser): http://localhost:8088/home
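A quick way to check that each UI answers is to probe the endpoints above with curl (a minimal sketch; it only tests that the HTTP servers respond):

```
for url in http://localhost:50070 http://localhost:50075 \
           http://localhost:8080 http://localhost:8088/home; do
  # -s silences output, -f treats HTTP errors (4xx/5xx) as failures
  curl -sf -o /dev/null "$url" && echo "OK   $url" || echo "DOWN $url"
done
```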
Load data into Hive:
$ docker-compose -f docker-compose-hive.yml exec hive-server bash
# /opt/hive/bin/beeline -u jdbc:hive2://localhost:10000
> CREATE TABLE pokes (foo INT, bar STRING);
> LOAD DATA LOCAL INPATH '/opt/hive/examples/files/kv1.txt' OVERWRITE INTO TABLE pokes;
> SELECT * FROM pokes;
> DESCRIBE EXTENDED pokes;
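The same session can also be scripted non-interactively with beeline's `-e` option, which is handy for smoke tests (a sketch reusing the connection string above):

```
docker-compose -f docker-compose-hive.yml exec hive-server \
  /opt/hive/bin/beeline -u jdbc:hive2://localhost:10000 \
  -e "SELECT COUNT(*) FROM pokes;"
```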
Alternatively, run the whole example with:

make example
To remove the created data, run:
make clean-example
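For reference, here is a hypothetical sketch of what these two Makefile targets might wrap; the real targets live in the repository's Makefile and may differ:

```
# Recipe lines must be indented with tabs.
# `exec -T` disables TTY allocation, which make does not provide.
example:
	docker-compose -f docker-compose-hive.yml exec -T hive-server \
		/opt/hive/bin/beeline -u jdbc:hive2://localhost:10000 \
		-e "CREATE TABLE IF NOT EXISTS pokes (foo INT, bar STRING);" \
		-e "LOAD DATA LOCAL INPATH '/opt/hive/examples/files/kv1.txt' OVERWRITE INTO TABLE pokes;"

clean-example:
	docker-compose -f docker-compose-hive.yml exec -T hive-server \
		/opt/hive/bin/beeline -u jdbc:hive2://localhost:10000 \
		-e "DROP TABLE IF EXISTS pokes;"
```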
Contact:

- priyanchandrapala at yahoo.co.uk