A debian:jessie based Spark + Zeppelin Docker container.
Run the Zeppelin container in Spark local mode, or in a Spark cluster.
Pull the image and run the container:
docker pull dylanmei/zeppelin:latest
docker run --name zeppelin -p 8080:8080 -p 8081:8081 dylanmei/zeppelin:latest
Zeppelin will be running at http://${YOUR_DOCKER_HOST}:8080.
Optionally, use a Docker volume to map the local ./data directory to the container's /zeppelin/data directory used in the Zeppelin banks.csv tutorial:
docker run --name zeppelin -v `pwd`/data:/zeppelin/data -p 8080:8080 -p 8081:8081 dylanmei/zeppelin:latest
By default, Zeppelin uses ports 8080-8081. So do a lot of other things, including the Spark Master UI. Change the ports Zeppelin uses by passing -e "ZEPPELIN_PORT=8090" to docker run. For example:
docker run --name zeppelin -e "ZEPPELIN_PORT=8090" -p 8090:8090 -p 8091:8091 dylanmei/zeppelin:latest
Create a standalone cluster with docker-compose:
docker-compose up
The Spark Master UI will be running at http://${YOUR_DOCKER_HOST}:8080 and Zeppelin will be running at http://${YOUR_DOCKER_HOST}:8090. Zeppelin and the Spark Worker both mount the ./data directory as a volume and share its contents.
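For orientation, a docker-compose.yml along these lines could wire the pieces together. This is only a sketch under assumptions: the Spark image, service names, and the spark://master:7077 URL are illustrative, not taken from this repository; the docker-compose.yml shipped with the project is authoritative.

```yaml
version: "2"
services:
  master:
    image: gettyimages/spark          # assumed Spark image; substitute your own
    command: bin/spark-class org.apache.spark.deploy.master.Master
    ports:
      - "8080:8080"                   # Spark Master UI
  worker:
    image: gettyimages/spark          # assumed Spark image
    command: bin/spark-class org.apache.spark.deploy.worker.Worker spark://master:7077
    volumes:
      - ./data:/data                  # shared with Zeppelin
  zeppelin:
    image: dylanmei/zeppelin
    environment:
      ZEPPELIN_PORT: 8090
      MASTER: spark://master:7077     # assumed variable name for the Spark master URL
    ports:
      - "8090:8090"                   # Zeppelin UI
    volumes:
      - ./data:/zeppelin/data         # shared with the Spark worker
```

The key design point is that both the worker and Zeppelin mount the same host directory, so a file written by a notebook is visible to Spark executors and vice versa.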
Forking this project to change Spark/Hadoop versions is unnecessary! Instead, create a Dockerfile based on dylanmei/zeppelin:master and supply a new, executable install.sh file in the same directory. It will override the base one via Docker's ONBUILD instruction.
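The ONBUILD mechanism defers instructions in the base image until a child image is built FROM it, which is what lets your local install.sh replace the base one. A sketch of what the relevant lines in the base image's Dockerfile might look like (assumed for illustration; the install.sh destination path here is hypothetical and the actual base Dockerfile may differ):

```dockerfile
# In the base image (dylanmei/zeppelin:master):
# these fire at build time of any image that does `FROM dylanmei/zeppelin:master`,
# copying the child build context's install.sh over the base one and running it.
ONBUILD COPY install.sh /usr/local/zeppelin-install.sh
ONBUILD RUN /usr/local/zeppelin-install.sh
```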
The steps, expressed here as a script, can be as simple as:
#!/bin/bash

cat > ./Dockerfile <<DOCKERFILE
FROM dylanmei/zeppelin:master
ENV ZEPPELIN_MEM="-Xmx1024m"
DOCKERFILE

# Quote the heredoc delimiter so the backslash line-continuations
# are written to install.sh literally rather than joined by the shell.
cat > ./install.sh <<'INSTALL'
git pull
mvn clean package -DskipTests \
  -Pspark-1.2 \
  -Dspark.version=1.2.1 \
  -Phadoop-2.2 \
  -Dhadoop.version=2.0.0-cdh4.2.0 \
  -Pyarn
INSTALL
chmod +x ./install.sh

docker build -t my_zeppelin .
MIT