forked from t3g7/deployer

Deploy Apache Spark & Cassandra in a Docker Swarm cluster

dbrambilla/deployer

Spark Cassandra cluster deployment scripts

Apache Spark in a Docker container, based on the java:8 image and heavily inspired by gettyimages/docker-spark. The cluster is deployed with Docker Compose and orchestrated with Docker Swarm.

The official Cassandra image is used, and Cassandra runs on the same nodes as the Spark slaves.

Run the cluster locally on VirtualBox

In scripts/, run:

./bootstrap.sh

This script will set up a key-value store using Consul, a Swarm cluster with a master and a slave, and an overlay network for multi-host networking.
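
The steps listed above can be sketched roughly as follows. This is not the actual content of bootstrap.sh: the machine names `consul` and `swarm-node-1`, the network name `spark-net`, and the `progrium/consul` image are assumptions for illustration (only `swarm-master` appears in this README). The sketch defaults to a dry run that prints the commands instead of executing them.

```shell
#!/usr/bin/env bash
# Sketch of a Consul + Swarm + overlay bootstrap with docker-machine.
# Names marked "assumed" are illustrative and not taken from bootstrap.sh.

DRY_RUN="${DRY_RUN:-1}"        # 1 = only print the commands, 0 = execute them
KV_MACHINE="consul"            # assumed name of the key-value store VM
NODE_MACHINE="swarm-node-1"    # assumed name of the Swarm slave VM
NETWORK="spark-net"            # assumed overlay network name

# Print the command line in dry-run mode, otherwise evaluate it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    printf '%s\n' "$1"
  else
    eval "$1"
  fi
}

# Address of the Consul store; the $(...) stays unexpanded until execution.
KV_URL="consul://\$(docker-machine ip $KV_MACHINE):8500"

# Swarm and engine options shared by the master and the slave.
SWARM_OPTS="--swarm --swarm-discovery=\"$KV_URL\" --engine-opt=\"cluster-store=$KV_URL\" --engine-opt=\"cluster-advertise=eth1:2376\""

bootstrap() {
  # 1. VM hosting the key-value store, with Consul running in a container
  run "docker-machine create -d virtualbox $KV_MACHINE"
  run "docker \$(docker-machine config $KV_MACHINE) run -d -p 8500:8500 -h consul progrium/consul -server -bootstrap"
  # 2. Swarm master and one slave, both registered in Consul
  run "docker-machine create -d virtualbox $SWARM_OPTS --swarm-master swarm-master"
  run "docker-machine create -d virtualbox $SWARM_OPTS $NODE_MACHINE"
  # 3. Overlay network, created through the Swarm master
  run "eval \"\$(docker-machine env --swarm swarm-master)\""
  run "docker network create --driver overlay $NETWORK"
}

bootstrap
```

Setting `DRY_RUN=0` would execute the same commands; that mode assumes VirtualBox and docker-machine are installed.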

After creating the nodes

Connect to the Swarm master with:

eval $(docker-machine env --swarm swarm-master)

The cluster configuration should be visible by running:

docker info
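
If docker info reports a single engine instead of the whole cluster, the shell is probably not pointed at the Swarm endpoint. A minimal check, assuming docker-machine's default ports (3376 for the Swarm endpoint, 2376 for a plain engine); the function name `points_at_swarm` is hypothetical:

```shell
# Succeeds when DOCKER_HOST targets docker-machine's default Swarm port (3376).
points_at_swarm() {
  case "${DOCKER_HOST:-}" in
    tcp://*:3376) return 0 ;;
    *)            return 1 ;;
  esac
}

if points_at_swarm; then
  echo "DOCKER_HOST=$DOCKER_HOST looks like a Swarm endpoint"
else
  echo "DOCKER_HOST does not look like a Swarm endpoint; re-run the eval command above"
fi
```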

Deploy Spark

Deploy the containers on constrained nodes with:

docker-compose --x-networking --x-network-driver=overlay up -d
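
To see where the containers landed, note that a Swarm manager of this generation reports container names prefixed with the node name (`node/container`). The helper below is a sketch that turns such names into a placement summary; the function name `placement` and the sample names are hypothetical.

```shell
# Reads "node/container" names on stdin and prints "container -> node".
placement() {
  while IFS= read -r name; do
    name="${name#/}"            # drop a leading slash if present
    node="${name%%/*}"          # text before the first slash
    container="${name#*/}"      # text after the first slash
    printf '%s -> %s\n' "$container" "$node"
  done
}

# Against a live cluster: docker ps --format '{{.Names}}' | placement
# Sample input with made-up container names:
printf '%s\n' 'swarm-master/master' 'swarm-node-1/worker' | placement
```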

Deploy Cassandra

In scripts/, run:

./deploy_cassandra.sh

After deployment is complete, you can run cqlsh this way:

docker run -it --rm --net container:cass1 cassandra:2.2.4 cqlsh
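
Cassandra can take a while to accept connections after its container starts, so a first cqlsh attempt may fail. A generic retry helper is one way to wait for it; this is a sketch, and the `retry` function, the attempt count, and the delay are assumptions rather than part of the deployment scripts.

```shell
#!/usr/bin/env bash
# Retry a command up to $1 times, sleeping $2 seconds between attempts.
retry() {
  local attempts="$1" delay="$2" i
  shift 2
  for ((i = 1; i <= attempts; i++)); do
    if "$@"; then
      return 0                  # command succeeded
    fi
    if ((i < attempts)); then
      sleep "$delay"            # wait before the next attempt
    fi
  done
  return 1                      # every attempt failed
}

# Hypothetical usage against the cluster described above:
# retry 30 5 docker run --rm --net container:cass1 cassandra:2.2.4 cqlsh -e 'DESCRIBE KEYSPACES'
```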

A /data folder is mounted as a shared volume outside the container so that Cassandra data is preserved.
