
Cherry: A Distributed Task-Aware Shuffle Service for Serverless Analytics


About

Code for the paper "Cherry: A Distributed Task-Aware Shuffle Service for Serverless Analytics", published at the 2021 IEEE International Conference on Big Data.

While there has been a lot of effort in recent years in optimising Big Data systems like Apache Spark and Hadoop, the all-to-all transfer of data between the map and reduce stages of a computation, i.e., the shuffle mechanism between cluster nodes, remains a serious bottleneck. In this work, we present Cherry, an open-source distributed task-aware Caching sHuffle sErvice for seRveRless analYtics. Our thorough experiments on a cloud testbed using realistic and synthetic workloads showcase that Cherry can achieve an almost 23% to 39% reduction in the completion time of the reduce stage with small shuffle block sizes and a 10% reduction in execution time on real workloads, while it can efficiently handle Spark execution failures with a constant task re-computation time overhead compared to existing approaches.

The current implementation has been made with Python 3.7, Apache Spark 3.0.1, Kubernetes 1.20.1, Docker 20.10.1, Java 8, Scala 3.1.0 and Ansible 2.10. Also, Calico CNI has been used as a Network Plugin for Kubernetes, and Prometheus Operator 0.48.1 for monitoring the Kubernetes Cluster.

Getting Started

The following instructions will help you run this project on a Kubernetes cluster. The provided Ansible playbooks speed up this procedure.

Prerequisites

As mentioned above, you will need to install the correct versions of Python, Apache Spark, Kubernetes (kubeadm, kubelet, kubectl), Docker, Java, Scala and Ansible on all the hosts in the available cluster. Firstly, install Ansible on all nodes as follows:

sudo apt install software-properties-common
sudo apt-add-repository --yes --update ppa:ansible/ansible
sudo apt install ansible
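
You can verify that the installation succeeded on each node by checking the version that Ansible reports:

ansible --version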

In order to install the rest of the required software with Ansible, first configure the /ansible/inventory/hosts file to match your cluster (a sketch of a possible inventory layout is shown after the commands below). Then execute the following:

cd ./ansible
ansible-playbook -i inventory prerequisites.yml
ansible-playbook -i inventory create_kubernetes_cluster.yml
ansible-playbook -i inventory additional_configuration.yml
ansible-playbook -i inventory playbooks/start_kubernetes_services.yml
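
As a rough sketch, an INI-style inventory for a small cluster could look like the following; the group names, hostnames and addresses here are illustrative placeholders, so keep whatever structure the repository's playbooks actually expect:

[master]
k8s-master ansible_host=192.168.1.10

[workers]
k8s-worker-1 ansible_host=192.168.1.11
k8s-worker-2 ansible_host=192.168.1.12

[all:vars]
ansible_user=ubuntu
ansible_python_interpreter=/usr/bin/python3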

These playbooks configure the Kubernetes cluster. More specifically, they:

  • Disable Swap on each node
  • Add node IPs to /etc/hosts on each node
  • Enable passwordless SSH between nodes
  • Install Java, Python, Scala, Docker, kubeadm, kubelet, kubectl and enable Kubernetes Services
  • Initialize Kubernetes Cluster from Master node
  • Install Calico CNI Network Add-on
  • Join Worker nodes to the Kubernetes master
  • Install Python Docker module and Log into Docker Hub to store and retrieve Docker images
  • Add monitoring packages for Kubernetes (i.e., prometheus-operator)
  • Label Kubernetes nodes accordingly, create namespace, start Kubernetes Services for Spark and Prometheus to scrape Spark metrics
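
Once the playbooks finish, a quick sanity check from the Master node (assuming kubectl is configured there) is to confirm that all nodes have joined and the system pods are running:

kubectl get nodes -o wide
kubectl get pods --all-namespaces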

Available Ansible Playbooks

Additional Ansible playbooks have been implemented in the /ansible/playbooks folder that do the following:

  • Build and Push Docker Images to Docker Hub
  • Pull Docker Images from Docker Hub for each node
  • Remove old Docker Images to free disk space
  • Clear RAM
  • Copy data to Worker nodes for Spark Workloads
  • Destroy the Kubernetes Cluster
  • Create an HDFS Cluster and Generate TPC-DS Data
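
These playbooks are run the same way as the ones above; the filename below is a placeholder for whichever playbook in /ansible/playbooks you want to execute:

cd ./ansible
ansible-playbook -i inventory playbooks/<playbook-name>.yml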

Configuration

To configure Spark, the /conf/spark-env.sh and /conf/spark-defaults.conf files are available. To overwrite these configurations, a dedicated Bash script (the spark-driver.sh file) implements this procedure.
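
For illustration, a spark-defaults.conf for such a deployment could contain entries along the following lines; the keys are standard Spark properties, but the values are placeholders rather than the ones shipped with this repository:

spark.master                      spark://spark-master:7077
spark.executor.memory             4g
spark.executor.cores              2
spark.eventLog.enabled            true
spark.ui.prometheus.enabled       true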

Execution Options

To simplify the deployment of a Spark Cluster and a Spark workload as a Kubernetes Job with different parameters, the aforementioned script is used, and the flags it accepts are the following:

## -s: service
#  values: cherry, external, none
## -w: workload
#  values: synthetic, skew, tpcds, file.py, file.jar
## -c: class
#  values: java.class (if workload == jar)
## -d: dataset
#  values: .csv, null, etc.
## -p: parallelism
#  values: number
## -q: queries (if workload == tpcds)
#  values: comma-separated TPC-DS queries (e.g. q2,q5)
## -l: look-ahead caching enabled for Cherry
#  values: boolean (default: false)
## -r: look-ahead caching port
#  values: port number (default: 7788)
## -g: gigabytes (size of data created between 2 stages, in GB)
#  values: Int (e.g. 1, 10, 100)
## -z: distributed (uses the distributed Cherry service)
#  values: boolean (e.g. true)
## -k: skewness (if workload == skew)
#  values: [0, 1]

Available Workloads

The available workloads are the following:

  • synthetic.workload.py
  • skewed_synthetic_workload.py
  • amazon_customer_reviews_workload.py
  • TPCDS Benchmark (after using the Ansible playbook that creates an HDFS cluster and generates TPC-DS Benchmark Data)

The next section shows how to deploy a Spark workload in a Kubernetes cluster, with examples.

Deployment

To deploy a Spark Cluster with differently configured workloads, you need to deploy the implemented Spark Metadata Service, the Spark Master, Workers, Cherry shuffle services and finally the Driver job. Example command:

kubectl delete deploy spark-metadata-service spark-worker spark-cherry-shuffle-service -n spark \
&& kubectl delete job spark-driver -n spark \
&& sleep 1m \
&& kubectl create -f ./kubernetes/spark-metadata-service/spark-metadata-service-deployment.yaml --namespace=spark \
&& sleep 1m \
&& kubectl create -f ./kubernetes/spark-cherry-shuffle-service/spark-cherry-shuffle-service-deployment.yaml --namespace=spark \
&& kubectl create -f ./kubernetes/spark-worker/spark-worker-deployment.yaml --namespace=spark \
&& kubectl scale deployments/spark-worker --replicas=10 --namespace=spark \
&& kubectl scale deployments/spark-cherry-shuffle-service --replicas=10 --namespace=spark \
&& sleep 1m \
&& kubectl create -f ./kubernetes/spark-driver/spark-driver-job.yaml --namespace=spark
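
After the driver Job is created, its progress can be followed with standard kubectl commands, for example:

kubectl get pods -n spark -w
kubectl logs -f job/spark-driver -n spark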

Example commands with flags for the spark-driver.sh script to execute different workloads (these are set in the /kubernetes/spark-driver/spark-driver-job.yaml file; a sketch of where they go follows the examples):

# executes the TPC-DS Q2 and Q5 queries with 100 mappers in the first stage, using vanilla Spark
["/spark/spark-driver.sh", "-s", "none", "-w", "tpcds", "-p", "100", "-q", "q2,q5"]
# executes the synthetic workload with 1500 mappers, creating 5GB of shuffle data, using the distributed Cherry shuffle service and the look-ahead caching policy
["/spark/spark-driver.sh", "-s", "cherry", "-w", "synthetic", "-p", "1500", "-g", "5", "-l", "true", "-r", "7788", "-z", "true"]
# executes the skewed synthetic workload with distributed Cherry and caching, and skewness=0.8
["/spark/spark-driver.sh", "-s", "cherry", "-w", "skew", "-p", "2000", "-g", "20", "-l", "true", "-r", "7788", "-z", "true", "-k", "0.8"]


License

This project is licensed under the GNU License - see the LICENSE file for details.
