
Apache Spark in MareNostrum-IV and Nord3


Scripts to launch Spark executions on MareNostrum-IV and Nord3

Singularity Image

The latest version of the image is dcc-spark02. Edit the definition file to add whatever applications and libraries you want available in your environment.

Build Singularity image

sudo singularity build dcc-spark02.simg dcc-spark02.singularity

Test shell

singularity shell dcc-spark02.simg

Run image (locally)

singularity exec dcc-spark02.simg spark-class "org.apache.spark.deploy.master.Master" --host 127.0.0.1
singularity exec dcc-spark02.simg spark-class 'org.apache.spark.deploy.worker.Worker' -d /tmp/spark-worker spark://127.0.0.1:7077
singularity exec dcc-spark02.simg pyspark --master spark://127.0.0.1:7077 < example.py

Kill Spark processes

kill $( ps aux | grep spark | grep -v "grep" | awk '{print $2}' )

MareNostrum-IV and Nord3

Environment

This is what you find in the MareNostrum GPFS filesystem:

  • Your shared personal home is at /gpfs/home/GROUP/USER (don't run things from here → NOT a working directory)
  • Your shared project space is at /gpfs/projects/GROUP/USER (don't run things from here → NOT a working directory)
  • Your shared scratch is at /gpfs/scratch/GROUP/USER (working directory; temporary storage, without backup)
  • Your shared archive is at /gpfs/archive/GROUP/USER (long-term storage)
  • Local scratch is visible only on its own node (it is not shared) and is lost at the end of the execution (working directory)

Be sure to have your access to the BSC machines configured with RSA key exchange: the Spark Master and Workers communicate via passwordless SSH!
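
A minimal sketch of one way to set this up, assuming your GPFS home (and therefore ~/.ssh) is shared by the login and compute nodes as described above; the key file name and the empty passphrase are illustrative choices:

# Generate an RSA key pair (no passphrase, so SSH between nodes needs no prompt)
ssh-keygen -t rsa -f ~/.ssh/id_rsa -N ""
# Authorize your own key; since the home is shared via GPFS, every node sees
# the same ~/.ssh, so nodes can SSH to each other without a password
cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
chmod 600 ~/.ssh/authorized_keys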

Process to run regular scripts

Script submission process (a sketch of the whole flow follows the list):

  • Copy experiment files through dt0{1,2,3}.bsc.es (e.g., use scp)
  • Connect to the cluster:
    • MN-IV: Connect through MN-IV login mn{1,2,3}.bsc.es (using ssh)
    • Nord3: Connect through Nord3 login nord{1,2,3}.bsc.es (using ssh)
  • Create a script that runs the whole experiment unattended (and give it execution permissions)
    • Move experiments to /gpfs/scratch/...
    • Execute stuff
    • Retrieve experiments to /gpfs/home/...
  • Submit the job:
    • MN-IV: Execute sbatch script.sh
    • Nord3: Execute bsub < script.sh
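
A minimal sketch of that flow for MN-IV, assuming a hypothetical experiment directory my_experiment/ that contains the job script script.sh (the host names and commands are the ones listed above; adapt GROUP/USER to your account):

# 1. Copy the experiment files through a data-transfer node
scp -r my_experiment/ USER@dt01.bsc.es:/gpfs/home/GROUP/USER/
# 2. Connect to a login node (mn1/mn2/mn3 for MN-IV, nord1/nord2/nord3 for Nord3)
ssh USER@mn1.bsc.es
# 3. Submit the job script (on Nord3, use: bsub < script.sh)
sbatch ~/my_experiment/script.sh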

Run scripts in MareNostrum-IV

MareNostrum-IV has nodes with 48 CPUs and 96 GB of RAM. The queue state can be checked with squeue (usual queues are debug and BSC).

Using module avail you can list the modules available to be loaded (e.g. java, python, singularity, gcc...).

Example of a script header for the debug queue: 20 minutes until timeout, 144 cores (3 nodes) in exclusive mode; it then loads the intel module and runs your commands:

#!/bin/bash
#SBATCH --qos=debug
#SBATCH --output=output_%j.out
#SBATCH --error=output_%j.err
#SBATCH --nodes=3
#SBATCH --exclusive
#SBATCH --job-name=spark01_single
#SBATCH --time=00:20:00

module load intel

Here goes the rest of your bash script...

Then submit the script using the sbatch command.
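
For example, to submit the job and then check its state in the queue (script.sh being the script whose header is shown above):

sbatch script.sh
squeue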


Run scripts in Nord3

Nord3 has nodes with 16 CPUs and 127 GB of RAM. The queue state can be checked with bqueues (usual queues are debug and bsc_cs).

Using module avail you can list the modules available to be loaded (e.g. java, python, singularity, gcc...).

Example of a script header for the debug queue: 20 minutes until timeout, 32 cores (2 nodes) in exclusive mode; it then loads the intel module and runs your commands:

#!/bin/bash
#BSUB -J example_spark01_single
#BSUB -q debug
#BSUB -W 00:20
#BSUB -oo output_%J.out
#BSUB -eo output_%J.err
#BSUB -n 32
#BSUB -x

module load intel

Here goes the rest of your bash script...

Then submit the script using the bsub command.
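
For example, to submit the job and then check the queue state under LSF:

bsub < script.sh
bqueues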


Example of Running Spark scripts

Build the Singularity image and copy it to your Nord3/MN-IV project space.

Get the example script, and modify only the Experiment preparation and Experiment wrap-up sections.

  • Copy the image to the scratch directory (shared or local)
  • Copy the experiment files to scratch (shared or local)
  • Edit the script file to match with your experiments (in preparation and wrap-up)
    • Variable EXPERIMENT contains the script to be submitted to Spark
    • Variables SPARK_WORKER_ARGS and WORKERS_NODE contain the worker configuration. By default there is one worker per node, with all of the node's resources available to it. If you run more workers per node, SPARK_WORKER_ARGS must be adjusted accordingly (e.g. WORKERS_NODE=2 → SPARK_WORKER_ARGS='-c 8 -m 62g'; see the sketch after this list)
  • Submit your job with sbatch script.sh (MN-IV) or bsub < script.sh (Nord3)
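
A minimal sketch of what that worker configuration might look like for two workers per node, using the Nord3 figures above; the variable names come from the example script, while the values and the experiment file name are illustrative:

# Script to be submitted to Spark (file name is illustrative)
EXPERIMENT=example.py
# Two workers per node, each getting half of a 16-core / 127 GB Nord3 node
WORKERS_NODE=2
SPARK_WORKER_ARGS='-c 8 -m 62g'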

The example script copies the Singularity image to scratch only if it is not already there. If you modify the image, be sure to remove the old copy from /gpfs/scratch/...
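
That check is conceptually a copy-if-missing; a sketch only, with illustrative paths (the real ones are set inside the example script):

# Copy the Singularity image to shared scratch only if it is not already there
IMAGE=dcc-spark02.simg
if [ ! -f /gpfs/scratch/GROUP/USER/$IMAGE ]; then
    cp $IMAGE /gpfs/scratch/GROUP/USER/
fi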
