Course materials for PRACE Introduction to Spark for Data Scientists.
Clone or download
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
Spark_Applications Spark Appl Jan 9, 2019
lab_exercises Volcanic_Ash_Use_Case.pdf Jan 11, 2019
presentations Merge branch 'master' of https://github.com/EPCCed/prace-spark-for-da… Jan 11, 2019
walkthrough_examples metadata Jan 8, 2019
Get_Started_Notebooks_Cirrus.pdf small update to get started guide Jan 10, 2019
Get_Started_local.md Create Get_Started_local.md Jan 9, 2019
LICENSE Initial commit Jan 4, 2019
README.md

README.md

Introduction to Spark for Data Scientists

Course materials for the PRACE course "Introduction to Spark for Data Scientists".

  • Refer to Get_Started_Notebooks_Cirrus.pdf to set up your environment on Cirrus and get started by running a Jupyter notebook.
  • Refer to Get_Started_local.md to set up a Apache Spark on your laptop.
  • Spark_Applications contains a simple example of a standalone Spark application and shows how to submit it on Cirrus.
  • lab_exercises are the exercises that you are going to complete in the practical sessions.
  • walkthrough_examples contains a set of Jupyter notebooks with PySpark examples that you can walk through in your own time.
  • The lectures are included in presentations for reference.

© The University of Edinburgh 2019