Intro. to Apache Spark and Python Notebooks
This project shows you how to use Python Notebooks and Apache Spark to perform simple analysis on the Back to the Future transcript.
Follow these simple instructions and you'll be up and running:
git clone https://github.com/markwatsonatx/tutorial-spark-notebook-wordcount cd tutorial-spark-notebook-wordcount docker-compose up -d
docker-compose up -d you can access the sample notebook at http://DOCKER-HOST-IP:38889/notebooks/WordCount.ipynb.
You can learn more by watching the YouTube video here.
Check out the Scala version here: https://github.com/markwatsonatx/tutorial-spark-notebook-wordcount-scala