Experimentations with spark
/!\ Spark only works on Linux / MacOs.
- Download Spark. I used Spark 2.4.5.
- Unzip Spark on your computer.
- You'll need Java installed. Carefull Spark 2.4.5 runs on Java 8. If you have an upper version of Java installed, you can follow this thread to solve your problem.
- Go to your Spark folder and run
./spark-2.4.5-bin-hadoop2.7/bin/spark-submit --hto make sure Spark is installed. - Install
pyspark:
pip install pyspark- Run scripts and have fun:
./spark-2.4.5-bin-hadoop2.7/bin/spark-submit --master local[4] ./Spark/WordCount/wordcount.py