Automation of the article "Building your Own Big Data Infrastructure for Data Science" by Ashton Sidhu.
The build process uses Vagrant to create a virtual machine with Ubuntu 18.10 and install the following software:
- Hadoop
- YARN
- Hive
- Spark
- MySQL
- Jupyter Lab
- Docker
To build the virtual machine, open a shell inside it, and start all services, run:
vagrant up
vagrant ssh
start-all-services.sh
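
As a rough sketch, the build step could be guarded by a check that Vagrant is installed on the host before `vagrant up` runs. The function names `require_cmd` and `build_vm` are hypothetical and are not part of this repository; only the `vagrant up` command comes from the steps listed here.

```shell
#!/bin/sh
# Sketch only: require_cmd and build_vm are illustrative names, not part of this repo.

# Return 0 if the named command is on PATH; otherwise print a hint and return 1.
require_cmd() {
    if command -v "$1" >/dev/null 2>&1; then
        return 0
    fi
    echo "missing required command: $1" >&2
    return 1
}

# Create and provision the VM only when Vagrant is available on the host.
build_vm() {
    require_cmd vagrant || return 1
    vagrant up
}
```

After sourcing this file, calling `build_vm` would replace the bare `vagrant up`; `vagrant ssh` and `start-all-services.sh` are then run as before.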