There should be no necessary libraries to run the code here beyond the Anaconda distribution of Python. The code should run with no issues using Python versions 3.*.
This project is part of the requirements for the Data Scientist Nanodegree program at Udacity. For this project, I was interested in using GlassDoor data for DS/ML jobs to better understand:
- What are the top industries hiring data professionals
- What are the main tools used by data professionals
- Which data role has the highest salary
There is a Jupyter Notebook available here to showcase work related to the above questions. Markdown cells were used to assist in walking through the thought process for individual steps.
The main findings of the code can be found at the post available here.
Data obtained from Kaggle.