There should be no necessary libraries to run the code here beyond the Anaconda distribution of Python. The code should run with no issues using Python versions 3.*.
If Anaconda distribution is not installed, then to be able to run these notebooks, the following libraries needs to be installed on the top of python:
- Pandas
- Seaborn
- Matplotlib
For this project, I was interestested in using Stack Overflow developer survey 2020 data to better understand the developer's behaviour. I focussed mainly on the following key questions:
- What are the major Job Factors the developers evaluate while looking for a job?
- What are programming languages the developers mostly worked/working on now and are interested or desired to work later?
- Will developers have more Job Satisfaction by having a Master's degree over a Bachelor's degree in some field or by having a CS Major over a Non-CS Major?
In this repository, there is Developer-Behaviour.ipynb file available to showcase work related to the above questions. I could not put the dataset file in the git repository as it's size is large. But, I provided the dataset link at the bottom of the page here to download. The dataset csv file needs to be in the same folder where the .ipynb files are there.
The main findings of the code can be found at the medium blog post available here! https://medium.com/walmartlabs/developers-behaviour-from-datascience-perspective-1b25376b2839.
The major credit goes to Stack Overflow for providing the data. You can find the licensing, data used and its decription here to download. Also, you can access these at Stack Overflow website.