- You will need to install Pandas, Seaborn, Matplotlib and Scikit-learn.
- You will need to download the dataset from Kaggle.
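
To confirm the libraries listed above are available before opening the notebook, a small check like the following may help. It is a minimal sketch, not part of the original project: the helper name `missing_packages` and the `REQUIRED` mapping are hypothetical, and it only tests whether each package can be imported, not which version is installed.

```python
from importlib.util import find_spec

# Hypothetical mapping: install name -> import name.
# Note that scikit-learn is imported as "sklearn".
REQUIRED = {
    "pandas": "pandas",
    "seaborn": "seaborn",
    "matplotlib": "matplotlib",
    "scikit-learn": "sklearn",
}


def missing_packages(required=REQUIRED):
    """Return the install names of packages that cannot be found."""
    return [name for name, module in required.items() if find_spec(module) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are installed.")
```

Any package reported as missing can then be installed with your usual package manager.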
As an aspiring People Data Analyst, I decided to use IBM's HR Analytics Employee Attrition and Performance dataset to demonstrate the skills I learned in Udacity's Data Scientist Nanodegree Program. For this capstone, I apply CRISP-DM, the data science process I am familiar with.
- HR-Employee-Attrition.csv: The dataset I used in my project.
- Capstone Project - DSND.ipynb: The notebook where I did my analysis and built my model.
The results and discussion are published on Medium.
Credit goes to IBM and pavansubhash for the data. You can find the licensing for the data and other descriptive information at the Kaggle link available here. Otherwise, feel free to use the code here as you would like!