GOAL: Identify the factors affecting healthcare costs and make recommendations.
- Employed R programming and R Studio to acquire the dataset and conduct data cleaning, resulting in a dataset ready for exploration.
- Utilized exploratory data analysis techniques to understand data; resulting in the identification of 4 key drivers of healthcare costs.
- Identified key drivers; applied decision trees, support vector machines and naive bayes to predict future expensive and non expensive customers with an accuracy of 84%, 85% and 80% respectively.
- Documented insights in a report and formulated a presentation containing the data outcomes. Suggested 3 techniques to reduce healthcare costs.