For ease of viewing, please use the nbviewer version linked here
Exploratory data analysis of the Supermarket Sales database from Kaggle using R and Jupyter Notebooks.
My code includes: data formatting, working with date objects, data visualization, heatmaps, scatterplots, frequency plots and regression analysis. I mainly consider three questions:
- What drives the purchased quantity?
- What drives the total purchased amount?
- What drives the rating that customers give based on their experience?
Every step of the code is commented and the rationale for the way I do certain procedures is described when possible.