Skip to content

fdiamant/workearly_final_assignment

Repository files navigation

Final Assignment

System

  1. MacOS 12.3.1
  2. MySQL Workbench 8.0.31 Community
  3. PyCharm 2023.1 (Community Edition)
  4. Python 3.11 Interpreter
  5. Tableau Public 2023.1.0

Step 1, sql

Step 2, python

Missing values handling

After calculating the percentage of missing values per column, it becomes evident that a significant portion of the missing value data is observed in the store_location column (12.16%) and the category_name column (8.11%). However, as the significant columns here are not category_name and store_location, it is not necessary to drop the indexes with missing values.

Scatter plot creation

After aggregating the data in the CSV file, I assigned the zip_code column as the values for the x-axis and the sum of bottles sold column as the values for the y-axis. Subsequently, I generated a colormap for each unique item_description based on the gnuplot2 colorspace of the matplotlib library.

For details, see the liquor_sale.py

Step 3, Tableau

The CSV file generated in the previous step was utilized to create visualizations in Tableau.

Dashboard on Tableau Public
Dashboard screenshot

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages