# Tabular Playground Series - January 2022

<h2> The Storytelling Behind Data - Using Flourish </h2>

<h3> What is in this notebook? </h3>

This notebook isn't about finding the best forecasting model or getting the lowest possible score. Instead, I'll look through the dataset, do some EDA (Exploratory Data Analysis) and build a story using Flourish, a beautiful yet impactful visualization tool. Flourish makes it easy to visualize data without writing code. From interactive dashboards to analytics storytelling, that's what I want to explore in the Tabular Playground Series for January 2022.

This data doesn't simulate a real-world environment, but data storytelling is fun all the same. Let's move on to the charts and explore the dataset in more depth. Happy reading :)

<h3> What does the dataset look like? </h3>

Problem Statement - There are two (fictitious) independent store chains selling Kaggle merchandise that want to become the official outlet for all things Kaggle. We've decided to see if the Kaggle community could help us figure out which of the store chains would have the best sales going forward.   

We load the data with pandas and take a first look at it:

In [None]:
#Importing Pandas Library to open the dataset
import pandas as pd

#Loading the dataset and viewing first few rows
playground_data = pd.read_csv('../input/tabular-playground-series-jan-2022/train.csv')
playground_data.head()

<h3> Understanding the dataset </h3>

* The dataset has 6 columns that store the details of sales of Kaggle Products by Stores in multiple countries.
* 3 countries are present: Finland, Norway and Sweden. There are 2 stores: Kaggle Mart and Kaggle Rama.
* The stores sell 3 types of goodies - Kaggle Mug, Kaggle Hat and Kaggle Stickers
* Sales data is recorded from January 01, 2015 to December 31, 2018 for all stores across 3 countries.  
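The points above can be checked against the loaded frame. This is a minimal sketch, assuming the columns are named `date`, `country`, `store` and `product` (what I'd expect in this competition's `train.csv`, but do confirm with `playground_data.columns`). `summarize_dataset` is a helper name made up for illustration.

```python
import pandas as pd


def summarize_dataset(df: pd.DataFrame) -> dict:
    """Collect the basic facts about the dataset so they can be eyeballed in one place."""
    # Column names below are assumptions; adjust them if the file differs.
    dates = pd.to_datetime(df["date"])
    return {
        "n_columns": df.shape[1],
        "countries": sorted(df["country"].unique()),
        "stores": sorted(df["store"].unique()),
        "products": sorted(df["product"].unique()),
        "first_date": dates.min().date(),
        "last_date": dates.max().date(),
    }


# In the notebook: summarize_dataset(playground_data)
```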

Now that we know what the dataset is, let's study the following questions using interactive Flourish charts:

# Question 1: Which Goodies were the most in Demand? 

Not all products are in equal demand. We have 3 goodies, so let's check which sold the most across all stores and countries over the 4 years.

<iframe src='https://flo.uri.sh/visualisation/8288533/embed' title='Interactive or visual content' class='flourish-embed-iframe' frameborder='0' scrolling='no' style='width:100%;height:600px;' sandbox='allow-same-origin allow-forms allow-scripts allow-downloads allow-popups allow-popups-to-escape-sandbox allow-top-navigation-by-user-activation'></iframe>

It seems people buy Kaggle Hats the most. Let's drill down into this further.

# Question 2: Exploring the Love for Kaggle Hats

<iframe src='https://flo.uri.sh/visualisation/8288943/embed' title='Interactive or visual content' class='flourish-embed-iframe' frameborder='0' scrolling='no' style='width:100%;height:700px;' sandbox='allow-same-origin allow-forms allow-scripts allow-downloads allow-popups allow-popups-to-escape-sandbox allow-top-navigation-by-user-activation'></iframe>


<iframe src='https://flo.uri.sh/visualisation/8288779/embed' title='Interactive or visual content' class='flourish-embed-iframe' frameborder='0' scrolling='no' style='width:100%;height:490px;' sandbox='allow-same-origin allow-forms allow-scripts allow-downloads allow-popups allow-popups-to-escape-sandbox allow-top-navigation-by-user-activation'></iframe>

We have the following observations:

* Kaggle Hats were the most in-demand product (54% of total sales).
* Kaggle Stickers were the least in demand, accounting for only 16% of total sales.

Kaggle Hats sold about 3.5x as much as Kaggle Stickers. Yeah, that's some serious demand for data science fashion :)
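The shares and the hat-to-sticker ratio can also be recomputed in pandas rather than read off the charts. A rough sketch, assuming the sales column is called `num_sold` and the product column `product`; `product_share` is a hypothetical helper, not something the charts depend on.

```python
import pandas as pd


def product_share(df: pd.DataFrame) -> pd.Series:
    """Return each product's percentage of total units sold, largest first."""
    # "product" and "num_sold" are assumed column names.
    units = df.groupby("product")["num_sold"].sum()
    return (100 * units / units.sum()).sort_values(ascending=False)


# In the notebook:
# share = product_share(playground_data)
# ratio = share.max() / share.min()  # top product relative to bottom product
```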

# Question 3: What was the number of Goodies each Store sold across countries over 4 years?

<iframe src='https://flo.uri.sh/visualisation/8289379/embed' title='Interactive or visual content' class='flourish-embed-iframe' frameborder='0' scrolling='no' style='width:100%;height:600px;' sandbox='allow-same-origin allow-forms allow-scripts allow-downloads allow-popups allow-popups-to-escape-sandbox allow-top-navigation-by-user-activation'></iframe>

<h3> Observations made from above analyses </h3>

The following are the observations made from the above study:

* For each of the 3 goodies, Kaggle Rama sold more units than Kaggle Mart.
* Kaggle Rama accounted for 63.54% of the total sales of Kaggle Goodies overall.

<iframe src='https://flo.uri.sh/visualisation/8289534/embed' title='Interactive or visual content' class='flourish-embed-iframe' frameborder='0' scrolling='no' style='width:100%;height:600px;' sandbox='allow-same-origin allow-forms allow-scripts allow-downloads allow-popups allow-popups-to-escape-sandbox allow-top-navigation-by-user-activation'></iframe>

<iframe src='https://flo.uri.sh/visualisation/8289939/embed' title='Interactive or visual content' class='flourish-embed-iframe' frameborder='0' scrolling='no' style='width:100%;height:600px;' sandbox='allow-same-origin allow-forms allow-scripts allow-downloads allow-popups allow-popups-to-escape-sandbox allow-top-navigation-by-user-activation'></iframe>

Also, for each of the Kaggle goodies, Kaggle Rama takes approximately 63% of the total sales across countries over the 4-year span.
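To double-check the store split, both overall and per product, something like the following should work. It is only a sketch under the assumption that the columns are named `store`, `product` and `num_sold`; the two function names are made up for this example.

```python
import pandas as pd


def store_share_overall(df: pd.DataFrame) -> pd.Series:
    """Percentage of all units sold by each store."""
    units = df.groupby("store")["num_sold"].sum()
    return 100 * units / units.sum()


def store_share_by_product(df: pd.DataFrame) -> pd.DataFrame:
    """Percentage of each product's units sold by each store (every row sums to 100)."""
    units = df.pivot_table(
        index="product", columns="store", values="num_sold", aggfunc="sum"
    )
    return units.div(units.sum(axis=1), axis=0) * 100


# In the notebook:
# store_share_overall(playground_data)
# store_share_by_product(playground_data)
```

If one store's share is roughly the same in every row of the second table, the split does not depend on the product.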

# Question 4: What was the Amount of Sales made by each Country (Collectively for 4 Years)?

<iframe src='https://flo.uri.sh/visualisation/8290052/embed' title='Interactive or visual content' class='flourish-embed-iframe' frameborder='0' scrolling='no' style='width:100%;height:600px;' sandbox='allow-same-origin allow-forms allow-scripts allow-downloads allow-popups allow-popups-to-escape-sandbox allow-top-navigation-by-user-activation'></iframe>

From the chart above, we observe that Norway reported the highest sales for every goodie, followed by Sweden and Finland. Ranking the country-product combinations in descending order of sales, we also see:

* Hats took the 1st, 2nd and 3rd positions.
* Mugs took the 4th, 5th and 6th positions.
* Stickers occupied the last 3 positions.
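The positions listed above come from ranking every country-product pair by total sales. Here is a minimal sketch of that ranking, assuming columns named `country`, `product` and `num_sold`; `rank_country_product` is a hypothetical helper name.

```python
import pandas as pd


def rank_country_product(df: pd.DataFrame) -> pd.DataFrame:
    """Total units per (country, product) pair, sorted from highest to lowest."""
    totals = (
        df.groupby(["country", "product"], as_index=False)["num_sold"]
        .sum()
        .sort_values("num_sold", ascending=False)
        .reset_index(drop=True)
    )
    # Position 1 is the best-selling combination.
    totals["position"] = totals.index + 1
    return totals


# In the notebook: rank_country_product(playground_data)
```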

# Question 5: Is this Sequence of Sales present on the Store-level as well?

We can plot the same data for each store individually to check whether the trend holds. Both graphs are plotted within an interactive dashboard (use the store name to filter the results).

<iframe src='https://flo.uri.sh/visualisation/8290513/embed' title='Interactive or visual content' class='flourish-embed-iframe' frameborder='0' scrolling='no' style='width:100%;height:600px;' sandbox='allow-same-origin allow-forms allow-scripts allow-downloads allow-popups allow-popups-to-escape-sandbox allow-top-navigation-by-user-activation'></iframe>

The trend of Hats in the top three positions (1st, 2nd, 3rd), Mugs in the middle (4th, 5th, 6th) and Stickers in the last three holds for both stores that sell Kaggle goodies.
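The same check can be done without the dashboard filter by ranking the country-product pairs separately inside each store. This is a sketch under the assumption that the columns are named `store`, `country`, `product` and `num_sold`; `rank_within_store` is a name invented for this example.

```python
import pandas as pd


def rank_within_store(df: pd.DataFrame) -> pd.DataFrame:
    """Rank (country, product) totals from highest to lowest, separately per store."""
    totals = df.groupby(["store", "country", "product"], as_index=False)[
        "num_sold"
    ].sum()
    # method="first" breaks ties by row order, so positions stay unique per store.
    totals["position"] = (
        totals.groupby("store")["num_sold"]
        .rank(method="first", ascending=False)
        .astype(int)
    )
    return totals.sort_values(["store", "position"]).reset_index(drop=True)


# In the notebook: rank_within_store(playground_data)
```

If the product column reads the same from position 1 downward for both stores, the sequence holds at store level.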

<h3> What's Next for this Study </h3>

Over the next few days, until the end of this event, I'll try to add more interactive and interesting visualizations for the Tabular Playground - Jan 2022 series. Next, I'll include date-wise sales comparisons to understand the sales behaviour better. Also, many thanks to [Marília Prata](https://www.kaggle.com/mpwolke) for letting me know about the event.

I would love to hear your opinions on this notebook. Feel free to connect with me on [LinkedIn](https://www.linkedin.com/in/amankumar01/). I enjoyed writing this notebook. Cheers!!