## Project Proposal COMP 6934 W25

Your term project will consist of visualization designs for a selected dataset.

The final project will include:

* A minimum of three visualizations. One of the visualizations should demonstrate an original design incorporating animation or interaction in a creative manner.
* A complete analysis in the manner of assignment 2 will be expected, including:
    * introduction to the data sources and data set
    * data analysis incorporating a *what* methodology
    * purposes and goals of the visualization project, driven by client questions and needs, and incorporating Munzner-style *why* methods
* A section for each visual design used, including code, discussion of the design motivation for each visual design, incorporating idioms, *how* analysis, and sources of inspiration such as existing designs or gallery entries.
* A conclusion discussing the success of the visualizations and suggestions for improvement.

# Proposal Instructions

For the project proposal, you must select a dataset for your final project and do an initial description of how you will address the dataset in your project.

**Selection of the dataset** is on a competitive, first-come basis; each student should have a different dataset to work with. Be the first to **claim your selection** at the [selection web site](https://www.cs.mun.ca/~brown/6934/signupProject/projselect.php).
* Approval of your selection will be based on the character of the data set:
    * enough data points to provide interesting design tasks. Anything under 2000 data points would be considered too small. Smaller data sets with complex data items (such as geometries and/or links) may also be acceptable.
    * different types of data and/or attributes
    * real-world, interesting subject matter (no fake or generated data sets)

**Work with a client** if you can. I have recruited a few faculty members who have data visualization problems to work on. I will post these as possible dataset choices. You may recruit your own *reliable* client (such as a faculty member) to work with. I will provide an information sheet for them and some basic questions for you to ask before making a final commitment to working with your client.

**Work without a client** if you want. You must find your own dataset to work with and also act as the client by imagining appropriate goals and objectives. I will also post some suggestions for finding a dataset selection without a client.

Include these items in your proposal submission.
* Introduction to the dataset selection and the client
* A preliminary analysis of the data set, applying *what* methods
* A minimum of three questions regarding the dataset, expressing your client's goals and objectives, suitable for a Munzner-style *why* analysis.
* Any initial ideas for visualizations, including visuals you do not currently know how to code (you can simply sketch your ideas). Do not limit your ideas to code solutions you are already familiar with.
    * Explain what each visual accomplishes in terms of the project goals
    * Submit at least three visual designs for feedback. They do not have to be coded.

Optionally include any other preliminary ideas for other parts of the final submission. These will not be graded for marks, but you may get some good guidance and feedback.

# **Project Proposal: Black Friday Sale Analysis**  

## **Dataset:**  
[Black Friday Sale Dataset](https://www.kaggle.com/datasets/rajeshrampure/black-friday-sale)  

---

## **Introduction to the Dataset Selection and Client**  

The dataset contains transactional data from a Black Friday sale, including customer demographics, product categories, purchase amounts, and other attributes related to consumer shopping behavior. The dataset provides insights into shopping trends, customer segmentation, and purchasing patterns.  

This project aims to analyze consumer behavior during the Black Friday sale, identifying factors influencing high spending, customer demographics, and popular product categories. The analysis will focus on understanding key sales trends and customer purchasing patterns to help retailers optimize future sales strategies.  

---

## **Preliminary Analysis (What Methods)**  

1. **Data Preprocessing:**  
   - Cleaning and handling missing values, such as those in categorical fields like product category.  
   - Standardizing data formats, especially for purchase amounts and categorical data like customer demographics.  

2. **Exploratory Data Analysis (EDA):**  
   - Understanding **customer segmentation** by analyzing factors like age, gender, city category, and spending habits.  
   - Identifying **top-selling product categories** and purchase distributions.  
   - Examining **spending trends** across different demographics and locations.  

3. **Statistical & Trend Analysis:**  
   - Comparing average and total purchase amounts across customer groups (e.g., age group, city category, occupation) to quantify the **spending trends** observed during EDA.  
   - Identifying **correlations** between customer attributes (age, gender, location) and spending behavior.  
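
The preprocessing step described above could be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the final pipeline. The column names (`Gender`, `Age`, `City_Category`, `Product_Category_2`, `Purchase`) are assumptions about the CSV header, and the sample rows are hypothetical placeholders rather than real records.

```python
from collections import Counter

# Hypothetical sample rows; the column names are assumptions about the CSV header.
RAW_ROWS = [
    {"Gender": "F", "Age": "0-17", "City_Category": "A", "Product_Category_2": "", "Purchase": "8370"},
    {"Gender": "m ", "Age": "26-35", "City_Category": "b", "Product_Category_2": "6.0", "Purchase": "15200"},
    {"Gender": "M", "Age": "26-35", "City_Category": "C", "Product_Category_2": "14", "Purchase": "not available"},
]

MISSING = "unknown"  # sentinel label for absent categorical values


def normalize_category(raw):
    """Turn a category code such as '6.0' into '6'; blank or invalid codes become MISSING."""
    try:
        return str(int(float(raw.strip())))
    except (ValueError, OverflowError):
        return MISSING


def clean_row(row):
    """Return a cleaned copy of one record, or None if the purchase amount is unusable."""
    try:
        purchase = int(float(row["Purchase"]))
    except (KeyError, ValueError, OverflowError):
        return None  # drop rows whose purchase amount cannot be parsed

    cleaned = {"Purchase": purchase}
    for field in ("Gender", "Age", "City_Category"):
        # Standardize demographic labels: trim whitespace and use upper case.
        value = row.get(field, "").strip().upper()
        cleaned[field] = value or MISSING
    cleaned["Product_Category_2"] = normalize_category(row.get("Product_Category_2", ""))
    return cleaned


def clean_rows(rows):
    """Clean every row and discard the ones that fail validation."""
    return [c for c in map(clean_row, rows) if c is not None]


cleaned_rows = clean_rows(RAW_ROWS)

# Count how often each field had to be filled with the sentinel.
missing_counts = Counter(
    field for row in cleaned_rows for field, value in row.items() if value == MISSING
)
```

Keeping a sentinel label instead of dropping rows with a missing category preserves their purchase amounts for the demographic analysis, which is one possible design choice among several.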

---

## **Client’s Goals and Objectives (Munzner-Style Why Analysis)**  

The client (a retailer or market analyst) wants to understand **how customer demographics influence shopping behavior on Black Friday** and how they can optimize their marketing and sales strategies for future events.  

The key questions driving the project are:  

1. **Which customer demographics (age, gender, city type) contribute the most to total sales?**  
   - This will help businesses target the right audience for marketing campaigns.  

2. **What are the most popular product categories, and how do they vary across different customer segments?**  
   - This will provide insights into which product categories drive revenue and which customer groups prefer them.  

3. **How do spending patterns change based on city type and customer occupation?**  
   - This question will help businesses understand whether customers in some city types spend more than others and whether occupation plays a role in shopping behavior.  
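
As one possible way to approach the first question, the sketch below ranks the values of a demographic attribute by their share of total purchase amount. The function name `revenue_share`, the field names, and the sample values are hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical records; field names are assumed and values are made up for illustration.
ROWS = [
    {"Age": "26-35", "Gender": "M", "City_Category": "B", "Purchase": 400},
    {"Age": "26-35", "Gender": "F", "City_Category": "A", "Purchase": 300},
    {"Age": "18-25", "Gender": "M", "City_Category": "B", "Purchase": 200},
    {"Age": "46-50", "Gender": "F", "City_Category": "C", "Purchase": 100},
]


def revenue_share(rows, attribute):
    """Rank the values of one attribute by their share of the total purchase amount."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[attribute]] += row["Purchase"]
    grand_total = sum(totals.values())
    if grand_total == 0:
        return []  # nothing to rank when there are no purchases
    ranked = sorted(totals.items(), key=lambda item: item[1], reverse=True)
    return [(group, amount, amount / grand_total) for group, amount in ranked]


# Print one ranking per demographic attribute named in the question.
for attribute in ("Age", "Gender", "City_Category"):
    print(attribute)
    for group, amount, share in revenue_share(ROWS, attribute):
        print(f"  {group:<6} {amount:>6} {share:6.1%}")
```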

---

## **Proposed Visualizations (How Methods & Design Motivation)**  

1. **Interactive Heatmap of Customer Spending by Demographics:**  
   - **Description:** A heatmap showing how different demographic groups (age, gender, city category) contribute to total revenue.  
   - **Purpose:** This will help retailers identify key customer segments contributing the most to sales.  
   - **Design Motivation:** Heatmaps provide an intuitive way to visualize high-impact groups and compare spending behavior across different segments.  

2. **Bar Chart with Customer Purchase Distribution by Product Category:**  
   - **Description:** A bar chart displaying the top-selling product categories and how purchases are distributed across different customer segments.  
   - **Purpose:** This will highlight which product categories drive the most revenue and how different customer groups prefer specific product types.  
   - **Design Motivation:** Bar charts are effective in comparing different categories and identifying patterns in product demand.  

3. **Animated Line Graph of Purchase Trends Over Time:**  
   - **Description:** A time-series line graph showing purchase amounts over time, segmented by city category and age group.  
   - **Purpose:** This visualization will help understand how purchase behavior fluctuates during the Black Friday event and across different regions.  
   - **Design Motivation:** Line graphs are ideal for showing trends over time, and animation will help highlight purchasing spikes and trends dynamically.

4. **Sunburst Chart of Purchase Distribution by Product Category:**  
   - **Description:** A hierarchical sunburst chart showing how purchases are distributed across product categories, subcategories, and customer demographics.  
   - **Purpose:** To provide a layered view of product demand and how different customer groups contribute to total revenue.  
   - **Design Motivation:** Sunburst charts allow for an easy-to-understand breakdown of hierarchical data, making it clear how different segments contribute to overall sales.  

5. **Network Graph of Customer-Product Connections:**  
   - **Description:** A network graph representing the relationships between customer segments and product categories, showing how different customer demographics are linked to various product purchases.  
   - **Purpose:** To uncover patterns in customer-product relationships and identify which product categories appeal most to specific demographics.  
   - **Design Motivation:** Network graphs help visualize complex relationships, making it easier to see connections that might not be obvious in traditional charts.  
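
For the heatmap in the first design, the data would need to be aggregated into a two-dimensional grid before plotting. The following is a minimal sketch of that aggregation, assuming records with `Age`, `City_Category`, and `Purchase` fields. The helper `heatmap_matrix` and the sample values are hypothetical.

```python
from collections import defaultdict

# Hypothetical records; field names and values are illustrative assumptions.
ROWS = [
    {"Age": "18-25", "City_Category": "A", "Purchase": 120},
    {"Age": "18-25", "City_Category": "B", "Purchase": 80},
    {"Age": "26-35", "City_Category": "A", "Purchase": 200},
    {"Age": "26-35", "City_Category": "A", "Purchase": 50},
    {"Age": "26-35", "City_Category": "C", "Purchase": 150},
]


def heatmap_matrix(rows, row_attr, col_attr, value_attr="Purchase"):
    """Sum value_attr for every (row_attr, col_attr) pair and lay the sums out as a grid."""
    totals = defaultdict(int)
    for record in rows:
        totals[(record[row_attr], record[col_attr])] += record[value_attr]
    row_labels = sorted({r for r, _ in totals})
    col_labels = sorted({c for _, c in totals})
    # Combinations with no purchases become 0 so every row has the same length.
    grid = [[totals.get((r, c), 0) for c in col_labels] for r in row_labels]
    return row_labels, col_labels, grid


ages, cities, grid = heatmap_matrix(ROWS, "Age", "City_Category")
for age, cells in zip(ages, grid):
    print(age, cells)
```

The returned labels and grid could then be handed to a plotting library (for example, matplotlib's `imshow`) to render the cells as colors, with interaction layered on top.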


---

## **Conclusion**  

This project will provide an in-depth analysis of consumer purchasing behavior during Black Friday through interactive and animated visualizations. The heatmap will identify key customer segments, the bar chart will showcase popular product categories, and the animated line graph will reveal spending trends over time. The sunburst chart and network graph will add a layered view of product demand and of the links between customer segments and product categories.  

The final analysis will evaluate how well these visualizations address the client’s objectives and suggest potential improvements, such as predictive modeling for future sales forecasts or refining demographic targeting for marketing campaigns.  

---



# Attributions

Be sure to add your own sources or indicate you have none to add. Sources can be web sites, text materials, and so on. They do not have to be hyperlinks. Other people are also sources.

| Source | What is it | How used |
|--|--|--|
| [National Retail Federation (NRF) – Black Friday Trends](https://nrf.com/insights/holiday-and-seasonal-trends/black-friday) | Official statistics and insights into Black Friday shopping trends, spending habits, and demographics. | Used to understand consumer shopping behavior and spending trends for contextual analysis. |
| [Statista – Black Friday Spending Statistics](https://www.statista.com/topics/4076/black-friday/) | Market research data on Black Friday sales, consumer spending patterns, and category-wise breakdowns. | Provides supporting data for visualizing sales trends and category performance. |
| [McKinsey & Company – Black Friday Shopping Trends](https://www.mckinsey.com/industries/retail/our-insights/black-friday-shopping-trends) | Industry expert insights on how Black Friday sales strategies are evolving. | Helps frame the analysis in a broader industry context to derive actionable insights. |
| [Towards Data Science – Black Friday Sales Data Analysis](https://towardsdatascience.com/black-friday-sales-data-analysis) | Data science blog explaining best practices for analyzing and visualizing sales data. | Used as a reference for data preprocessing, feature selection, and visualization techniques. |
| [Data Viz Project – Choosing the Right Visualization](https://datavizproject.com/) | A resource for selecting the best data visualization techniques. | Helps in deciding which visual representations (heatmaps, bar charts, animated graphs) are best for the dataset. |
| [Harvard Business Review – Data-Driven Decision Making in Retail](https://hbr.org/) | Articles on using data analytics and visualization for business insights in retail. | Supports the argument for using data visualization to drive decision-making in sales and marketing. |
