<a href="https://colab.research.google.com/github/HussainPythonista/Movie-Product-Recommendation-System/blob/main/Data_Cleaning/Walmart_Analysis.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# Walmart Dataset Overview:

A Walmart dataset typically refers to a collection of data about the operations, sales, and transactions of Walmart stores. Such a dataset may include the following kinds of information:

## Sales Data:

- Transaction details, including timestamps, products purchased, and transaction amounts.
- Information on sales trends, seasonality, and popular products.

## Store Information:

- Details about individual Walmart store locations.
- Store sizes, geographical locations, and other store-specific attributes.

## Product Information:

- Characteristics of products available in Walmart stores.
- Product categories, prices, and inventory data.

## Customer Data:

- Information about Walmart customers.
- Customer demographics, purchase history, loyalty programs, etc.

## Promotion and Discount Data:

- Details about promotions, discounts, and marketing campaigns.
- Effectiveness of promotions on sales.

## Time Series Information:

- Time-related data to analyze trends over different time periods (daily, monthly, yearly).

# Use Cases for Analyzing Walmart Datasets:

### Sales Forecasting:

- Predict future sales based on historical data and external factors.

### Inventory Management:
- Optimize inventory levels to prevent stockouts or overstock situations.

### Customer Segmentation:

- Understand customer behavior and preferences to tailor marketing strategies.

### Store Performance Analysis:
- Evaluate the performance of individual stores and identify areas for improvement.

### Promotion Effectiveness:
- Assess the impact of promotions and discounts on sales.

### Market Basket Analysis:
- Identify patterns of products frequently purchased together to optimize product placement.
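
As a rough illustration of the idea, the sketch below counts how often pairs of products appear in the same basket. The `baskets` list is entirely hypothetical; the product file loaded later in this notebook has no transaction records, so real basket data would have to come from elsewhere.

```python
from collections import Counter
from itertools import combinations

# Hypothetical baskets: each inner list is one customer's purchase.
baskets = [
    ["baby wipes", "diapers", "hand soap"],
    ["baby wipes", "diapers"],
    ["hand soap", "olive oil"],
    ["baby wipes", "diapers", "olive oil"],
]

# Count every unordered product pair that occurs within a basket.
pair_counts = Counter()
for basket in baskets:
    for pair in combinations(sorted(set(basket)), 2):
        pair_counts[pair] += 1

print(pair_counts.most_common(1))  # [(('baby wipes', 'diapers'), 3)]
```

Pairs with high counts are candidates for shared shelf placement or bundling. Dedicated association-rule methods go further by also weighing how common each product is on its own.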

### Seasonal Analysis:

- Understand how sales patterns change during different seasons or events.

### Price Optimization:
- Optimize product pricing strategies for increased competitiveness and profitability.


# Steps to Explore a Walmart Dataset:
## Data Loading:
- Load the dataset into a data analysis tool (like Pandas in Python) to start exploring.
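
A minimal loading sketch follows, using a small in-memory CSV so it runs without Google Drive. The two sample rows and the `sample_csv` and `sample_df` names are assumptions for illustration. The empty first header cell mimics a file that was saved together with its index, which is what pandas reports as an `Unnamed: 0` column.

```python
import io

import pandas as pd

# Hypothetical two-row sample. The leading empty header cell mimics a CSV
# written with its index, which pandas would otherwise name "Unnamed: 0".
sample_csv = io.StringIO(
    ",Product Name,List Price,Sale Price,Brand\n"
    "0,Allegiance Economy Dual-scale Digital Thermometer,11.11,11.11,Cardinal Health\n"
    "1,THE FIRST YEARS,6.99,6.99,The First Years\n"
)

# index_col=0 uses the first column as the row index instead of keeping it
# as an extra data column.
sample_df = pd.read_csv(sample_csv, index_col=0)
print(sample_df.columns.tolist())
```

Passing `index_col=0` is optional; it only avoids carrying a redundant index column through the rest of the analysis.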

## Data Exploration:

- Examine the structure of the dataset, check for missing values, and explore basic statistics.
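
The checks described above could look roughly like this. The `explore_df` frame is a hypothetical sample whose column names mirror the Walmart file; its values are illustrative only.

```python
import numpy as np
import pandas as pd

# Hypothetical sample; values are illustrative, not taken from the real file.
explore_df = pd.DataFrame({
    "Product Name": ["Thermometer", "Perfume", "Punching Bag", "Baby Wipes"],
    "List Price": [11.11, 23.99, 30.76, 6.99],
    "Brand": ["Cardinal Health", "Kenneth Cole", None, "The First Years"],
    "Item Number": [np.nan, np.nan, 563852139.0, 553299941.0],
})

print(explore_df.shape)    # (number of rows, number of columns)
print(explore_df.dtypes)   # data type of each column

# Number of missing values in each column.
missing_counts = explore_df.isna().sum()
print(missing_counts)

# Summary statistics (count, mean, std, quartiles) for numeric columns.
print(explore_df.describe())
```
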
## Visualizations:
- Create visualizations (e.g., histograms, line charts) to understand distributions and trends.
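
For example, a histogram of prices might be drawn as sketched below. The `prices` array is a hypothetical sample, and the `Agg` backend line is only there so the snippet also runs outside a notebook.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend; not needed inside a notebook

import matplotlib.pyplot as plt
import numpy as np

# Hypothetical sale prices used only to illustrate the plot.
prices = np.array([6.95, 6.99, 11.11, 14.82, 16.15, 23.99, 28.27, 30.76])

fig, ax = plt.subplots(figsize=(6, 4))
# hist returns the count per bin and the bin edges alongside the drawn bars.
counts, bin_edges, _ = ax.hist(prices, bins=4)
ax.set_xlabel("Sale Price")
ax.set_ylabel("Number of products")
ax.set_title("Distribution of sale prices (sample)")
plt.close(fig)  # release the figure once it is no longer needed
```
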
## Time Series Analysis:
- If the dataset includes a time component, conduct time series analysis to identify patterns.
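
One simple way to start is to count records per period. The sketch below assumes timestamps in the same format as the `Crawl Timestamp` column; the `timestamps` series itself is a hypothetical three-row sample.

```python
import pandas as pd

# Hypothetical sample in the same format as the "Crawl Timestamp" column.
timestamps = pd.Series([
    "2020-01-24 16:08:36 +0000",
    "2020-01-24 15:54:21 +0000",
    "2020-03-10 07:37:21 +0000",
])

# Parse the strings into timezone-aware datetimes (UTC).
parsed = pd.to_datetime(timestamps, format="%Y-%m-%d %H:%M:%S %z", utc=True)

# Count how many records fall into each calendar month.
monthly_counts = parsed.dt.strftime("%Y-%m").value_counts().sort_index()
print(monthly_counts)
```
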
## Correlation Analysis:
- Investigate relationships between different variables (e.g., sales and promotions).
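
A minimal sketch of a correlation check between two numeric columns is shown here. The `price_df` values are hypothetical, with some sale prices set below the list price purely for illustration.

```python
import pandas as pd

# Hypothetical prices; some sale prices are discounted for illustration.
price_df = pd.DataFrame({
    "List Price": [11.11, 23.99, 30.76, 6.99, 28.27],
    "Sale Price": [11.11, 21.50, 30.76, 5.99, 25.00],
})

# Pairwise Pearson correlation between the numeric columns.
corr_matrix = price_df.corr()
print(corr_matrix)
```

A value close to 1 indicates that the two columns move together; it does not by itself show that one causes the other.
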
## Feature Engineering:
- Create new features that may enhance the analysis (e.g., extracting day of the week).
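
Extracting the day of the week could be sketched as follows. The two timestamps are copied from the `Crawl Timestamp` values shown in the notebook output; `feature_df` and the `Crawl Day` column name are assumptions for illustration.

```python
import pandas as pd

# Two timestamps in the "Crawl Timestamp" format used by the dataset.
feature_df = pd.DataFrame({
    "Crawl Timestamp": ["2020-01-24 16:08:36 +0000", "2020-03-10 07:37:21 +0000"],
})

# Parse the strings, then derive the weekday name as a new feature.
crawl_time = pd.to_datetime(
    feature_df["Crawl Timestamp"], format="%Y-%m-%d %H:%M:%S %z", utc=True
)
feature_df["Crawl Day"] = crawl_time.dt.day_name()

print(feature_df["Crawl Day"].tolist())  # ['Friday', 'Tuesday']
```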

## Descriptive Statistics:
- Calculate descriptive statistics for key variables to gain insights.
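
As a small example, statistics can be computed per group rather than for the whole table. The `stats_df` frame and its brand labels are hypothetical.

```python
import pandas as pd

# Hypothetical brands and prices used only to illustrate grouped statistics.
stats_df = pd.DataFrame({
    "Brand": ["A", "A", "B", "B", "B"],
    "Sale Price": [10.0, 20.0, 5.0, 7.0, 9.0],
})

# Count, mean, median, min, and max of the sale price for each brand.
brand_stats = stats_df.groupby("Brand")["Sale Price"].agg(
    ["count", "mean", "median", "min", "max"]
)
print(brand_stats)
```
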
## Hypothesis Testing:
- Formulate hypotheses and conduct statistical tests to validate assumptions.
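
One possible approach, sketched with NumPy only, is a permutation test of whether two groups have the same mean price. The two groups below are hypothetical, and the permutation test is just one of several suitable tests.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sale prices for two brands.
group_a = np.array([10.0, 12.0, 11.5, 13.0, 12.5])
group_b = np.array([14.0, 15.5, 13.5, 16.0, 15.0])

# Observed absolute difference between the group means.
observed = abs(group_a.mean() - group_b.mean())
pooled = np.concatenate([group_a, group_b])

# Shuffle the pooled values many times and count how often a random split
# produces a difference at least as large as the observed one.
n_permutations = 5000
extreme = 0
for _ in range(n_permutations):
    shuffled = rng.permutation(pooled)
    diff = abs(shuffled[:len(group_a)].mean() - shuffled[len(group_a):].mean())
    if diff >= observed:
        extreme += 1

# Add-one correction keeps the estimated p-value strictly above zero.
p_value = (extreme + 1) / (n_permutations + 1)
print(p_value)
```

A small p-value suggests the difference in means is unlikely to arise from random grouping alone.
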
## Modeling:



In [None]:
from google.colab import drive
drive.mount('/content/drive')

Mounted at /content/drive


In [None]:
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

In [None]:
# Load the Walmart product-details sample from Google Drive.
walmart_dataset = pd.read_csv("/content/drive/MyDrive/Movie-Product-Recommendation /Datasets/Walmart/sdf/marketing_sample_for_walmart_com-product_details__20200101_20200331__30k_data.csv")
walmart_dataset

Unnamed: 0,Uniq Id,Crawl Timestamp,Product Url,Product Name,Description,List Price,Sale Price,Brand,Item Number,Gtin,Package Size,Category,Postal Code,Available
0,51b010b871cde349bd32159a1cc1a15f,2020-01-24 16:08:36 +0000,https://www.walmart.com/ip/Allegiance-Economy-...,Allegiance Economy Dual-scale Digital Thermometer,We aim to show you accurate product informati...,11.11,11.11,Cardinal Health,,707389636164,,Health | Medicine Cabinet | Thermometers | Dig...,,True
1,d6a7f100e44a626a3701804e99236ad6,2020-01-24 15:54:21 +0000,https://www.walmart.com/ip/Kenneth-Cole-Reacti...,Kenneth Cole Reaction Eau De Parfum Spray For ...,We aim to show you accurate product informati...,23.99,23.99,Kenneth Cole,,191565696101,,Premium Beauty | Premium Fragrance | Premium P...,,True
2,99d2b7da7e3e427a942f864937dacd9d,2020-01-24 18:34:28 +0000,https://www.walmart.com/ip/Kid-Tough-Fitness-I...,Kid Tough Fitness Inflatable Free-Standing Pun...,We aim to show you accurate product informati...,30.76,30.76,BONK FIT,563852139.0,855523007070,,Sports & Outdoors | Outdoor Sports | Hunting |...,,True
3,4c76d170c2c6a759cbce812d790a0b88,2020-01-24 11:08:53 +0000,https://www.walmart.com/ip/THE-FIRST-YEARS/167...,THE FIRST YEARS,We aim to show you accurate product informati...,6.99,6.99,The First Years,553299941.0,71463046263,,Baby | Diapering | Baby Wipes,,True
4,8ac95837dc8baa01e504fd8f633ffaf2,2020-03-10 07:37:21 +0000,https://www.walmart.com/ip/4-Pack-MD-USA-Seaml...,4 Pack - MD USA Seamless Toe-Wave-In Mesh Diab...,We aim to show you accurate product informatio...,28.27,28.27,MD USA,,191897514500,,Health | Diabetes Care | Diabetic Socks,,True
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
29995,08800cf3c19fa9d4c6f33e7b0fba0020,2020-01-24 19:35:05 +0000,https://www.walmart.com/ip/Inside-Groin-Cup-wi...,Inside Groin Cup with Supporter,We aim to show you accurate product informati...,6.95,6.95,Ace Martial Arts Supply,,615953380446,,Sports & Outdoors | Sports | Baseball Gear & E...,,True
29996,89a60fe65396e6371684ba68c98f07b1,2020-03-10 02:16:29 +0000,https://www.walmart.com/ip/Tone-Brothers-Drome...,"Tone Brothers Dromedary Pimientos, 2 oz",We aim to show you accurate product informatio...,14.82,14.82,Dromedary,,23709004018,,"Food | Meal Solutions, Grains & Pasta | Canned...",,True
29997,9ccabcb4452746a24f3638a78f17d920,2020-03-10 05:29:15 +0000,https://www.walmart.com/ip/Brittanies-Thyme-24...,Brittanies Thyme 242091 16 oz Organic Olive Oi...,We aim to show you accurate product informatio...,16.15,16.15,BRITTANIES THYME,,894077002483,,Personal Care | Bath & Body | Hand Soaps,,True
29998,6b5145dc769431b33fd2faa3f98e1760,2020-01-24 21:07:53 +0000,https://www.walmart.com/ip/30-Foot-Round-Bould...,30-Foot Round Boulder Swirl Unibead Above Grou...,We aim to show you accurate product informati...,414.99,414.99,SmartLine,,723815752161,,Toys | Outdoor Play | Swimming Pools & Spas | ...,,True


**Dropping columns that are not needed for this project**

In [None]:
# Drop the identifier, crawl-timestamp, and URL columns.
walmart_dataset = walmart_dataset.drop(columns=["Uniq Id", "Crawl Timestamp", "Product Url"])

In [None]:
walmart_dataset

Unnamed: 0,Product Name,Description,List Price,Sale Price,Brand,Item Number,Gtin,Package Size,Category,Postal Code,Available
0,Allegiance Economy Dual-scale Digital Thermometer,We aim to show you accurate product informati...,11.11,11.11,Cardinal Health,,707389636164,,Health | Medicine Cabinet | Thermometers | Dig...,,True
1,Kenneth Cole Reaction Eau De Parfum Spray For ...,We aim to show you accurate product informati...,23.99,23.99,Kenneth Cole,,191565696101,,Premium Beauty | Premium Fragrance | Premium P...,,True
2,Kid Tough Fitness Inflatable Free-Standing Pun...,We aim to show you accurate product informati...,30.76,30.76,BONK FIT,563852139.0,855523007070,,Sports & Outdoors | Outdoor Sports | Hunting |...,,True
3,THE FIRST YEARS,We aim to show you accurate product informati...,6.99,6.99,The First Years,553299941.0,71463046263,,Baby | Diapering | Baby Wipes,,True
4,4 Pack - MD USA Seamless Toe-Wave-In Mesh Diab...,We aim to show you accurate product informatio...,28.27,28.27,MD USA,,191897514500,,Health | Diabetes Care | Diabetic Socks,,True
...,...,...,...,...,...,...,...,...,...,...,...
29995,Inside Groin Cup with Supporter,We aim to show you accurate product informati...,6.95,6.95,Ace Martial Arts Supply,,615953380446,,Sports & Outdoors | Sports | Baseball Gear & E...,,True
29996,"Tone Brothers Dromedary Pimientos, 2 oz",We aim to show you accurate product informatio...,14.82,14.82,Dromedary,,23709004018,,"Food | Meal Solutions, Grains & Pasta | Canned...",,True
29997,Brittanies Thyme 242091 16 oz Organic Olive Oi...,We aim to show you accurate product informatio...,16.15,16.15,BRITTANIES THYME,,894077002483,,Personal Care | Bath & Body | Hand Soaps,,True
29998,30-Foot Round Boulder Swirl Unibead Above Grou...,We aim to show you accurate product informati...,414.99,414.99,SmartLine,,723815752161,,Toys | Outdoor Play | Swimming Pools & Spas | ...,,True


In [None]:
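# Drop the remaining columns not needed for the project, leaving the
# product name, prices, brand, and category.
walmart_dataset = walmart_dataset.drop(columns=["Postal Code", "Available", "Item Number", "Gtin", "Package Size", "Description"])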
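
An alternative to dropping columns one group at a time is to select the columns to keep. The sketch below uses a hypothetical one-row frame, `demo_df`, with a subset of the real column names, and shows that both approaches give the same result. Unlike the notebook's frame, this sample has no `Unnamed: 0` column.

```python
import pandas as pd

# Hypothetical one-row frame with a subset of the dataset's columns.
demo_df = pd.DataFrame({
    "Product Name": ["THE FIRST YEARS"],
    "List Price": [6.99],
    "Sale Price": [6.99],
    "Brand": ["The First Years"],
    "Category": ["Baby | Diapering | Baby Wipes"],
    "Postal Code": [None],
    "Available": [True],
})

# Option 1: name the columns to keep.
columns_to_keep = ["Product Name", "List Price", "Sale Price", "Brand", "Category"]
trimmed = demo_df[columns_to_keep].copy()

# Option 2: name the columns to drop.
dropped = demo_df.drop(columns=["Postal Code", "Available"])

print(trimmed.equals(dropped))
```

A keep-list tends to be easier to read when most columns are discarded, and it fails loudly if an expected column is missing.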

In [None]:
walmart_dataset

Unnamed: 0,Product Name,List Price,Sale Price,Brand,Category
0,Allegiance Economy Dual-scale Digital Thermometer,11.11,11.11,Cardinal Health,Health | Medicine Cabinet | Thermometers | Dig...
1,Kenneth Cole Reaction Eau De Parfum Spray For ...,23.99,23.99,Kenneth Cole,Premium Beauty | Premium Fragrance | Premium P...
2,Kid Tough Fitness Inflatable Free-Standing Pun...,30.76,30.76,BONK FIT,Sports & Outdoors | Outdoor Sports | Hunting |...
3,THE FIRST YEARS,6.99,6.99,The First Years,Baby | Diapering | Baby Wipes
4,4 Pack - MD USA Seamless Toe-Wave-In Mesh Diab...,28.27,28.27,MD USA,Health | Diabetes Care | Diabetic Socks
...,...,...,...,...,...
29995,Inside Groin Cup with Supporter,6.95,6.95,Ace Martial Arts Supply,Sports & Outdoors | Sports | Baseball Gear & E...
29996,"Tone Brothers Dromedary Pimientos, 2 oz",14.82,14.82,Dromedary,"Food | Meal Solutions, Grains & Pasta | Canned..."
29997,Brittanies Thyme 242091 16 oz Organic Olive Oi...,16.15,16.15,BRITTANIES THYME,Personal Care | Bath & Body | Hand Soaps
29998,30-Foot Round Boulder Swirl Unibead Above Grou...,414.99,414.99,SmartLine,Toys | Outdoor Play | Swimming Pools & Spas | ...


In [None]:
# walmart_dataset.to_csv("/content/drive/MyDrive/Movie-Product-Recommendation /Datasets/Cleaned Data/walmart.csv")

In [None]:
walmart_dataset["Category"]

0        Health | Medicine Cabinet | Thermometers | Dig...
1        Premium Beauty | Premium Fragrance | Premium P...
2        Sports & Outdoors | Outdoor Sports | Hunting |...
3                            Baby | Diapering | Baby Wipes
4                  Health | Diabetes Care | Diabetic Socks
                               ...                        
29995    Sports & Outdoors | Sports | Baseball Gear & E...
29996    Food | Meal Solutions, Grains & Pasta | Canned...
29997             Personal Care | Bath & Body | Hand Soaps
29998    Toys | Outdoor Play | Swimming Pools & Spas | ...
29999    Sports & Outdoors | Outdoor Sports | The Realt...
Name: Category, Length: 30000, dtype: object
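
The `Category` values are pipe-delimited paths, so one possible next step is to split them into separate hierarchy levels. The sketch below uses three category strings from the output above; the `categories` and `levels` names and the `Category Level N` column labels are assumptions. The pipe is escaped because pandas treats a multi-character split pattern as a regular expression.

```python
import pandas as pd

# Three category paths in the pipe-delimited format of the "Category" column.
categories = pd.Series([
    "Baby | Diapering | Baby Wipes",
    "Health | Diabetes Care | Diabetic Socks",
    "Personal Care | Bath & Body | Hand Soaps",
])

# Split on "|" with optional surrounding whitespace, one column per level.
levels = categories.str.split(r"\s*\|\s*", expand=True)
levels.columns = [f"Category Level {i + 1}" for i in range(levels.shape[1])]

print(levels["Category Level 1"].tolist())  # ['Baby', 'Health', 'Personal Care']
```

Rows with fewer levels than the deepest path are padded with missing values, so the number of columns follows the longest category path in the data.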