Skip to content

belalfarrag95/Sales-Data-Analysis-Python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🐍 Sales Data Analysis — Python & Google Colab

Monthly Sales Trend

A complete end-to-end sales data analysis project using Python — covering data cleaning, exploratory data analysis (EDA), and multi-dimensional visualizations across sales reps, regions, product categories, customer types, and payment methods.


📌 Project Overview

This project analyzes a sales dataset using Python in Google Colab, following a structured workflow from raw data ingestion to actionable business insights — producing 6 charts covering every key business dimension.

Detail Info
Tool Python (Google Colab)
Libraries Pandas · Matplotlib · Seaborn
Dataset Sales.csv — multi-rep, multi-region sales data
Dimensions Month · Region · Sales Rep · Product · Customer Type · Payment Method

📊 Analysis & Visualizations

1. Total Sales Amount Over Months

Monthly Trend Monthly sales trend showing growth from ~$250K in January to a peak of ~$450K in Oct–Nov, then a slight dip in December. Clear seasonal upward trend through Q3–Q4.


2. Total Sales by Region

Sales by Region Regional distribution is nearly equal across all 4 regions — North leads at 27.3%, followed by East (25.1%), West (24.6%), and South (23.0%). No dominant region — balanced market penetration.


3. Total Sales by Sales Representative

Sales by Rep David is the top-performing sales rep with ~$1.1M+ in total sales. Bob follows closely. Charlie is the lowest performer. All reps fall in the $850K–$1.1M range — a fairly competitive team.


4. Total Quantity Sold per Sales Rep

Quantity by Rep David also leads in quantity sold (~6,000 units), followed by Alice and Bob (~5,000). Charlie has the lowest quantity at ~4,200 units — consistent with his lower revenue performance.


5. Total Quantity Sold by Product Category

Quantity by Category Clothing leads product categories with ~7,000 units, followed closely by Furniture (~6,700). Food has the lowest quantity at ~5,500 units. All categories show strong demand.


6. Total Sales by Customer Type

Customer Type Nearly perfect split between Returning (50.1%) and New customers (49.9%) — indicating strong customer retention AND healthy new customer acquisition simultaneously.


7. Total Sales by Payment Method

Payment Method Credit Card leads at $1.76M, followed by Bank Transfer ($1.72M), with Cash slightly lower (~$1.54M). Digital payment methods dominate.


💡 Key Insights

  • 📈 Sales grow steadily from Jan to Nov — strong Q3/Q4 performance with a Dec dip
  • 🏆 David is the top rep in both revenue (~$1.1M) and quantity (~6,000 units)
  • 🌍 All 4 regions are nearly equal — no single dominant market
  • 👕 Clothing is the best-selling category by volume at ~7,000 units
  • 🔄 50/50 split between new and returning customers — excellent retention rate
  • 💳 Credit Card is the preferred payment method, but all 3 methods are well-distributed

🛠️ Tools & Libraries

import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
Library Usage
Pandas Data loading, cleaning, grouping, aggregation
Matplotlib Line chart, Donut charts, layout control
Seaborn Bar charts with viridis/magma color palettes

📁 Files in This Repo

File Description
Sales_Data.ipynb Full Jupyter Notebook with all code and outputs
Sales.csv Raw sales dataset
1.png6.png Output chart screenshots

🚀 How to Run

  1. Open Google Colab
  2. Upload Sales_Data.ipynb and Sales.csv
  3. Run all cells (RuntimeRun all)
  4. All 7 charts will render inline

👤 Author

Belal Farrag — Data Analyst

LinkedIn GitHub

About

End-to-end sales data analysis using Python & Pandas — EDA, visualizations across reps, regions, products & payment methods (Google Colab)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors