This project performs Exploratory Data Analysis (EDA) on the Superstore Sales Dataset to identify sales trends, profit patterns, and business insights.
The goal is to understand which factors drive revenue and profit โ such as region, product category, and discounts โ using Python data analysis tools.
- Source: Kaggle โ Superstore Dataset
- Attributes include:
- Order Date, Ship Date, Sales, Quantity, Discount, Profit
- Customer, Region, Segment, Category, Sub-Category
- Python
- Pandas, NumPy โ data cleaning & transformation
- Matplotlib, Seaborn โ data visualization
- Jupyter Notebook โ analysis & presentation
- Perform data cleaning and handle missing values
- Conduct descriptive analysis on sales and profit distribution
- Identify top-performing product categories and regions
- Analyze impact of discounts on profits
- Visualize sales and profit trends over time
- The West region contributed the highest overall profit.
- Technology category had the largest sales share.
- High discounts in Furniture led to lower profit margins.
- Monthly trends showed consistent spikes during year-end sales.
- ๐ฆ Bar chart of top 10 profitable sub-categories
- ๐ก๏ธ Heatmap of correlation between sales, profit, and discounts
- ๐ Line plot of sales over time
- ๐ฅง Pie chart of sales share by region
- Clone the repository
- Install dependencies
- Open the notebook
- Data cleaning & transformation
- Exploratory data analysis
- Business intelligence insights
- Visualization & storytelling
- Build interactive dashboard using Plotly or Streamlit
- Automate regional sales reports
- Predict profit margins using regression models
โญ If you found this project helpful, donโt forget to star the repo!