This project analyzes a fictional 6-month retail sales dataset (Customers, Orders, Products) to understand:
- Data Quality & Cleaning Steps
- Customer Behavior Toward Discounts
- Best-Performing Discounted Products
- Actionable Recommendations to Boost Sales
The study combines SQL queries, data cleaning, and profitability analysis to provide data-driven insights into the effectiveness of discounts across markets and products.
- Dataset: Dataset
- SQL → Data cleaning, Querying & segmentation
- Excel → Initial exploration
- Power BI → Dashboards
- Critical IDs (CustomerID, OrderNumber, ProductCode) must not be missing.
- Purpose: Prevents data loss in joins and tracking.
- Fix NetSales = UnitSales × DiscountedPrice
- Fix UnitSales (back-calculated from NetSales ÷ DiscountedPrice)
- Fix OriginalPrice using consistent ProductCode reference
- Fix GrossProfit using per-unit calculations
- Purpose: Restores missing financial values using business logic.
- Flagged negative/zero values in sales, prices, and profit.
- Deleted rows with all-zero financials.
- Purpose: Removes corrupted or meaningless records.
- Ensured DiscountedPrice ≤ OriginalPrice.
- Purpose: Correct mis-recorded discounts.
- Identified and removed duplicate orders.
- Purpose: Prevents inflated sales totals.
- EU: most discount-sensitive (51.8%).
- UK: nearly identical responsiveness (51.7%) but with the largest absolute discount sales (£155,859).
- US: least responsive (34.5%), despite a large customer base.

- Over 50% of customers in EU, UK, and Asia buy only discounted products.
- ROW & US: more price-insensitive shoppers (36–40% only-discount buyers).

- Discounts account for ~45–52% of sales in most regions.
- US lags, with just 29% of sales discount-driven.
- UK’s discount sales (£155.86K) are ~269x larger than ROW (£580).
- Gross Profit Margin (GPM): GrossProfit / NetSales → shows post-discount profitability.
- Discount Impact Ratio (DIR): GrossProfit / DiscountAmount → profit generated per unit of discount given.
- Calculated discount amounts and profitability metrics.
- Filtered to actual discounted products.
- Aggregated performance by product.
- Ranked products by:
- Gross Profit Margin
- Discount Impact Ratio
- Total Gross Profit
- GPM: 26.06%
- DIR: 3.42
- Insight: Premium positioning, discounts effectively boost sales.
- GPM: 25.5%
- Balanced discounts (9.27%) keep profitability high.
- GPM: 3.86% despite 10.12% discount.
- Insight: Discounts not translating to profit → poor ROI.

- Loyalty tiers with extra discounts.
- Flash sales + bundles (UK’s £155K discount sales support bundling).
- Test deeper short-term discounts.
- Try non-discount incentives (e.g., free shipping).
- Reduce discount reliance.
- Focus on experiential promotions (e.g., tastings, bundles without discount messaging).
- Feature Dalmore 12yo in campaigns.
- Test deeper discounts for Caol Ila 12yo.
- Bundle Loch Lomond 18yo + Dalmore 12yo as a premium set.
- Reduce Lagavulin 16yo discount depth (from 10.1% → 5–7%).
- Adjust Glenfiddich 18yo discounts (19.6% → 12–15%).
- Reposition Don Julio 1942 with premium branding and smaller discounts.
- Pilot launches in UK (high discount-driven sales).
- Extend EU strategies to ROW.
- Create US-specific bundles without heavy discounting.

- Covers only 6 months → misses seasonal effects (e.g., holidays, Black Friday).
- Limited visibility on customer lifecycle (long-term loyalty unknown).
- No external context (competitor pricing, macro trends).
- 12+ months of sales (to capture seasonality).
- Customer tenure & lifetime value.
- Competitor discounting patterns.
- Discounts strongly influence EU & UK markets, less so in the US.
- Dalmore 12yo and Caol Ila 12yo demonstrate discount-driven profitability.
- Poor ROI on heavily discounted products (Lagavulin, Glenfiddich) → need pricing revision.
- Balanced, market-specific discounting outperforms blanket strategies.



