# Data Validation

The analysis was performed on the provided product_sales.csv dataset, containing approximately 15,000 sales records.

## Missing Revenue Data:
1,074 records had missing 'revenue' values. These records were removed from the dataset to ensure the accuracy of financial calculations.
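The removal step can be sketched as follows. This is a minimal illustration using only the Python standard library. It assumes missing revenue appears as an empty field, and the inline sample rows and the `customer_id` column are hypothetical stand-ins for product_sales.csv.

```python
import csv
import io

# Hypothetical sample standing in for product_sales.csv (not real data).
SAMPLE_CSV = """week,sales_method,customer_id,nb_sold,revenue
1,Email,a1,10,98.50
1,Call,b2,7,
2,Email + call,c3,12,185.20
"""

def drop_missing_revenue(rows):
    """Keep only the rows whose 'revenue' field is non-empty."""
    return [row for row in rows if (row.get("revenue") or "").strip() != ""]

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
clean_rows = drop_missing_revenue(rows)
print(f"removed {len(rows) - len(clean_rows)} of {len(rows)} records")
```

If the real file encodes missing values differently (for example as `NA`), the emptiness test would need to be adjusted accordingly.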

## Sales Method Standardization:
The 'sales_method' column was cleaned to ensure consistency. Specifically, 'Em + Call' was standardized to 'Email + call' to consolidate all instances of this combined approach. The final sales methods analyzed are: **'Email', 'Call', and 'Email + call'.**

![Count of sales methods before cleaning](count_sm_before.png)
![Count of sales methods after cleaning](count_sm_after.png)
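One way to express this standardization is a small lookup table applied to every label. The sketch below is illustrative only: the mapping contains just the variant named in this report, and the sample labels are hypothetical.

```python
from collections import Counter

# Variant label -> canonical label. Only the variant named in the report is listed.
CANONICAL = {"Em + Call": "Email + call"}

def standardize_method(label):
    """Trim whitespace and map known variants to their canonical label."""
    cleaned = label.strip()
    return CANONICAL.get(cleaned, cleaned)

# Hypothetical raw labels, not taken from the dataset.
raw_methods = ["Email", "Em + Call", "Call", "Email + call", "Email"]
standardized = [standardize_method(m) for m in raw_methods]
counts = Counter(standardized)
print(counts)
```

Comparing the counts before and after the mapping is a simple way to confirm that only the intended labels were merged.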

## Data types:
All relevant columns ('revenue', 'week', 'nb_sold', 'years_as_customer', 'nb_site_visits') were confirmed to be in appropriate numerical formats.
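A type check of this kind could look like the sketch below, which tries to parse every value in each column as a number. The helper name `non_numeric_columns` and the sample rows are hypothetical, and the rows are assumed to be dictionaries of strings as produced by a CSV reader.

```python
NUMERIC_COLUMNS = ["revenue", "week", "nb_sold", "years_as_customer", "nb_site_visits"]

def non_numeric_columns(rows, columns=NUMERIC_COLUMNS):
    """Return the columns with at least one value that does not parse as a number."""
    bad = []
    for col in columns:
        try:
            for row in rows:
                float(row[col])
        except (KeyError, TypeError, ValueError):
            bad.append(col)
    return bad

# Hypothetical rows for illustration only.
good_rows = [{"revenue": "98.5", "week": "1", "nb_sold": "10",
              "years_as_customer": "3", "nb_site_visits": "25"}]
bad_rows = good_rows + [{"revenue": "n/a", "week": "2", "nb_sold": "8",
                         "years_as_customer": "1", "nb_site_visits": "20"}]

print(non_numeric_columns(good_rows))
print(non_numeric_columns(bad_rows))
```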

# Exploratory Data Analysis

## 1. Distribution of Revenue:
The histogram shows that revenue is right-skewed, with a concentration of sales at lower revenue values. There is a long tail extending towards higher revenue, indicating some high-value sales, but they are less frequent. This suggests that most transactions generate modest revenue, with fewer but significant high-value transactions.

![Distribution of Revenue](dist_revenue.png)

## 2. Count of Sales Methods:
The bar chart shows the distribution of sales methods after standardization. 'Email' is the most frequently used sales method, followed by 'Call'. 'Email + call' is the least used of the three, though it still accounts for a substantial number of sales, which indicates its current operational scale.

![Distribution of Sales Method](dist_sm.png)

## 3. Average Revenue by Sales Method:
This bar plot compares the average revenue generated by each sales method. 'Email + call' shows the highest average revenue, well above both 'Email' and 'Call' individually. On a per-transaction basis, the **'Email + call'** method therefore appears to be the most effective.

![Average Revenue by Sales Method](avg_revenue_by_sm.png)
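The comparison behind this plot amounts to grouping revenue by sales method and averaging each group. A minimal sketch, using hypothetical (method, revenue) pairs rather than the actual dataset:

```python
from collections import defaultdict
from statistics import mean

# Hypothetical (sales_method, revenue) pairs, not values from the dataset.
sales = [
    ("Email", 95.0), ("Email", 99.0),
    ("Call", 45.0), ("Call", 51.0),
    ("Email + call", 180.0), ("Email + call", 188.0),
]

def mean_revenue_by_method(records):
    """Group revenue by sales method and return the mean of each group."""
    grouped = defaultdict(list)
    for method, revenue in records:
        grouped[method].append(revenue)
    return {method: round(mean(values), 2) for method, values in grouped.items()}

averages = mean_revenue_by_method(sales)
best_method = max(averages, key=averages.get)
print(averages, best_method)
```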

## 4. How many customers were there for each approach?

|Sales Method | Number of Unique Customers |
|---|---|
|Call | 4781 |
|Email | 6922 |
|Email + call | 2223 |

The *'Email'* method had the most unique customers, followed by *'Call'* and then *'Email + call'*.
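Counting unique customers per method can be sketched by collecting customer identifiers into one set per method. The identifier field (called `customer_id` here) and the sample records are assumptions made for illustration.

```python
from collections import defaultdict

def unique_customers_by_method(records):
    """Count distinct customer ids per sales method."""
    seen = defaultdict(set)
    for method, customer_id in records:
        seen[method].add(customer_id)
    return {method: len(ids) for method, ids in seen.items()}

# Hypothetical (sales_method, customer_id) pairs; "c1" appears twice on purpose.
records = [
    ("Email", "c1"), ("Email", "c2"), ("Email", "c1"),
    ("Call", "c3"),
    ("Email + call", "c4"),
]
customer_counts = unique_customers_by_method(records)
print(customer_counts)
```

Using a set means a customer who appears in several rows for the same method is counted once.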

## 5. Overall Revenue Spread
The overall revenue from sales records ranges from USD 32.54 to USD 238.32, with an average of USD 93.93 and a median of USD 89.50. The mean sitting above the median is consistent with the right skew noted earlier: most sales are for modest amounts, while a smaller number of high-value transactions pull the average upward.

![Overall Revenue Spread](overall_rev_spread.png)
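Summary figures like these can be reproduced with the standard `statistics` module. The sketch below uses a short hypothetical list of revenues and flags the mean-above-median pattern that hints at right skew.

```python
import statistics

def summarize(values):
    """Return min, max, mean and median of a list of revenues."""
    return {
        "min": min(values),
        "max": max(values),
        "mean": round(statistics.mean(values), 2),
        "median": round(statistics.median(values), 2),
    }

# Hypothetical revenues for illustration only.
revenues = [40.0, 50.0, 90.0, 95.0, 100.0, 185.0]
summary = summarize(revenues)
mean_above_median = summary["mean"] > summary["median"]
print(summary, mean_above_median)
```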

### Revenue Spread for each Sales Method

|Sales Method | Count | Mean Revenue | Std. Dev. | Min Revenue | 25% Quartile | Median Revenue | 75% Quartile | Max Revenue |
|---|---|---|---|---|---|---|---|---|
|Call | 4781 | $47.60 | $8.61 | $32.54 | $41.47 | $49.07 | $52.68 | $71.36 |
|Email | 6922 | $97.13 | $11.21 | $78.83 | $87.88 | $95.58 | $105.17 | $148.97 |
|Email + call | 2223 | $183.65 | $29.08 | $122.11 | $155.78 | $184.74 | $191.11 | $238.32 |

- The 'Email + call' method clearly generates the highest average revenue per sale, at $183.65, well above both 'Email' ($97.13) and 'Call' ($47.60).
- The 'Call' method shows a much tighter spread and lower revenue values, suggesting it might be used for lower value transactions or less engaged customers.
- 'Email' is a mid-range performer in terms of average revenue.

![Revenue spread for sales method](rev_spread_by_sm.png)
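As a consistency check, the per-method counts and means in the table can be combined into a count-weighted mean and compared with the overall average of 93.93 reported earlier. The sketch below uses only figures from this report.

```python
# (count, mean revenue) per sales method, copied from the table above.
per_method = {
    "Call": (4781, 47.60),
    "Email": (6922, 97.13),
    "Email + call": (2223, 183.65),
}

total_records = sum(count for count, _ in per_method.values())
weighted_mean = sum(count * avg for count, avg in per_method.values()) / total_records

print(total_records)            # 13926
print(round(weighted_mean, 2))  # 93.94
```

The result lands within about a cent of the reported overall mean, which is the kind of gap expected when the per-method means are rounded to two decimals. The record total of 13,926 also matches roughly 15,000 original records minus the 1,074 removed for missing revenue.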

## 6. Revenue Trends Over Time
An analysis of average revenue by sales methods across weeks reveals consistent patterns:

- The 'Email + call' method consistently maintains the highest average revenue across all weeks where it is present.
- The relative performance of the 'Email' and 'Call' methods also remains generally consistent over time, with 'Email' outperforming 'Call' in average revenue per transaction.
- No significant anomalies or sudden shifts in revenue performance were observed for any method across the provided weeks, indicating stable operational patterns for each approach.

![Average Revenue Over Time](avg_revenue_ot.png)
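The weekly comparison can be sketched by grouping on the (week, method) pair and then picking the top method in each week. The records below are hypothetical and only illustrate the mechanics.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical (week, sales_method, revenue) records, not values from the dataset.
rows = [
    (1, "Email", 90.0), (1, "Call", 40.0), (1, "Email + call", 150.0),
    (2, "Email", 96.0), (2, "Call", 44.0),
    (2, "Email + call", 170.0), (2, "Email + call", 174.0),
]

def weekly_mean_revenue(records):
    """Mean revenue for each (week, sales_method) pair."""
    grouped = defaultdict(list)
    for week, method, revenue in records:
        grouped[(week, method)].append(revenue)
    return {key: round(mean(values), 2) for key, values in grouped.items()}

def top_method_per_week(weekly_means):
    """For each week, the method with the highest mean revenue."""
    best = {}
    for (week, method), value in weekly_means.items():
        if week not in best or value > weekly_means[(week, best[week])]:
            best[week] = method
    return best

weekly = weekly_mean_revenue(rows)
leaders = top_method_per_week(weekly)
print(weekly)
print(leaders)
```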

## 7. Customer Characteristics by Sales Method
To understand why certain methods perform well, we examined the average 'years as customer' and 'number of site visits' for customers engaged by each method:

|Sales Method | Avg. Years as Customer | Avg. Site Visits |
|---|---|---|
|Call | 5.16 | 24.42 |
|Email | 5 | 24.73 |
|Email + call | 4.53 | 26.74 |

- 'Email + call' customers: While they have slightly fewer years as customers on average compared to 'Call' customers, they exhibit the highest average number of site visits. This suggests that the combined 'Email + call' approach might be particularly effective with customers who are more actively engaged with the website or business. This higher engagement could be a factor contributing to the higher revenue per sale.
- 'Email' customers: This group is aligned with the overall average for both years as customer and site visits, representing a broad customer base that responds well to scalable email campaigns.
- 'Call' customers: These customers have the longest average tenure but the lowest average site visits. This could imply that direct calls are necessary for customers who are less digitally active or require more personal interaction, potentially for lower-value or more complex needs that don't translate to higher online engagement.


# Recommended Metric for Monitoring

### Metric: Average Revenue Per Sale for the 'Email + call' Sales Method.

This metric measures the average amount of money generated from each transaction that utilized the 'Email + call' method.

It directly quantifies the effectiveness and profitability of our most promising sales approach. Monitoring this metric will allow the team to track the financial return on their efforts for this specific strategy.

**Initial Value (Based on current data):** $183.65

### How to monitor it
1. Trend Analysis: Track this metric weekly/monthly to observe growth, stability, or decline.
2. Comparative Analysis: Regularly compare its performance against the other sales methods to benchmark its effectiveness.
3. Target Setting: Establish clear revenue targets for this method (e.g., aiming for $190 per sale by next quarter).
4. Segmentation: Analyze the metric by customer segment (e.g., new vs. existing, by state) to identify specific areas of strength.
5. Feedback: Use changes in this metric to inform adjustments to sales scripts, training, or resource allocation for the 'Email + call' team.
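The trend and target steps above could be tracked with something like the following sketch. It computes the metric per week for 'Email + call', the week-over-week change, and the gap to the example $190 target. The records and helper names are hypothetical.

```python
from statistics import mean

TARGET = 190.0           # example target taken from the Target Setting step
METHOD = "Email + call"

def weekly_metric(records, method=METHOD):
    """Average revenue per sale for one sales method, keyed by week."""
    by_week = {}
    for week, sales_method, revenue in records:
        if sales_method == method:
            by_week.setdefault(week, []).append(revenue)
    return {week: round(mean(values), 2) for week, values in sorted(by_week.items())}

def week_over_week_change(metric):
    """Difference between each week's value and the previous week's value."""
    weeks = list(metric)
    return {
        weeks[i]: round(metric[weeks[i]] - metric[weeks[i - 1]], 2)
        for i in range(1, len(weeks))
    }

# Hypothetical (week, sales_method, revenue) records for illustration only.
records = [
    (1, "Email + call", 180.0), (1, "Email + call", 184.0), (1, "Email", 97.0),
    (2, "Email + call", 186.0), (2, "Email + call", 190.0),
]

metric = weekly_metric(records)
change = week_over_week_change(metric)
gap_to_target = {week: round(TARGET - value, 2) for week, value in metric.items()}
print(metric, change, gap_to_target)
```

The same function can be pointed at another method through the `method` argument to support the comparative analysis step.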

# Recommendations

The Analytics team strongly recommends that the sales team continue to use and expand the 'Email + call' sales method. Although it may be more time-consuming, its considerably higher average revenue per sale ($183.65) demonstrates its superior effectiveness in generating value per transaction. Investing in and optimizing this approach, including potentially providing additional training and resources, will likely yield the highest return on sales effort.

The 'Email' method remains valuable for its wide reach and moderate revenue generation, serving as an efficient tool for a broad customer base. The 'Call' method, while necessary for some customers, should be reviewed for its efficiency and for whether it is being applied to the most appropriate customer segments, given its lower average revenue.

