# How to think like a data scientist

1. **Clean the data**. Do not assume the data is clean.
2. **Normalize**. Let's say you're making a list of popular wedding destinations. You could count the number of people flying in for a wedding, but unless you consider the total number of air travellers coming to that city as well, you'll just get a list of cities with busy airports.
3. **Consider Outliers**. Blindly excluding outliers can be a mistake, and so can blindly including them. Outliers can hold important information (e.g. customers that use your product much more often than the average → leads to **qualitative insights**). But they are not good for **building a model**.
4. **Consider Seasonality**
5. **Consider Context**. Do not ignore size when reporting growth. E.g. when you are just starting your product, your dad signing up technically counts as doubling your user base.
6. **Avoid data vomit**. A dashboard is not of much use if you do not know where to look.
7. **Avoid metrics that cry wolf**. If you set alarm thresholds too low, the alarms become sensitive and "whiny", and you'll start to ignore them.
8. **Combine data from other sources**. Do not develop the "not collected here" syndrome; you can gain insights from mashing up data from different sources.
9. **Do not focus on noise**. We as humans see patterns everywhere, because we are "hardwired" like that. Do not develop **vanity metrics**; step back and look at the bigger picture.
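
The normalization point (item 2) can be made concrete with a small sketch. The city names and figures below are invented purely for illustration, and `wedding_share` is a hypothetical helper, not part of any library. It shows how ranking by raw counts and ranking by share of total travellers can disagree.

```python
# Hypothetical figures for illustration only; not real travel data.
arrivals = {
    "Big Hub": {"wedding": 5_000, "total": 1_000_000},
    "Small Island": {"wedding": 2_000, "total": 20_000},
}


def wedding_share(city_stats):
    """Fraction of arriving travellers who fly in for a wedding."""
    return city_stats["wedding"] / city_stats["total"]


# Ranking by raw counts favours the city with the busiest airport.
by_raw_count = sorted(
    arrivals, key=lambda city: arrivals[city]["wedding"], reverse=True
)

# Ranking by share normalizes for overall traffic.
by_share = sorted(
    arrivals, key=lambda city: wedding_share(arrivals[city]), reverse=True
)

print("By raw count:", by_raw_count)
print("By share:    ", by_share)
```

With these made-up numbers, the raw count puts the busy hub first, while the normalized share puts the small destination first.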

# OMTM: One Metric That Matters

# Map of (typical) KPIs by Business Goal
## Terminology
* **Measure**: Change we observe
* **Metric**: Measure we track over time
* **KPI**: An important metric
* **Analytics**: Measures that computers track (a subset of our metrics, and quite often not helpful)


## 🟢 Growth & Marketing
| KPI                       | What it Tells You                                | Formula / Measurement                          |
|----------------------------|--------------------------------------------------|-----------------------------------------------|
| Customer Acquisition Cost (CAC) | Efficiency of acquiring new customers.          | Marketing + Sales Spend ÷ New Customers        |
| Conversion Rate            | How well leads/visitors turn into customers.     | Conversions ÷ Visitors (or Leads)              |
| Activation Rate            | How many new users reach first meaningful use.   | Activated Users ÷ New Signups                  |
| Uplift / Incremental Lift  | Impact of campaigns/features (A/B tests).        | ΔConversion (Treatment − Control)              |
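
As a rough sketch, the formulas in the table above translate into a few lines of Python. The function names and the sample numbers are assumptions made for illustration; uplift is computed here as the absolute difference in conversion rates between treatment and control, without any significance testing.

```python
def cac(marketing_spend, sales_spend, new_customers):
    """Customer Acquisition Cost: total spend per new customer."""
    return (marketing_spend + sales_spend) / new_customers


def conversion_rate(conversions, visitors):
    """Share of visitors (or leads) that converted."""
    return conversions / visitors


def activation_rate(activated_users, new_signups):
    """Share of new signups that reached first meaningful use."""
    return activated_users / new_signups


def uplift(treat_conversions, treat_visitors, ctrl_conversions, ctrl_visitors):
    """Absolute difference in conversion rate: treatment minus control."""
    return (
        conversion_rate(treat_conversions, treat_visitors)
        - conversion_rate(ctrl_conversions, ctrl_visitors)
    )


# Hypothetical sample numbers.
print(f"CAC:        {cac(8_000, 2_000, 50):.2f}")
print(f"Conversion: {conversion_rate(50, 2_000):.2%}")
print(f"Activation: {activation_rate(120, 400):.2%}")
print(f"Uplift:     {uplift(60, 1_000, 50, 1_000):.2%} (percentage points)")
```

The helpers deliberately do not guard against a zero denominator; in practice you would decide whether an empty period should yield `0`, `None`, or an error.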


## 🔵 Product & Engagement
| KPI                        | What it Tells You                                | Formula / Measurement                          |
|-----------------------------|--------------------------------------------------|-----------------------------------------------|
| DAU / MAU                  | Usage level (daily / monthly activity).          | Unique Active Users per Day/Month              |
| DAU/MAU Ratio (Stickiness) | How often monthly users return daily.            | DAU ÷ MAU                                     |
| Time to Value (TTV)        | How quickly users see product value.             | Time between signup and first value action     |
| Cohort Retention           | Long-term engagement by user cohorts.            | Retention % by months since signup             |
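
The stickiness ratio can be sketched from a raw activity log. The table only says DAU ÷ MAU; the version below assumes the common convention of averaging DAU over all days of the month before dividing by MAU. The event list, user names, and the `stickiness` function are hypothetical.

```python
import calendar
from collections import defaultdict
from datetime import date

# Hypothetical activity log: (user_id, day on which the user was active).
events = [
    ("ann", date(2024, 1, 1)),
    ("ann", date(2024, 1, 2)),
    ("bob", date(2024, 1, 1)),
    ("cat", date(2024, 1, 3)),
]


def stickiness(events, year, month):
    """Average DAU over the month divided by MAU (assumed convention)."""
    daily_users = defaultdict(set)  # day -> unique users active that day
    monthly_users = set()           # unique users active in the month
    for user, day in events:
        if day.year == year and day.month == month:
            daily_users[day].add(user)
            monthly_users.add(user)
    if not monthly_users:
        return 0.0
    days_in_month = calendar.monthrange(year, month)[1]
    # Days without any activity count as DAU = 0.
    avg_dau = sum(len(users) for users in daily_users.values()) / days_in_month
    return avg_dau / len(monthly_users)


print(f"Stickiness for 2024-01: {stickiness(events, 2024, 1):.3f}")
```

Using sets ensures a user who is active several times on one day is only counted once for that day.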


## 🟡 Financial Performance
| KPI                        | What it Tells You                                | Formula / Measurement                          |
|-----------------------------|--------------------------------------------------|-----------------------------------------------|
| Revenue                    | Total income generated.                          | Sum of transactions                           |
| ARPU (Avg. Revenue per User)| Monetization efficiency per user.                | Revenue ÷ Active Users                        |
| MRR / ARR                  | Predictable subscription revenue.                | Monthly / Annual recurring subscription fees   |
| Gross Margin               | Profitability after direct costs.                | (Revenue − COGS) ÷ Revenue                    |
| Contribution Margin        | Unit economics after variable costs.             | (Revenue − Variable Costs) ÷ Revenue          |
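
The financial formulas above can be sketched the same way. The function names and numbers are illustrative assumptions; in particular, deriving ARR as 12 × MRR is a common simplification that is not stated in the table.

```python
def arpu(revenue, active_users):
    """Average revenue per active user."""
    return revenue / active_users


def gross_margin(revenue, cogs):
    """Share of revenue left after cost of goods sold."""
    return (revenue - cogs) / revenue


def contribution_margin(revenue, variable_costs):
    """Share of revenue left after variable costs."""
    return (revenue - variable_costs) / revenue


def arr_from_mrr(mrr):
    """Annualized recurring revenue, assuming ARR = 12 x MRR."""
    return mrr * 12


# Hypothetical monthly figures.
revenue, cogs, variable_costs, active_users = 10_000, 4_000, 5_500, 250

print(f"ARPU:                {arpu(revenue, active_users):.2f}")
print(f"Gross margin:        {gross_margin(revenue, cogs):.1%}")
print(f"Contribution margin: {contribution_margin(revenue, variable_costs):.1%}")
print(f"ARR (from MRR):      {arr_from_mrr(revenue)}")
```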


## 🔴 Customer Experience & Loyalty
| KPI                        | What it Tells You                                | Formula / Measurement                          |
|-----------------------------|--------------------------------------------------|-----------------------------------------------|
| Churn Rate                 | % of customers leaving.                          | Lost Customers ÷ Starting Customers            |
| Retention Rate             | % of customers staying.                          | 1 − Churn Rate                                |
| Customer Lifetime Value (CLV)| Long-term value of a customer.                   | ARPU × Avg. Lifetime (months/years)           |
| Net Promoter Score (NPS)   | Loyalty & advocacy (likelihood to recommend).    | %Promoters − %Detractors (survey 0–10 scale)  |
| Customer Satisfaction (CSAT)| Satisfaction with a specific experience (e.g., support call, checkout). | Avg. satisfaction rating ÷ Max rating (survey) |
| Customer Effort Score (CES)| How easy it was for the customer to achieve their goal. | Avg. ease-of-use score from survey (1–5 or 1–7 scale) |
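
A minimal sketch of the loyalty metrics follows. The NPS buckets use the standard survey convention (9–10 promoters, 0–6 detractors, 7–8 passives). Estimating average lifetime as 1 ÷ churn rate is a common approximation and an assumption here, not something the table prescribes; all sample values are made up.

```python
def churn_rate(lost_customers, starting_customers):
    """Share of customers lost during the period."""
    return lost_customers / starting_customers


def retention_rate(lost_customers, starting_customers):
    """Share of customers kept during the period."""
    return 1 - churn_rate(lost_customers, starting_customers)


def clv(arpu, avg_lifetime):
    """Customer lifetime value: ARPU times average lifetime (same time unit)."""
    return arpu * avg_lifetime


def nps(scores):
    """Net Promoter Score from 0-10 survey answers, in points (-100..100)."""
    promoters = sum(1 for s in scores if s >= 9)
    detractors = sum(1 for s in scores if s <= 6)
    return 100 * (promoters - detractors) / len(scores)


def csat(ratings, max_rating):
    """Average satisfaction rating as a share of the maximum rating."""
    return (sum(ratings) / len(ratings)) / max_rating


# Hypothetical sample values.
monthly_churn = churn_rate(50, 1_000)
# Assumption: average lifetime in months is approximated by 1 / monthly churn.
lifetime_months = 1 / monthly_churn

print(f"Churn:     {monthly_churn:.1%}")
print(f"Retention: {retention_rate(50, 1_000):.1%}")
print(f"CLV:       {clv(20, lifetime_months):.2f}")
print(f"NPS:       {nps([10, 9, 8, 7, 6, 3, 10, 9, 5, 10]):.0f}")
print(f"CSAT:      {csat([5, 4, 4, 3], 5):.0%}")
```

Note that NPS is reported in points rather than as a percentage, and that passives (7–8) affect the score only through the denominator.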