# 00 — Problem Context and Modeling Scope

## Objective
Build a Marketing Mix Model (MMM) **from scratch** to understand its assumptions, implications, and limitations, and later compare it with established frameworks (e.g. Robyn, Meridian).

The primary goal is not immediate optimization, but **deep understanding of the model and the decisions it introduces**.


## Business Question
This model aims to answer the following question:

**What is the incremental impact of each marketing channel on weekly purchases (conversions) over time?**


## Intended Business Decisions

This MMM is intended to support:

- **Strategic budget allocation across channels**  
  (how much to invest in each channel over a medium-to-long-term horizon)
- **Scenario analysis**  
  (what happens if spend is increased or decreased in specific channels)


## Justification (intended use, not validation)

Regardless of the final model fit quality, MMMs are designed to support **strategic and portfolio-level decisions**, as they operate on aggregated time-series data and estimate average effects over time.

By construction, this type of model **is not suitable for short-term operational decisions**, such as:
- daily bidding,
- creative optimization,
- user- or campaign-level attribution.


## What This Model Is NOT Expected to Do

This model is **not intended to**:

- Predict the KPI accurately on a week-by-week basis.
- Capture short-term shocks or exceptional events.
- Provide causal attribution at the individual level.
- Replace experiments or incremental lift tests.

The value of MMM lies in explaining **structural and average incremental drivers**, not in operational forecasting.


## KPI and Problem Definition

### Target
Use **ALL_PURCHASES** as the main KPI.

- Metric: number of purchases
- Frequency: weekly
- Aggregation: sum
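
As a minimal sketch of the weekly aggregation, assuming the raw data arrives as daily rows: the `date` column name, the Monday-to-Sunday week definition, and the toy values below are illustrative assumptions, not part of the actual dataset.

```python
import pandas as pd

# Toy daily extract; column names and values are assumptions for illustration.
daily = pd.DataFrame({
    "date": pd.date_range("2024-01-01", periods=14, freq="D"),  # starts on a Monday
    "ALL_PURCHASES": [10, 12, 9, 11, 15, 20, 18, 8, 9, 12, 14, 13, 22, 19],
})

# Assign each day to a Monday-to-Sunday week, then sum purchases within the week.
week = daily["date"].dt.to_period("W-SUN")
weekly = daily.groupby(week)["ALL_PURCHASES"].sum()

print(weekly.tolist())  # [95, 97]
```

Whatever week definition is chosen, the same one should be applied to the spend series so that inputs and KPI are aligned.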


### Initial Channels
**Paid media spend** will be used as the primary marketing input, aggregated weekly.

Spend is a common MMM input because:
- it is directly linked to budget decisions,
- it represents a controllable business lever,
- it avoids introducing intermediate metrics at early modeling stages.


### Potential Control Variables

Control variables represent factors not directly controlled by marketing that may influence the KPI.

**Included from the start**
- Trend (time index)
- Seasonality (recurring yearly patterns, e.g. week-of-year effects; day-of-week patterns are not observable in weekly data)

**Potential (not initially included)**
- Promotions or discount periods
- Price changes
- Product launches
- Relevant external events / holidays
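
One possible way to build the trend and seasonality controls is a scaled time index plus a few Fourier terms with a yearly period. This is a sketch under assumptions: the function name `time_controls`, the number of harmonics, and the period of 52.18 weeks are illustrative choices, not decisions made in this document.

```python
import numpy as np

def time_controls(n_weeks: int, n_harmonics: int = 2, period: float = 52.18) -> np.ndarray:
    """Return an (n_weeks, 1 + 2 * n_harmonics) matrix: linear trend + yearly Fourier terms."""
    t = np.arange(n_weeks, dtype=float)
    columns = [t / max(n_weeks - 1, 1)]  # trend scaled to [0, 1]
    for k in range(1, n_harmonics + 1):
        angle = 2.0 * np.pi * k * t / period
        columns.append(np.sin(angle))  # k-th harmonic, sine part
        columns.append(np.cos(angle))  # k-th harmonic, cosine part
    return np.column_stack(columns)

controls = time_controls(n_weeks=104)  # two years of weekly data
print(controls.shape)  # (104, 5)
```

More harmonics allow sharper seasonal shapes but also give the baseline more freedom to absorb variation that might belong to media.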


### What Is Initially Excluded

- Clicks and impressions (kept for later exploratory analysis)
- Organic and direct traffic as “channels”  
  (they may be introduced later as control variables)


## Initial Modeling Assumptions

### Persistence of Effects (Adstock)
Marketing investments may have effects that persist over time and influence future purchases.

Without adstock, the model credits spend only in the week it occurs, so channels with delayed effects (typically upper-funnel) tend to be undervalued.
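
A minimal sketch of carryover, assuming a geometric adstock (one common form; this document has not yet fixed a specific one). The function name and the `decay` parameter are illustrative.

```python
import numpy as np

def geometric_adstock(spend, decay: float) -> np.ndarray:
    """Carry a fraction `decay` of last week's adstocked value into the current week."""
    if not 0.0 <= decay < 1.0:
        raise ValueError("decay must be in [0, 1)")
    spend = np.asarray(spend, dtype=float)
    out = np.zeros_like(spend)
    carry = 0.0
    for t, x in enumerate(spend):
        carry = x + decay * carry  # current spend plus decayed carryover
        out[t] = carry
    return out

# A single burst of spend keeps contributing in later weeks.
print(geometric_adstock([100, 0, 0, 0], decay=0.5))  # [100.   50.   25.   12.5]
```

With `decay=0` the transformation returns the raw spend, which makes the no-adstock baseline a special case of the same code path.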


### Diminishing Returns (Saturation)
The marginal impact of spend decreases as investment increases.

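Without saturation, the response to spend is linear and marginal returns are constant, so the model may recommend unrealistic allocations, such as allocating the entire budget to the single channel with the largest coefficient.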
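
A minimal sketch of diminishing returns, assuming a Hill-type curve (one of several possible saturation functions; the choice is not fixed here). The names `half_saturation` and `shape` are illustrative.

```python
import numpy as np

def hill_saturation(x, half_saturation: float, shape: float = 1.0) -> np.ndarray:
    """Map non-negative spend to [0, 1); equals 0.5 when x == half_saturation."""
    x = np.asarray(x, dtype=float)
    return x**shape / (x**shape + half_saturation**shape)

spend = np.array([0.0, 50.0, 100.0, 200.0])
response = hill_saturation(spend, half_saturation=100.0)
print(response.round(3))  # [0.    0.333 0.5   0.667]
```

With `shape <= 1` the curve is concave everywhere, so each extra unit of spend adds less than the previous one. With `shape > 1` it becomes S-shaped, with increasing returns at low spend before saturation sets in.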


### Approximately Linear Relationship After Transformations
After applying adstock and saturation, a linear relationship is assumed to approximate the KPI response.

This approach:
- preserves interpretability,
- captures core MMM dynamics,
- avoids unnecessary complexity in early stages.
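
The assumed functional form can be written as follows. The notation is introduced here for illustration only: $y_t$ is weekly purchases, $x_{c,t}$ is spend in channel $c$ in week $t$, $a_c$ is the adstock transformation, $s_c$ is the saturation function, $z_{k,t}$ are the control variables, and $\varepsilon_t$ is the error term.

$$
y_t = \beta_0 + \sum_{c=1}^{C} \beta_c \, s_c\big(a_c(x_c)_t\big) + \sum_{k=1}^{K} \gamma_k \, z_{k,t} + \varepsilon_t
$$

The model is linear in the coefficients $\beta_c$ and $\gamma_k$, while the non-linearity lives entirely in the transformations $a_c$ and $s_c$. Under this form, the estimated contribution of channel $c$ in week $t$ is $\beta_c \, s_c(a_c(x_c)_t)$.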


### Observational, Correlation-Based Model
Because the MMM is fitted to observational data, it estimates **associations**, not proven causal effects.

The model supports strategic decision-making, but **does not prove causality** without additional evidence (e.g. experiments or lift tests).


## Data Assumptions

### Parameter Stability Within the Modeling Window
Effects are assumed to be approximately stable over the analyzed period.

Major structural changes (ad platform algorithms, pricing, product, channel mix) may cause a single average effect to be misleading.


## Risks and Limitations

### Omitted Variable Bias
Unobserved factors (promotions, pricing, PR, stock-outs) may bias attribution.


### Multicollinearity
Channels may move together, making it difficult to reliably disentangle their individual effects.
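
One way to quantify this risk is the variance inflation factor (VIF), a standard diagnostic that this document does not otherwise prescribe. The sketch below uses synthetic data and hypothetical channel names; a VIF near 1 suggests little overlap with the other inputs, while large values indicate that a channel is almost a linear combination of the others.

```python
import numpy as np

def variance_inflation_factors(X) -> np.ndarray:
    """VIF_j = 1 / (1 - R^2_j), where R^2_j comes from regressing column j on the others."""
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    vifs = np.empty(p)
    for j in range(p):
        target = X[:, j]
        design = np.column_stack([np.ones(n), np.delete(X, j, axis=1)])
        coef, *_ = np.linalg.lstsq(design, target, rcond=None)
        residual = target - design @ coef
        r2 = 1.0 - (residual @ residual) / ((target - target.mean()) ** 2).sum()
        vifs[j] = 1.0 / max(1.0 - r2, 1e-12)  # guard against division by zero
    return vifs

# Synthetic example: "search" closely tracks "social"; "tv" is independent.
rng = np.random.default_rng(0)
social = rng.normal(size=200)
search = social + 0.1 * rng.normal(size=200)
tv = rng.normal(size=200)
vifs = variance_inflation_factors(np.column_stack([social, search, tv]))
```

In this synthetic setup the first two columns should show high VIFs and the third a value close to 1.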


### Functional Misspecification Risk
Incorrect choices of adstock parameters or saturation functions may distort estimated contributions and scenario recommendations.


### Short-Term Shocks
Sudden events may create spikes or drops not explained by the model and affect coefficient estimates.


### Sensitivity to Time Window and Validation
Results may vary depending on:
- the selected time period,
- the temporal validation strategy.
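
To make the validation strategy explicit, one option is an expanding-window split, where each fold trains on all weeks up to a cutoff and tests on the following block. This is a sketch under assumptions: the function name and the values of `initial_train` and `horizon` are illustrative, not choices made in this document.

```python
from typing import Iterator, Tuple

import numpy as np

def expanding_window_splits(
    n_obs: int, initial_train: int, horizon: int
) -> Iterator[Tuple[np.ndarray, np.ndarray]]:
    """Yield (train_idx, test_idx) pairs; training always precedes testing in time."""
    start = initial_train
    while start + horizon <= n_obs:
        yield np.arange(0, start), np.arange(start, start + horizon)
        start += horizon  # grow the training window by one test block

# Two years of weekly data, one year of initial training, quarterly test blocks.
splits = list(expanding_window_splits(n_obs=104, initial_train=52, horizon=13))
print(len(splits))  # 4
```

Comparing coefficients across folds gives a direct view of how sensitive the results are to the selected period.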


## Model Evaluation Criteria

The MMM will not be evaluated solely on fit metrics, but also on:

- Coefficient stability over time
- Sign and magnitude coherence
- Out-of-sample temporal behavior
- Ability to separate structural baseline from incremental impact
- Interpretability and economic plausibility


## Incremental Modeling Approach

The modeling approach is intentionally incremental to maintain control over assumptions and enable diagnosis:

1. Baseline linear regression (sanity checks)
2. Adstock (carryover effects)
3. Saturation (diminishing returns)
4. Regularization (Ridge) for coefficient stability
5. Time controls (trend and seasonality)
6. Temporal validation and diagnostics
7. Interpretation, limitations, and next steps
8. (Optional) semi-parametric models (e.g. GAM) and/or Bayesian formulation to quantify uncertainty
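
Steps 1 and 4 of the list above can be sketched with a closed-form Ridge fit, where `alpha=0` reduces to ordinary least squares. This is an illustrative NumPy implementation on synthetic data, not the project's final code; it leaves the intercept unpenalized and assumes the features are already on comparable scales.

```python
import numpy as np

def ridge_fit(X, y, alpha: float):
    """Return (intercept, coefficients) minimizing ||y - b0 - Xb||^2 + alpha * ||b||^2."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean  # centering keeps the intercept unpenalized
    gram = Xc.T @ Xc + alpha * np.eye(X.shape[1])
    coef = np.linalg.solve(gram, Xc.T @ yc)
    intercept = y_mean - x_mean @ coef
    return intercept, coef

# Synthetic, noise-free example with known coefficients.
rng = np.random.default_rng(1)
X = rng.normal(size=(100, 3))
y = 5.0 + X @ np.array([2.0, -1.0, 0.5])

b0_ols, coef_ols = ridge_fit(X, y, alpha=0.0)        # baseline linear regression
b0_ridge, coef_ridge = ridge_fit(X, y, alpha=100.0)  # coefficients shrunk toward zero
```

Running both fits side by side shows the trade-off behind step 4: a larger `alpha` stabilizes coefficients at the cost of biasing them toward zero.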
