# 8. Post-Launch Plan & Iteration Roadmap

This notebook outlines post-launch monitoring, iteration strategy, and continuous improvement plans for the deployed system.


## 1 Purpose

Once deployed, the goal of this system is not to be “perfect,” but to remain useful, trusted, and operationally safe as production conditions change.

Vehicle configurations evolve, software versions change, and production lines are never static. The post-launch plan focuses on keeping predictions reliable enough to plan with, while ensuring the system fails in predictable and manageable ways.

## 2 What we are monitoring

Model Health
	•	Track MAE and RMSE on recent production data
	•	Monitor errors separately for:
		•	Common configurations
		•	Rare and long-tail configurations (e.g. test durations above 130 s or 200 s)
	•	Watch for shifts in configuration patterns that weren’t present at training time

This ensures we catch degradation early, especially where mistakes are most costly.
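
As a rough illustration of the segment-level tracking described above, the sketch below computes MAE and RMSE per duration band. The band names, the function names, and the use of the 130 s and 200 s cut-offs as segment boundaries are assumptions made for this example, not part of the deployed pipeline.

```python
import math
from collections import defaultdict

# Assumed segment boundaries in seconds, borrowed from the thresholds above.
LONG_TAIL_S = 130.0
EXTREME_S = 200.0


def segment(actual_s):
    """Assign a test to a (hypothetical) segment by its observed duration."""
    if actual_s > EXTREME_S:
        return "extreme"
    if actual_s > LONG_TAIL_S:
        return "long_tail"
    return "common"


def segment_errors(actual, predicted):
    """Return {segment: {"n", "mae", "rmse"}} for paired durations in seconds."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    buckets = defaultdict(list)
    for a, p in zip(actual, predicted):
        buckets[segment(a)].append(p - a)
    report = {}
    for name, errs in buckets.items():
        n = len(errs)
        report[name] = {
            "n": n,
            "mae": sum(abs(e) for e in errs) / n,
            "rmse": math.sqrt(sum(e * e for e in errs) / n),
        }
    return report
```

Reporting each band separately keeps a small number of badly mispredicted long tests from being averaged away by the many common ones.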

Operational Impact
We measure success using operational outcomes, not just model metrics:
	•	Test bench idle time
	•	Frequency of last-minute rescheduling
	•	Stability of daily throughput
	•	Reduced idle runtime as a proxy for energy and CO₂ efficiency

If predictions are improving but planners still need to firefight, the system is not doing its job.

## 3 Iteration and retraining

Planned Updates
	•	Retrain:
		•	Monthly, or
		•	After ~1,000 new vehicles have passed through the test bench
	•	Incorporate new configuration flags or software variants as they appear

This keeps the model aligned with reality without introducing unnecessary churn.
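
The planned cadence can be expressed as a single check. This is a minimal sketch: the 30-day approximation of "monthly", the constant names, and the function name are assumptions made for illustration.

```python
from datetime import date, timedelta

# Assumed encodings of the plan: "monthly" approximated as 30 days,
# and a volume threshold of about 1,000 newly tested vehicles.
MAX_MODEL_AGE = timedelta(days=30)
MAX_NEW_VEHICLES = 1000


def scheduled_retrain_due(last_trained, today, new_vehicles):
    """True when either the age condition or the volume condition is met."""
    too_old = (today - last_trained) >= MAX_MODEL_AGE
    enough_data = new_vehicles >= MAX_NEW_VEHICLES
    return too_old or enough_data
```

For example, `scheduled_retrain_due(date(2024, 1, 1), date(2024, 1, 15), 1000)` is true because the volume threshold is reached even though the model is only two weeks old.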

Retraining Triggers
We retrain earlier if:
	•	Prediction error (MAE or RMSE) rises more than ~10% above its baseline
	•	Errors concentrate in rare or high-duration configurations
	•	Input data distributions shift meaningfully

This avoids waiting for visible operational damage before acting.
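
Two of these triggers lend themselves to a simple automated check. The sketch below flags a relative MAE increase above 10% and uses the Population Stability Index (PSI) as one possible measure of input shift. PSI and its 0.2 threshold are assumptions chosen for illustration (0.2 is a commonly quoted rule of thumb); the plan itself does not prescribe a drift metric, and all function names are hypothetical.

```python
import math
from collections import Counter


def relative_increase(baseline_mae, recent_mae):
    """Fractional change of recent error relative to the baseline."""
    return (recent_mae - baseline_mae) / baseline_mae


def psi(reference, recent, eps=1e-6):
    """Population Stability Index over categorical values (e.g. config codes)."""
    ref_counts, new_counts = Counter(reference), Counter(recent)
    total = 0.0
    for category in set(ref_counts) | set(new_counts):
        # eps keeps categories unseen on one side from producing log(0).
        p = max(ref_counts[category] / len(reference), eps)
        q = max(new_counts[category] / len(recent), eps)
        total += (q - p) * math.log(q / p)
    return total


def early_retrain_reasons(baseline_mae, recent_mae, reference_configs,
                          recent_configs, max_rel_increase=0.10,
                          psi_threshold=0.2):
    """Return the list of triggers that fired (empty if none)."""
    reasons = []
    if relative_increase(baseline_mae, recent_mae) > max_rel_increase:
        reasons.append("error_increase")
    if psi(reference_configs, recent_configs) > psi_threshold:
        reasons.append("input_drift")
    return reasons
```

The remaining trigger, errors concentrating in rare or high-duration configurations, would need per-segment error tracking and is not covered by this sketch.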

## 4 Safety Nets and Rollback

Rollback Strategy
	•	Always keep the last stable model available
	•	Automatically fall back if:
		•	Error spikes
		•	Prediction latency degrades
		•	Monitoring detects abnormal behaviour
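
One way to picture the automatic fallback is a small policy object that inspects a health snapshot and picks which model to serve. Everything here is a hypothetical sketch: the class names, the fields, and the threshold values (a 1.5x error ratio and a 200 ms p95 latency limit) are placeholders, not measured or agreed limits.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HealthSnapshot:
    recent_mae_s: float      # MAE on recent production data, in seconds
    baseline_mae_s: float    # MAE recorded for the last stable model
    p95_latency_ms: float    # 95th-percentile prediction latency
    anomaly_detected: bool   # set by whatever monitoring is in place


@dataclass(frozen=True)
class RollbackPolicy:
    max_error_ratio: float = 1.5        # placeholder threshold
    max_p95_latency_ms: float = 200.0   # placeholder threshold

    def reasons(self, health):
        """List the rollback conditions that currently hold."""
        found = []
        if health.recent_mae_s > self.max_error_ratio * health.baseline_mae_s:
            found.append("error_spike")
        if health.p95_latency_ms > self.max_p95_latency_ms:
            found.append("latency")
        if health.anomaly_detected:
            found.append("abnormal_behaviour")
        return found


def select_model(active, last_stable, health, policy):
    """Serve the last stable model if any rollback condition holds."""
    reasons = policy.reasons(health)
    return (last_stable if reasons else active), reasons
```

Returning the reasons alongside the chosen model makes each fallback traceable in later reviews.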

Operational Safeguards
	•	Predictions are presented as estimates, not promises
	•	Low-confidence cases are clearly flagged
	•	Human planners retain override control for critical sequencing decisions

The system supports decision-making; it does not replace it.

## 5 Lifecycle Workflow

End-to-End Flow
Deployment → Monitoring → Iteration → Retraining → Review

This creates a tight feedback loop between:
	•	What the model predicts
	•	What actually happens on the test bench
	•	How planning decisions are affected

Each retraining cycle feeds real production outcomes back into the model.

## 6 Why this Matters

Even small improvements in prediction stability can lead to:
	•	Better test bench utilization
	•	Fewer reactive changes
	•	Lower energy waste
	•	Measurable CO₂ efficiency gains at scale

The model is a lever for operational efficiency, not an academic exercise.
Post-launch success is defined by planner trust, stability, and sustained impact, not just accuracy scores.
