# SARIMAX Sick Leave Predictor (C Manufacturing)

This model card is one of a series covering different sectors, including G Trade and Q Healthcare. Each card documents the parameters, performance, and considerations for applying the SARIMAX model to a single sector.

## Model Card Version 1.0

---

## 🧪 Model Description
The SARIMAX Sick Leave Predictor is a time series forecasting model that predicts quarterly sick leave percentages for the **C Manufacturing sector** in the Netherlands. It combines historical sick leave data from the CBS (Statistics Netherlands) dataset with exogenous variables to forecast sick leave through Q3 2024.

The model accounts for **COVID-19 outliers** and makes specific adjustments for **Q1** to handle post-pandemic recovery trends.

---

## 🌆 Intended Use
The model is intended to:
- Help the **UWV** (Employee Insurance Agency) better predict future sick leave percentages in the manufacturing sector.
- Assist manufacturing companies in planning staffing levels and managing workloads effectively.

---

## 🐲 Model Architecture
- **Model Type**: SARIMAX (Seasonal AutoRegressive Integrated Moving Average with eXogenous factors)
- **Order Parameters**: `(0, 1, 1)`
- **Seasonal Order Parameters**: `(0, 1, 2, 4)`
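
For readers less familiar with the notation, the orders listed above can be written out as an equation. This is a sketch that assumes the common "regression with SARIMA errors" formulation and the plus-sign convention for moving-average terms; the card does not state which formulation or library was used. Here $y_t$ is the sick leave percentage in quarter $t$, $x_t$ the exogenous variables, $L$ the lag operator, and $\varepsilon_t$ white noise:

$$
y_t = \beta^\top x_t + u_t, \qquad (1 - L)(1 - L^4)\,u_t = (1 + \theta_1 L)\,(1 + \Theta_1 L^4 + \Theta_2 L^8)\,\varepsilon_t
$$

The factor $(1 - L)$ is the single regular difference, $(1 - L^4)$ the single seasonal difference at a period of four quarters, and the right-hand side holds one regular and two seasonal moving-average terms. There are no autoregressive terms because both autoregressive orders are zero.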

The model produces **rolling forecasts** quarter by quarter and is fit on a **rolling window of the most recent 5 years**, so that predictions reflect recent conditions rather than the full history.
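
The rolling scheme described above can be sketched as follows. This is a minimal illustration, not the project's code: the function names, the one-step-ahead horizon, and the 20-quarter window (5 years × 4 quarters) are assumptions, and `seasonal_naive` is only a stand-in forecaster so the example runs without extra dependencies. In practice a fitted SARIMAX model would take its place.

```python
from typing import Callable, List, Sequence

WINDOW_QUARTERS = 20  # assumption: 5 years x 4 quarters


def seasonal_naive(history: Sequence[float]) -> float:
    """Stand-in forecaster: repeat the value from the same quarter one year earlier."""
    return history[-4]


def rolling_forecast(
    series: Sequence[float],
    n_test: int,
    forecaster: Callable[[Sequence[float]], float] = seasonal_naive,
    window: int = WINDOW_QUARTERS,
) -> List[float]:
    """Forecast each of the last `n_test` quarters one step ahead.

    Each forecast sees only the `window` quarters immediately before its
    target quarter, so the training data slides forward with every step.
    """
    if n_test < 1 or len(series) - n_test < window:
        raise ValueError("need at least `window` quarters before the first test quarter")

    forecasts: List[float] = []
    for t in range(len(series) - n_test, len(series)):
        train = series[t - window:t]  # the most recent `window` quarters only
        forecasts.append(forecaster(train))
    return forecasts
```

Any callable that maps a training window to a single number can be passed as `forecaster`, which keeps the windowing logic separate from the model being evaluated.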

---

## 📊 Evaluation
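The model was evaluated using the **Mean Absolute Error (MAE)** for each forecast quarter. MAE is expressed in the same unit as the target, so the values below are in percentage points of sick leave.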

| **Year**   | **Quarter** | **MAE**  |
|------------|-------------|----------|
| 2022       | Q1          | 0.1347   |
| 2022       | Q2          | 0.0556   |
| 2022       | Q3          | 0.0604   |
| 2022       | Q4          | 0.1171   |
| 2023       | Q1          | 0.1555   |
| 2023       | Q2          | 0.4180   |
| 2023       | Q3          | 0.1086   |
| 2023       | Q4          | 0.0338   |
| 2024       | Q1          | 0.6434   |
| 2024       | Q2          | 0.0084   |
| 2024       | Q3          | 0.1245   |
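
As a reading aid, the sketch below shows how MAE is computed and summarises the table per year. The per-quarter values are copied from the table above; the function names and the per-year averaging are illustrative additions, not part of the original evaluation.

```python
from statistics import mean
from typing import Dict, List, Sequence, Tuple


def mean_absolute_error(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Average of |actual - predicted| over paired observations."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return mean(abs(a - p) for a, p in zip(actual, predicted))


# Per-quarter MAE values copied from the evaluation table.
QUARTERLY_MAE: Dict[Tuple[int, str], float] = {
    (2022, "Q1"): 0.1347, (2022, "Q2"): 0.0556,
    (2022, "Q3"): 0.0604, (2022, "Q4"): 0.1171,
    (2023, "Q1"): 0.1555, (2023, "Q2"): 0.4180,
    (2023, "Q3"): 0.1086, (2023, "Q4"): 0.0338,
    (2024, "Q1"): 0.6434, (2024, "Q2"): 0.0084,
    (2024, "Q3"): 0.1245,
}


def mean_mae_by_year(table: Dict[Tuple[int, str], float]) -> Dict[int, float]:
    """Plain average of the tabled quarterly MAE values for each year."""
    by_year: Dict[int, List[float]] = {}
    for (year, _quarter), value in table.items():
        by_year.setdefault(year, []).append(value)
    return {year: mean(values) for year, values in by_year.items()}


YEARLY_MAE = mean_mae_by_year(QUARTERLY_MAE)
WORST_QUARTER = max(QUARTERLY_MAE, key=QUARTERLY_MAE.get)
BEST_QUARTER = min(QUARTERLY_MAE, key=QUARTERLY_MAE.get)
```

Averaged this way, the error is roughly 0.092 for 2022, 0.179 for 2023, and 0.259 for the three available quarters of 2024. The largest single errors are in 2024 Q1 (0.6434) and 2023 Q2 (0.4180).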

---

## 🔧 Hyperparameters
- **Order**: `(1, 1, 1)`
- **Seasonal Order**: `(1, 1, 2, 4)`
- **Seasonal Order Q2**: `(0, 1, 1, 4)`

The model applies specific adjustments to **Q1 2022** to handle **COVID-19-related anomalies**.
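
One way to read the hyperparameters in this section is that Q2 forecasts use their own seasonal order while all other quarters use the default. The sketch below encodes that reading; the interpretation, the constant names, and the helper `seasonal_order_for` are assumptions rather than the project's actual configuration.

```python
from typing import Tuple

SeasonalOrder = Tuple[int, int, int, int]

# Values copied from the hyperparameter list in this section.
ORDER: Tuple[int, int, int] = (1, 1, 1)
DEFAULT_SEASONAL_ORDER: SeasonalOrder = (1, 1, 2, 4)
Q2_SEASONAL_ORDER: SeasonalOrder = (0, 1, 1, 4)


def seasonal_order_for(quarter: int) -> SeasonalOrder:
    """Return the seasonal order for a target quarter (1-4).

    Assumption: the Q2-specific order applies only when the quarter
    being forecast is Q2; every other quarter uses the default.
    """
    if quarter not in (1, 2, 3, 4):
        raise ValueError("quarter must be 1, 2, 3, or 4")
    return Q2_SEASONAL_ORDER if quarter == 2 else DEFAULT_SEASONAL_ORDER
```

If the model is built with statsmodels, which the card does not state, these tuples would map onto the `order` and `seasonal_order` arguments of its `SARIMAX` class.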

---

## 🧡 Limitations
- The model's performance may decrease for future quarters (beyond Q3 2024) as it relies heavily on historical data trends.
- The model assumes **seasonality** based on historical data, which might not hold in rapidly changing environments.

---

## 🚧 Future Improvements
- Incorporate more **exogenous variables** (e.g., economic indicators, production levels) to improve prediction accuracy.
- Extend the forecasting period beyond Q3 2024 with additional validation.
- Develop **sub-sector-specific models** to account for trends that differ between manufacturing sub-sectors.

---

## 🛎️ Ethical Considerations
The model outputs should be interpreted with caution. Predictions could affect staffing decisions, which may have real-world implications for workers' well-being and production schedules. It’s essential to consider other non-quantitative factors in decision-making.

---

## 📢 Contact Information
For questions or feedback regarding this model, please contact:

**Name**: Caroline Hakker  
**Email**: c.hakker@vistacollege.nl  
**Organization**: Projectteam EAISI UWV

