This project implements a Random Forest Regressor for the early detection of acute respiratory diseases (IRA) in Peru. The model predicts the weekly incidence rate by region, using climate and historical case data, while excluding the pandemic years (2020β2021).
- Source 1: Observatorio de Clima y Salud β MINSA
- Source 2: Population from 2017 Census β INEI
- Time coverage: 2017β2025 (updated weekly)
- Geographic coverage: 26 regions of Peru (including Callao and Lima Metropolitana)
- Variables:
- Climate:
tmean,tmax,tmin,humr,ptot - IRA cases:
Cases - Population by region (
population) - Derived variables: incidence per 100k inhabitants, predicted rates, alert levels
- Climate:
- Algorithm: RandomForestRegressor .
- Weekly incidence predictions (
Predicted_Rate) - π¨ Alert levels:
- π’ Green β Low incidence
- π‘ Yellow β Moderate risk
- π΄ Red β High incidence
- Files ready for Power BI visualization (CSV/Excel)
- Graphs of variable importance and model performance
- Integrate predictions into a live dashboard (Power BI).
- Extend forecasting to future weeks/years.