# Load Forecasting: Baseline & First Machine Learning Model (h+1)

This notebook focuses on building the **first predictive models** for short-term electricity load forecasting.

Objective:
- Predict the electricity load one hour ahead (**t+1**, the h+1 horizon)
- Use historical load, weather, and calendar features
- Establish strong baselines before moving to more complex models
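
One way to frame the h+1 objective as a supervised problem is to pair each row with the load observed one hour later. The sketch below uses a toy hourly series; the column names `load` and `target_h1` are assumptions for illustration and are not taken from the actual dataset schema.

```python
import pandas as pd

# Toy hourly load series; the column name "load" is an assumption.
idx = pd.date_range("2023-01-01", periods=6, freq=pd.Timedelta(hours=1))
toy = pd.DataFrame({"load": [50.0, 52.0, 55.0, 53.0, 51.0, 49.0]}, index=idx)

# The target at time t is the load observed at t+1 hour.
toy["target_h1"] = toy["load"].shift(-1)

# The last row has no future value to predict, so it is dropped.
supervised = toy.dropna(subset=["target_h1"])
```

With this alignment, every feature in a row is known at time t, while the target lies strictly in the future.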

Models covered:
- Naive baseline
- Ridge Regression (linear ML model)
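
For reference, a common reading of "naive baseline" at a one-hour horizon is the persistence forecast, where the prediction for t+1 is simply the load at t. Whether this notebook uses exactly that definition is an assumption; the sketch below scores it on toy values with the same two metrics imported later.

```python
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error

# Toy hourly load values (illustrative only).
load = np.array([50.0, 52.0, 55.0, 53.0, 51.0, 49.0])

# Persistence forecast: the value at t is used as the prediction for t+1.
y_true = load[1:]
y_pred = load[:-1]

mae = mean_absolute_error(y_true, y_pred)
# RMSE is taken as the square root of the MSE.
rmse = np.sqrt(mean_squared_error(y_true, y_pred))
```

Any learned model, Ridge included, is only worth keeping if it beats these numbers on the same evaluation period.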

All splits strictly respect **temporal order**.
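
A minimal sketch of what a temporally ordered split can look like, followed by a Ridge fit on the earlier period only. The data is synthetic, and the cutoff date, the feature names (`load`, `hour_sin`, `hour_cos`), and `alpha=1.0` are illustrative assumptions rather than the notebook's actual choices.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import Ridge

# Synthetic hourly data covering January and February 2023 (illustrative only).
idx = pd.date_range("2023-01-01", periods=24 * 59, freq=pd.Timedelta(hours=1))
rng = np.random.default_rng(0)
hour = idx.hour.to_numpy()
toy = pd.DataFrame(
    {
        "load": 50 + 10 * np.sin(2 * np.pi * hour / 24) + rng.normal(0, 1, len(idx)),
        "hour_sin": np.sin(2 * np.pi * hour / 24),
        "hour_cos": np.cos(2 * np.pi * hour / 24),
    },
    index=idx,
)
toy["target_h1"] = toy["load"].shift(-1)
toy = toy.dropna(subset=["target_h1"])

# Split on a timestamp instead of shuffling: every training row
# precedes every test row.
cutoff = pd.Timestamp("2023-02-01")
train = toy.loc[toy.index < cutoff]
test = toy.loc[toy.index >= cutoff]

features = ["load", "hour_sin", "hour_cos"]
model = Ridge(alpha=1.0)  # alpha is an arbitrary illustrative value
model.fit(train[features], train["target_h1"])
pred = model.predict(test[features])
```

Splitting by timestamp keeps future observations out of training, which a random shuffle would not guarantee.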

## 1. Imports & Setup

In [None]:
from pathlib import Path
import pandas as pd

from sklearn.linear_model import Ridge
from sklearn.metrics import mean_absolute_error, mean_squared_error

## 2. Project Paths and Parameters

In [None]:
# Assumes the notebook is run from a directory one level below the project root.
PROJECT_ROOT = Path.cwd().parent
PROCESSED_BASE_PATH = PROJECT_ROOT / "data" / "processed"

In [None]:
# Parameters
country = "FR"
years = [2023, 2024]

## 3. Load Feature-Engineered Dataset

In [None]:
# Read one feature-engineered parquet file per year from the partitioned layout
dfs = []
for year in years:
    path = (
        PROCESSED_BASE_PATH
        / f"country={country}"
        / f"year={year}"
        / "load_forecasting_features.parquet"
    )
    dfs.append(pd.read_parquet(path))

df = pd.concat(dfs).sort_index()

In [None]:
df.info()
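
After concatenating yearly files, it can be worth confirming that the index is sorted and free of duplicate timestamps at the year boundary. The helper below is hypothetical and assumes the frame is indexed by timestamp, which `sort_index()` suggests but the cells above do not confirm; it is shown on a toy frame and could be called as `check_time_index(df)`.

```python
import pandas as pd


def check_time_index(frame: pd.DataFrame) -> dict:
    """Summarise basic properties of a time-indexed frame (hypothetical helper)."""
    index = frame.index
    return {
        "is_datetime": isinstance(index, pd.DatetimeIndex),
        "is_sorted": bool(index.is_monotonic_increasing),
        "n_duplicates": int(index.duplicated().sum()),
    }


# Toy example: two chunks concatenated out of order, sharing one timestamp.
a = pd.DataFrame(
    {"load": [1.0, 2.0]},
    index=pd.to_datetime(["2024-01-01 00:00", "2024-01-01 01:00"]),
)
b = pd.DataFrame(
    {"load": [3.0, 4.0]},
    index=pd.to_datetime(["2023-12-31 23:00", "2024-01-01 00:00"]),
)
combined = pd.concat([a, b]).sort_index()
report = check_time_index(combined)
```

A non-zero `n_duplicates` would indicate overlapping rows between files, which should be resolved before building lagged targets.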