### **Dataset Feature Dictionary**

This reference table maps the variable names used in the code to their meteorological definitions and significance for Cloudburst Prediction.

#### **1. Core Weather Variables (Raw Data)**
*Measured daily from satellite/station data.*

| Feature Name | Full Name | Definition & Significance |
| :--- | :--- | :--- |
| **`PRECTOTCORR`** | **Precipitation Total Corrected** | **Rainfall (mm/day).** The primary driver. A sudden, extreme spike in this value defines the cloudburst event itself. |
| **`T2M`** | **Temperature at 2 Meters** | **Air Temperature (°C).** Measured 2 m above the ground surface. Warmer air can hold more water vapor, which raises the potential for intense storms. |
| **`T2MDEW`** | **Dew Point Temperature** | **Saturation Point (°C).** The temperature to which air must be cooled to become saturated with water vapor. <br> **Key Insight:** If `T2MDEW` is close to `T2M`, the air is near saturation, a critical precursor to cloudbursts. |
| **`RH2M`** | **Relative Humidity** | **Humidity (%).** The percentage of water vapor in the air relative to the maximum it can hold. Cloudbursts typically require near-saturation levels ($>85-90\%$). |
| **`WS10M`** | **Wind Speed at 10 Meters** | **Wind Speed (m/s).** Measures how fast air is moving. In mountainous regions like Himachal, wind drives **Orographic Lift**: moist air is pushed rapidly up slopes, where it cools and forms storm clouds. |
| **`PS`** | **Surface Pressure** | **Atmospheric Pressure (kPa).** Air converges toward low-pressure systems, drawing in moisture and instability from surrounding areas. Sudden pressure drops often signal approaching storms. |
| **`elevation`** | **Elevation** | **Altitude (Meters).** The height of the location above sea level. Topography plays a massive role in where cloudbursts trigger. |

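The "Key Insight" for `T2MDEW` can be turned into a number. The sketch below computes the dew point depression (`T2M - T2MDEW`) and combines it with `RH2M` into a near-saturation flag. The sample rows, the derived column names `dewpoint_depression` and `near_saturation`, and both thresholds are illustrative assumptions, not part of the dataset or its code.

```python
import pandas as pd

# Hypothetical sample rows; only the input column names come from the table above.
df = pd.DataFrame({
    "T2M": [24.0, 21.5, 19.0],      # air temperature, deg C
    "T2MDEW": [12.0, 20.0, 18.6],   # dew point, deg C
    "RH2M": [47.0, 91.0, 97.5],     # relative humidity, %
})

# Dew point depression: a small gap means the air is close to saturation.
df["dewpoint_depression"] = df["T2M"] - df["T2MDEW"]

# Assumed thresholds, chosen only for illustration:
# a gap of at most 2 deg C and relative humidity of at least 85 %.
df["near_saturation"] = (df["dewpoint_depression"] <= 2.0) & (df["RH2M"] >= 85.0)

print(df)
```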
---

#### **2. Temporal Features**
*Captures seasonality and long-term trends.*

| Feature Name | Definition | Significance |
| :--- | :--- | :--- |
| **`DOY`** | **Day of Year (1–365, or 1–366 in leap years)** | Captures seasonality. Cloudbursts are heavily concentrated in the Monsoon months (roughly DOY 180–240, i.e., late June to late August). |
| **`YEAR`** | **Calendar Year** | Allows the model to detect long-term climate trends (e.g., increased frequency of events in recent years). |

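If the raw data carries a calendar date rather than ready-made `DOY` and `YEAR` columns, both can be derived with pandas. This is a minimal sketch: the `date` column and the `in_monsoon_window` flag are hypothetical names, and the 180–240 window is simply the rough range quoted in the table.

```python
import pandas as pd

# Hypothetical dates; the real dataset may already ship DOY and YEAR columns.
df = pd.DataFrame(
    {"date": pd.to_datetime(["2023-01-01", "2023-07-15", "2024-12-31"])}
)

df["DOY"] = df["date"].dt.dayofyear   # 1..365, or 1..366 in leap years
df["YEAR"] = df["date"].dt.year

# Flag days inside the rough monsoon window from the table (inclusive bounds).
df["in_monsoon_window"] = df["DOY"].between(180, 240)

print(df)
```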
---

#### **3. Engineered Features (Lag & Rolling)**
*Gives the model "Memory" of past weather conditions.*

| Feature Pattern | Type | Explanation |
| :--- | :--- | :--- |
| **`_lag_1`** | **Lag Feature** | Value from **1 Day Ago**. (e.g., `PRECTOTCORR_lag_1` is yesterday's rain).<br>*Why:* Wet soil from yesterday's rain makes today's cloudburst more likely to cause flash floods. |
| **`_lag_2` / `_lag_3`** | **Lag Feature** | Value from **2 or 3 Days Ago**. Captures the immediate history leading up to the event. |
| **`_roll_avg_3`** | **Rolling Mean** | **3-Day Average.** Smooths out noise to show the trend. <br>*Why:* A single hot day is normal, but a rising 3-day average in Temperature (`T2M_roll_avg_3`) can indicate a heatwave that is building up instability. |

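The naming patterns above map directly onto pandas `shift` and `rolling`. The sketch below is one plausible way to build them, under three assumptions: rows are sorted by date, all rows belong to a single location (multi-site data would need a `groupby` on a location key first), and the 3-day window includes the current day. Whether the original pipeline includes or excludes the current day is not stated here. The sample values are made up.

```python
import pandas as pd

# Hypothetical daily values for a single location, already sorted by date.
df = pd.DataFrame({
    "PRECTOTCORR": [0.0, 5.0, 20.0, 80.0, 2.0],
    "T2M": [20.0, 22.0, 24.0, 26.0, 21.0],
})

for col in ["PRECTOTCORR", "T2M"]:
    # Lag features: the value observed k days earlier (NaN for the first k rows).
    for k in (1, 2, 3):
        df[f"{col}_lag_{k}"] = df[col].shift(k)

    # Rolling mean over the current day and the two days before it (assumption).
    # Use df[col].shift(1).rolling(window=3).mean() to exclude the current day.
    df[f"{col}_roll_avg_3"] = df[col].rolling(window=3).mean()

print(df[["PRECTOTCORR", "PRECTOTCORR_lag_1", "T2M", "T2M_roll_avg_3"]])
```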
---

#### **4. Target Variable**
*The label we are training the model to predict.*

* **`is_cloudburst`**: Binary Classification Label.
    * `0`: **Normal Day**
    * `1`: **Cloudburst Event** (>100 mm of rain in a short duration)
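
How the label was actually produced is not specified here. As a rough sketch only, assuming the 100 mm threshold is applied to the daily total in `PRECTOTCORR` (daily data cannot resolve a shorter duration), the label and its class balance could be computed as follows. `THRESHOLD_MM` and the sample values are hypothetical.

```python
import pandas as pd

# Hypothetical daily rainfall totals in mm/day.
df = pd.DataFrame({"PRECTOTCORR": [3.2, 0.0, 142.5, 18.0, 101.0]})

# Assumption: the >100 mm rule is applied to the daily total.
THRESHOLD_MM = 100.0
df["is_cloudburst"] = (df["PRECTOTCORR"] > THRESHOLD_MM).astype(int)

# Cloudbursts are rare, so checking class balance before training is useful.
counts = df["is_cloudburst"].value_counts()
print(counts)
```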