# 📘 Aircraft AOG Demo: Workflow & Instructions

This document describes the notebooks that simulate, ingest, and analyze aircraft sensor and maintenance data for an AOG (Aircraft on Ground) demo, using Delta Live Tables (DLT) and Auto Loader in Databricks.

---

## 📂 Updated Notebook Workflow

| Notebook                               | Purpose |
|----------------------------------------|---------|
| `01_Table_Creation.ipynb`              | 🏗️ Creates all Delta tables in Unity Catalog (`raw_sensor_data`, `maintenance_events`, etc.) |
| `02_Synthetic_Data_Generation.ipynb`   | 🧪 Writes synthetic raw sensor and maintenance event data to volume paths as CSV files with timestamped filenames |
| `03A_Autoloader_Raw_Data.ipynb`        | 📥 Ingests raw sensor data using Auto Loader from `/tmp/raw/` |
| `03B_Autoloader_Maintenance_Events.ipynb` | 🛠 Ingests aircraft maintenance logs using Auto Loader from `/tmp/maintenance/` |
| `04_DLT_Cleaning_Pipeline.ipynb`       | 🧼 DLT pipeline to clean and validate raw sensor data |
| `05_DLT_Prediction_Pipeline.ipynb`     | 🤖 DLT pipeline to generate AOG risk predictions |

---

## 🔁 When to Run Each Notebook

### ▶️ `01_Table_Creation.ipynb`
- Run **once** to create all tables in `arao.aerodemo`.

### ▶️ `02_Synthetic_Data_Generation.ipynb`
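- Run whenever you need new CSV files for raw sensor data and maintenance events.
- Each file is saved under a unique, timestamped name. Auto Loader tracks the files it has already ingested by path, so a new name ensures the file is picked up as new input.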
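
The timestamped naming can be sketched with the Python standard library alone. This is a minimal illustration, not the notebook's actual code: the function name `write_timestamped_csv`, the sample row values, and the temporary output directory are all hypothetical.

```python
import csv
import tempfile
from datetime import datetime, timezone
from pathlib import Path


def write_timestamped_csv(rows, out_dir, prefix):
    """Write rows (a list of dicts) to <out_dir>/<prefix>_<UTC timestamp>.csv."""
    if not rows:
        raise ValueError("rows must not be empty")
    # Microsecond resolution makes name collisions between runs unlikely.
    stamp = datetime.now(timezone.utc).strftime("%Y%m%dT%H%M%S%f")
    path = Path(out_dir) / f"{prefix}_{stamp}.csv"
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=list(rows[0].keys()))
        writer.writeheader()
        writer.writerows(rows)
    return path


# Hypothetical sample row using the raw_sensor_data columns.
sample_rows = [
    {
        "timestamp": "2024-01-01 00:00:00",
        "aircraft_id": "AC-001",
        "model": "A320",
        "engine_temp": 612.4,
        "fuel_efficiency": 81.2,
        "vibration": 0.31,
    },
]

# A temporary directory stands in for the demo's landing path.
demo_dir = tempfile.mkdtemp()
first = write_timestamped_csv(sample_rows, demo_dir, "raw_sensor_data")
print(first.name)
```

Because every run produces a new file name, rerunning the generator never overwrites a file that Auto Loader has already recorded as processed.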

### ▶️ `03A_Autoloader_Raw_Data.ipynb`
- Run after generating new raw sensor data.
- Auto Loader appends new data into `raw_sensor_data`.

### ▶️ `03B_Autoloader_Maintenance_Events.ipynb`
- Run after generating new maintenance event files.
- Auto Loader appends new data into `maintenance_events`.
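
Both Auto Loader notebooks follow the same read-then-append pattern, which might look roughly like the sketch below. It assumes a Databricks runtime where `spark` is available and the `cloudFiles` source is supported. The function name, the checkpoint path, and the choice of an explicit schema with an `availableNow` trigger are assumptions for illustration, not necessarily what the notebooks do.

```python
# Column definitions taken from the raw_sensor_data table described in this document.
RAW_SENSOR_SCHEMA = (
    "timestamp TIMESTAMP, aircraft_id STRING, model STRING, "
    "engine_temp DOUBLE, fuel_efficiency DOUBLE, vibration DOUBLE"
)


def ingest_csv_with_auto_loader(spark, source_path, target_table, schema_ddl, checkpoint_path):
    """Append CSV files that have not been ingested yet from source_path into a table."""
    stream = (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "csv")
        .option("header", "true")
        # An explicit schema keeps column types aligned with the target table.
        .schema(schema_ddl)
        .load(source_path)
    )
    return (
        stream.writeStream
        # The checkpoint records which files were already ingested.
        .option("checkpointLocation", checkpoint_path)
        # Process everything available now, then stop, so the notebook run ends.
        .trigger(availableNow=True)
        .toTable(target_table)
    )


# Example call inside a Databricks notebook (checkpoint path is hypothetical):
# ingest_csv_with_auto_loader(
#     spark,
#     "/tmp/raw/",
#     "arao.aerodemo.raw_sensor_data",
#     RAW_SENSOR_SCHEMA,
#     "/tmp/checkpoints/raw_sensor_data",
# )
```

For `03B`, the same function would be called with `/tmp/maintenance/`, the `maintenance_events` table, and a schema string matching that table's columns.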

### ▶️ `04_` and `05_` (Optional)
- Run these only if you want the DLT pipelines for data cleaning and AOG risk prediction.

---

## 📊 Dataset Descriptions

### 📘 `raw_sensor_data`
This dataset simulates daily aircraft telemetry captured from onboard sensors.

| Column           | Type     | Description                                  |
|------------------|----------|----------------------------------------------|
| `timestamp`       | TIMESTAMP | Time of the recorded sensor reading          |
| `aircraft_id`     | STRING    | Unique identifier for an aircraft            |
| `model`           | STRING    | Aircraft model (e.g., A320, B737, A330)      |
| `engine_temp`     | DOUBLE    | Engine temperature reading                   |
| `fuel_efficiency` | DOUBLE    | Estimated fuel efficiency (%)                |
| `vibration`       | DOUBLE    | Vibration index from the engine or structure |

This data reflects operating trends and early indicators of possible failure.
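
As an illustration of how the column list above maps to a table definition, the following sketch builds a `CREATE TABLE` statement for `arao.aerodemo.raw_sensor_data`. The helper `build_create_table_sql` is hypothetical, and the exact DDL used in `01_Table_Creation.ipynb` may differ.

```python
# Columns and types copied from the table above.
RAW_SENSOR_COLUMNS = [
    ("timestamp", "TIMESTAMP"),
    ("aircraft_id", "STRING"),
    ("model", "STRING"),
    ("engine_temp", "DOUBLE"),
    ("fuel_efficiency", "DOUBLE"),
    ("vibration", "DOUBLE"),
]


def build_create_table_sql(table_name, columns):
    """Return an idempotent CREATE TABLE statement for a Delta table."""
    # Backticks guard column names such as `timestamp` that are also type names.
    cols = ",\n  ".join(f"`{name}` {dtype}" for name, dtype in columns)
    return f"CREATE TABLE IF NOT EXISTS {table_name} (\n  {cols}\n) USING DELTA"


ddl = build_create_table_sql("arao.aerodemo.raw_sensor_data", RAW_SENSOR_COLUMNS)
print(ddl)

# In a Databricks notebook the statement would be executed with:
# spark.sql(ddl)
```

The `IF NOT EXISTS` clause makes the statement safe to rerun, although the table creation notebook only needs to run once.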

---

### 🛠 `maintenance_events`
This dataset logs scheduled and unscheduled maintenance events.

| Column        | Type   | Description                                     |
|---------------|--------|-------------------------------------------------|
| `aircraft_id`  | STRING | Aircraft that underwent maintenance             |
| `event_date`   | DATE   | Date the maintenance occurred                   |
| `event_type`   | STRING | Description of maintenance (e.g., Engine Repair, Routine Check) |

This table is used to correlate operational conditions with actual maintenance activity, for downstream risk prediction or model training.
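
One way to picture that correlation is to collect, for each maintenance event, the sensor readings from the same aircraft in the days leading up to it. The plain-Python sketch below shows only the matching logic; in the demo this would more likely be a Spark join. The function name, the 7-day window, and the sample rows are all hypothetical.

```python
from datetime import date, datetime, timedelta


def readings_before_event(sensor_rows, event, window_days=7):
    """Return sensor rows for the event's aircraft within window_days before event_date."""
    start = event["event_date"] - timedelta(days=window_days)
    return [
        row
        for row in sensor_rows
        if row["aircraft_id"] == event["aircraft_id"]
        and start <= row["timestamp"].date() <= event["event_date"]
    ]


# Hypothetical sample data shaped like the two tables.
sensor_rows = [
    {"timestamp": datetime(2024, 1, 3, 12, 0), "aircraft_id": "AC-001", "vibration": 0.42},
    {"timestamp": datetime(2023, 12, 1, 12, 0), "aircraft_id": "AC-001", "vibration": 0.20},
    {"timestamp": datetime(2024, 1, 4, 12, 0), "aircraft_id": "AC-002", "vibration": 0.25},
]
event = {"aircraft_id": "AC-001", "event_date": date(2024, 1, 5), "event_type": "Engine Repair"}

matched = readings_before_event(sensor_rows, event)
print(len(matched))  # → 1
```

Only the first reading matches: the second is outside the window, and the third belongs to a different aircraft.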

---