# 🚀 Robot Predictive Maintenance: MLOps Architecture Enhancements

This notebook demonstrates the evolution of the Linear Regression project into a professional MLOps-ready framework. It integrates real-time streaming simulation with cloud-based database connectivity and statistical anomaly detection.

---

## 🏗️ 1. Project Structure Overview
To ensure reproducibility and maintainability, the project follows standard MLOps architectural patterns:

| Directory/File | Description |
| :--- | :--- |
| **`configs/`** | **Config-Driven Development**: Manages DB URLs and thresholds via `experiment_config.yaml`. |
| **`data/raw/`** | **Immutable Data**: Stores the original `RMBR4-2_export_test.csv` for baseline training. |
| **`data/processed/`** | **Processed Artifacts**: Contains `synthetic_test_data.csv` and the generated `alert_events.csv`. |
| **`experiments/`** | **Tracking**: Records model performance metrics (RMSE, R² scores) in `results.csv`. |
| **`models/`** | **Persistence**: Stores trained weights as `.pkl` files for deployment without retraining. |
| **`src/`** | **Modular Code**: Houses `stream_monitor.py` (DBPoller, AlertEngine, LivePlotter). |

## 🔍 2. Threshold Discovery & Maintenance Logic
Rather than relying on fixed, hand-picked limits, the alert and error thresholds were derived through **Residual Analysis** of Axis #1 telemetry:

* **MinC (Alert) [1.372 A]**: Set at **$2\sigma$** from the residual mean to identify early performance deviations.
* **MaxC (Error) [2.058 A]**: Set at **$3\sigma$** from the residual mean to identify critical outliers and potential stress.
* **Persistence Rule ($T$)**: Alerts require **90 s ($T=3$)** and Errors require **60 s ($T=2$)** of sustained deviation to filter transient noise.
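The threshold and persistence logic above can be sketched in a few lines of Python. This is a minimal illustration, not the project's `AlertEngine`. The function names `derive_thresholds` and `sustained_runs` are hypothetical, the thresholds are assumed to be `mean + k * sigma` using the population standard deviation, and $T$ is assumed to count consecutive samples (90 s at $T=3$ and 60 s at $T=2$ both suggest a 30 s sampling interval). Under that formula, the two stated values are consistent with $\sigma = 0.686$ A and a residual mean of 0 A.

```python
from statistics import mean, pstdev


def derive_thresholds(residuals, alert_k=2.0, error_k=3.0):
    """Return (alert, error) thresholds as mean + k * sigma of the residuals.

    Assumption: population standard deviation; the source does not say
    whether the sample or population estimator was used.
    """
    mu = mean(residuals)
    sigma = pstdev(residuals)
    return mu + alert_k * sigma, mu + error_k * sigma


def sustained_runs(values, threshold, t):
    """Return the start index of every run of at least t consecutive
    samples strictly above threshold (one entry per run)."""
    starts = []
    run = 0
    for i, v in enumerate(values):
        if v > threshold:
            run += 1
            if run == t:  # fire once, when the run first reaches length t
                starts.append(i - t + 1)
        else:
            run = 0  # any sample at or below the threshold resets the run
    return starts


if __name__ == "__main__":
    # Toy residuals with mean 0 and sigma 1, so thresholds are 2.0 and 3.0.
    alert, error = derive_thresholds([-1.0, 1.0, -1.0, 1.0])
    stream = [0.0, 3.5, 3.5, 3.5, 0.0, 3.5, 3.5]
    print("alert runs:", sustained_runs(stream, alert, 3))
    print("error runs:", sustained_runs(stream, error, 2))
```

Requiring a full run before firing is what filters out single-sample spikes: a lone value above the threshold never reaches the run length and is discarded when the next sample falls back below it.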

## 🛠️ 3. Functional Enhancements
The pipeline has been upgraded to handle real-world industrial data complexities:

* **Hybrid Data Ingestion**: Supports **PostgreSQL (Neon)** connectivity via `load_from_db` for scalable cloud-based streaming.
* **Synthetic Stream Simulation**: Uses **Inverse Z-score Standardization** to generate realistic testing data based on training metadata (`mean` and `std`).
* **Advanced Preprocessing**: Applies **Moving Average Smoothing** to filter high-frequency sensor noise that hinders model accuracy.
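As a rough illustration of the synthetic-stream and smoothing steps listed above, the sketch below draws standard-normal z-scores, maps them back to the original scale with the inverse transform `x = z * std + mean`, and then applies a trailing moving average. The function names, the Gaussian noise model, and the trailing (causal) window are assumptions made for the example; the project's own generator and smoother may differ.

```python
import random


def inverse_zscore(z_values, mean, std):
    """Map standardized values back to the original scale: x = z * std + mean."""
    return [z * std + mean for z in z_values]


def synthesize(n, mean, std, seed=0):
    """Generate n synthetic readings from training metadata (mean, std).

    Assumption: z-scores are drawn from a standard normal distribution.
    """
    rng = random.Random(seed)  # seeded for reproducible test data
    z_values = [rng.gauss(0.0, 1.0) for _ in range(n)]
    return inverse_zscore(z_values, mean, std)


def moving_average(values, window):
    """Trailing moving average; the first window-1 outputs use a shorter window."""
    if window < 1:
        raise ValueError("window must be at least 1")
    out = []
    total = 0.0
    for i, v in enumerate(values):
        total += v
        if i >= window:
            total -= values[i - window]  # drop the sample leaving the window
        out.append(total / min(i + 1, window))
    return out


if __name__ == "__main__":
    raw = synthesize(10, mean=1.0, std=0.2)
    print(moving_average(raw, window=3))
```

A trailing window uses only past samples, so it can run on a live stream, at the cost of lagging the true signal by roughly half the window length.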

## 📊 4. Real-Time Performance & Alerting
The implementation translates mathematical slopes from the Linear Regression model into proactive maintenance actions:

* **Predictive Trend Analysis**: A real-time **Regression Trend Line** is calculated to predict when current levels will cross failure boundaries.
* **Automated Event Logging**: The `AlertEngine` identifies sustained anomalies and logs them directly to `alert_events.csv`.
* **Visual Dashboard**: Overlays Alert (Orange X) and Error (Red Cross) markers on the live telemetry stream for immediate operator visibility.
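The trend-line idea can be illustrated with an ordinary least-squares fit over a recent window of readings, followed by solving `slope * t + intercept = threshold` for `t`. This is a sketch under stated assumptions: `fit_line` and `time_to_cross` are hypothetical names, timestamps are in seconds, and a flat or falling trend is treated as "no predicted crossing".

```python
def fit_line(ts, ys):
    """Ordinary least-squares fit of y = slope * t + intercept."""
    n = len(ts)
    t_mean = sum(ts) / n
    y_mean = sum(ys) / n
    sxx = sum((t - t_mean) ** 2 for t in ts)
    if sxx == 0:
        raise ValueError("need at least two distinct timestamps")
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(ts, ys))
    slope = sxy / sxx
    return slope, y_mean - slope * t_mean


def time_to_cross(ts, ys, threshold):
    """Seconds from the last timestamp until the fitted line reaches threshold.

    Returns None when the trend is flat or falling, and 0.0 when the fitted
    line is already at or above the threshold.
    """
    slope, intercept = fit_line(ts, ys)
    if slope <= 0:
        return None
    t_cross = (threshold - intercept) / slope
    return max(0.0, t_cross - ts[-1])


if __name__ == "__main__":
    # Four readings 30 s apart, rising by 0.1 A per sample.
    ts = [0, 30, 60, 90]
    ys = [1.0, 1.1, 1.2, 1.3]
    print(time_to_cross(ts, ys, threshold=2.058))
```

A linear extrapolation like this is only meaningful over short horizons; fitting on a sliding window rather than the full history keeps the estimate responsive to recent changes in slope.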

---

**Authors**: Chao-Chung, Emmanuel, Liggia Elena

**Project**: Practical Lab 1 - Streaming Data for Predictive Maintenance with Linear Regression-Based Alerts