This repository contains wind turbine SCADA (Supervisory Control and Data Acquisition) datasets from two distinct wind farms for wind power analysis and forecasting research.
| Dataset | Source | Location | Time Period | Interval | Records |
|---|---|---|---|---|---|
| La Haute Borne | ENGIE Open Data | Meuse, France | 2017 onwards | 10 min | ~8,900+ |
| Turkey Wind Turbine | Kaggle (B. Erisen) | Turkey | 2018 | 10 min | ~50,500+ |
Source: ENGIE Group Open Data Portal
URL: https://opendata-renewables.engie.com/explore/?sort=modified
License: Open License 2.0 (Etalab)
La Haute Borne is an operational wind farm located in the Meuse department of France, operated by ENGIE Group. This dataset provides comprehensive SCADA measurements including detailed turbine operational parameters.
Source: Kaggle - Wind Turbine SCADA Dataset by Berker Isen
URL: https://www.kaggle.com/berkerisen/wind-turbine-scada-dataset
This dataset contains real operational data from a wind turbine in Turkey, widely used for wind power prediction and early fault detection research.
Comprehensive SCADA data from turbine R80711 at La Haute Borne wind farm with 138 features.
| Abbreviation | Description | Unit |
|---|---|---|
| Wind_turbine_name | Turbine identifier | - |
| Date_time | Timestamp (10-min intervals) | - |
| P_avg/min/max/std | Active Power | kW |
| Q_avg/min/max/std | Reactive Power | kVar |
| S_avg/min/max/std | Apparent Power | kVA |
| Ws1_avg, Ws2_avg, Ws_avg | Wind Speed (multiple sensors) | m/s |
| Wa_avg/min/max/std | Wind Direction (Angle) | ° |
| Ba_avg/min/max/std | Blade Pitch Angle | ° |
| Rt_avg/min/max/std | Rotor Speed | RPM |
| Ya_avg/min/max/std | Yaw Angle | ° |
| Ot_avg/min/max/std | Outside Temperature | °C |
| DCs_avg/min/max/std | DC Bus Voltage | V |
| Cm_avg/min/max/std | Converter Module | - |
| Cosphi_avg/min/max/std | Power Factor | - |
| Nf_avg/min/max/std | Grid Frequency | Hz |
| Abbreviation | Description |
|---|---|
| Db1t, Db2t | Drive Bearing Temperatures |
| Dst | Drive Shaft Temperature |
| Gb1t, Gb2t | Gearbox Bearing Temperatures |
| Git | Generator Inlet Temperature |
| Gost | Generator Output Stator Temperature |
| Yt | Yaw Temperature |
| Rbt | Rotor Bearing Temperature |
| Rm | Rotor Motor Temperature |
Note: Each measurement includes statistical aggregations:
_avg(average),_min(minimum),_max(maximum),_std(standard deviation) over the 10-minute interval.
SCADA data from a Turkish wind turbine with 5 features.
| Column | Description | Unit |
|---|---|---|
| Date/Time | Timestamp (10-min intervals) | DD MM YYYY HH:MM |
| LV ActivePower | Active Power Output | kW |
| Wind Speed | Measured Wind Speed | m/s |
| Theoretical_Power_Curve | Expected Power from Power Curve | kWh |
| Wind Direction | Wind Direction | ° |
- Wind Power Forecasting: Short-term and long-term power generation prediction
- Power Curve Modeling: Analyzing the relationship between wind speed and power output
- Anomaly Detection: Identifying operational anomalies and potential faults
- Predictive Maintenance: Early fault detection using SCADA patterns
- Performance Optimization: Turbine efficiency analysis and optimization
- Machine Learning Research: Training ML/DL models for renewable energy applications
- Both datasets use 10-minute sampling intervals, which is the industry standard for wind turbine SCADA systems
- The La Haute Borne dataset provides extensive statistical summaries (avg, min, max, std) for each measurement window
- The Turkey dataset includes a theoretical power curve column useful for performance benchmarking
- Missing values may be present and should be handled appropriately during preprocessing
@misc{engie_la_haute_borne_2018,
title = {La Haute Borne Wind Farm Open Data},
author = {ENGIE Group},
year = {2018},
howpublished = {\url{https://opendata-renewables.engie.com/}}
}
@misc{erisen_wind_turbine_2018,
title = {Wind Turbine SCADA Dataset},
author = {Berker Erisen},
year = {2018},
howpublished = {\url{https://www.kaggle.com/berkerisen/wind-turbine-scada-dataset}}
}