# Power System Data Collection and Validation
This project gathers, cleans, and validates time series datasets for power and energy systems. The datasets are centralized and documented to support planning and simulation tasks in energy network research.

---

#### Instructions

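1. **Run the Code Cells Below**  
   They load the dataset catalog, filter it by criteria, and list links to the matching source folders.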

2. **Explore Further**  
   A detailed Jupyter notebook is available in each source's subfolder.


#### How to Query the Dataset

You can use the `query_data_sources()` function to filter datasets based on specific criteria such as:

- Type of load (e.g., active, reactive)
- Type of renewable source (e.g., solar, wind)
- Whether the dataset is synthetic or real
- Presence of price/economic data
- Time resolution or geographical coverage

#### Example Queries

- Find datasets with both solar and wind data  
- Find datasets that are synthetic only  
- Find datasets that include price data and wind generation  
- Filter by specific load types like residential or industrial

Use the function in code cells like this:


In [1]:
from IPython.display import display
from utils.data_catalog import load_data_sources, query_data_sources, show_folder_links

# Load the dataset catalog
df = load_data_sources()
display(df)


| | Source | Description | Number of Profiles | Profile Types | Load | Renewable | Environment | Economy | Processed | Synthetic | Horizon | Time Resolution | Geographical | Folder |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | AgenceORE_Consumption_lt36kVA | Aggregated half-hourly electricity consumption data from consumption points with power subscriptions below 36kVA. | 130 | [load, consumption points, energy consumption] | [active, aggregated, residential] | [] | [] | [] | True | False | 2020–2024 | [30min] | [France] | AgenceORE_Consumption_lt36kVA/ |
| 1 | eCO2mix_France_GenerationBySource | Real-time and historical electricity generation, consumption, forecast, and emissions data for France, disaggregated by energy source and technology. | 36 | [load, production, renewable, forecast, environment] | [consumption, active, national, aggregated] | [solar, wind, hydro, bioenergy] | [co2 intensity] | [] | True | False | 2012-2022 | [15min] | [France] | eCO2mix_France_GenerationBySource/ |
| 2 | OPSD | Open Power System Data - EU-wide TSO-provided time series | 220 | [load, renewable, capacity, price, forecast] | [active, aggregated, national, historical] | [solar, wind] | [] | [price] | True | False | 2015-2020 | [15min, 30min, 60min] | [EU, United Kingdom, Switzerland, Norway, Montenegro, Serbia, Ukraine] | OPSD_TimeSeries/ |
| 3 | SimBench | Synthetic power system benchmark datasets for grid studies | 614 | [load, renewable, powerplant, storage] | [active, reactive, residential, industry, commercial] | [solar, wind, biomass, hydro] | [] | [] | True | True | 2016-2017 | [15min] | [Germany] | SimBench/ |


In [2]:
# Filter datasets by criteria (e.g., solar + wind)
results = query_data_sources(df, renewable=["solar", "wind"])
display(results)

| | Source | Description | Number of Profiles | Profile Types | Load | Renewable | Environment | Economy | Processed | Synthetic | Horizon | Time Resolution | Geographical | Folder |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | eCO2mix_France_GenerationBySource | Real-time and historical electricity generation, consumption, forecast, and emissions data for France, disaggregated by energy source and technology. | 36 | [load, production, renewable, forecast, environment] | [consumption, active, national, aggregated] | [solar, wind, hydro, bioenergy] | [co2 intensity] | [] | True | False | 2012-2022 | [15min] | [France] | eCO2mix_France_GenerationBySource/ |
| 1 | OPSD | Open Power System Data - EU-wide TSO-provided time series | 220 | [load, renewable, capacity, price, forecast] | [active, aggregated, national, historical] | [solar, wind] | [] | [price] | True | False | 2015-2020 | [15min, 30min, 60min] | [EU, United Kingdom, Switzerland, Norway, Montenegro, Serbia, Ukraine] | OPSD_TimeSeries/ |
| 2 | SimBench | Synthetic power system benchmark datasets for grid studies | 614 | [load, renewable, powerplant, storage] | [active, reactive, residential, industry, commercial] | [solar, wind, biomass, hydro] | [] | [] | True | True | 2016-2017 | [15min] | [Germany] | SimBench/ |
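
The other example queries listed earlier (synthetic only, price data plus wind generation, a specific load type) can be approximated with plain pandas when the keyword arguments of `query_data_sources()` are not known. This is a minimal sketch, not the project's implementation. The toy `catalog` copies a few columns from the catalog shown above, `has_all` is a hypothetical helper, and the sketch assumes that list-valued columns hold Python lists rather than strings.

```python
import pandas as pd

# Toy catalog with a subset of the columns shown above.
# Assumption: list-valued fields are real Python lists; the actual catalog
# may store them differently (for example as strings).
catalog = pd.DataFrame(
    {
        "Source": [
            "AgenceORE_Consumption_lt36kVA",
            "eCO2mix_France_GenerationBySource",
            "OPSD",
            "SimBench",
        ],
        "Load": [
            ["active", "aggregated", "residential"],
            ["consumption", "active", "national", "aggregated"],
            ["active", "aggregated", "national", "historical"],
            ["active", "reactive", "residential", "industry", "commercial"],
        ],
        "Renewable": [
            [],
            ["solar", "wind", "hydro", "bioenergy"],
            ["solar", "wind"],
            ["solar", "wind", "biomass", "hydro"],
        ],
        "Economy": [[], [], ["price"], []],
        "Synthetic": [False, False, False, True],
    }
)


def has_all(column, required):
    """Hypothetical helper: boolean mask of rows whose list holds every required tag."""
    required = set(required)
    return column.apply(lambda tags: required.issubset(tags))


# Datasets that are synthetic only
synthetic_only = catalog[catalog["Synthetic"]]

# Datasets that include price data and wind generation
price_and_wind = catalog[
    has_all(catalog["Economy"], ["price"]) & has_all(catalog["Renewable"], ["wind"])
]

# Datasets with a specific load type
residential = catalog[has_all(catalog["Load"], ["residential"])]

print(synthetic_only["Source"].tolist())
print(price_and_wind["Source"].tolist())
print(residential["Source"].tolist())
```

Building one boolean mask per criterion and combining the masks with `&` keeps each filter independent, so criteria can be added or removed without touching the others.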


In [3]:
# Show folder links for matching datasets
show_folder_links(results)

- [eCO2mix_France_GenerationBySource](eCO2mix_France_GenerationBySource/)
- [OPSD](OPSD_TimeSeries/)
- [SimBench](SimBench/)
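
The link list above pairs each `Source` with its `Folder`. As a rough illustration of how such a list could be built, the sketch below formats the same three entries as Markdown bullets. `format_folder_links` is a hypothetical stand-in written for this example; the actual `show_folder_links()` in `utils.data_catalog` may work differently.

```python
def format_folder_links(rows):
    """Hypothetical helper: one Markdown bullet per (source, folder) pair."""
    return "\n".join(f"- [{source}]({folder})" for source, folder in rows)


# (Source, Folder) pairs taken from the query results above
rows = [
    ("eCO2mix_France_GenerationBySource", "eCO2mix_France_GenerationBySource/"),
    ("OPSD", "OPSD_TimeSeries/"),
    ("SimBench", "SimBench/"),
]

links_markdown = format_folder_links(rows)
print(links_markdown)
```

In a notebook, the resulting string can be rendered as clickable links by passing it to `IPython.display.Markdown` and calling `display()` on the result.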