# PVGIS Data Download and Exploration
This notebook demonstrates how to download and explore meteorological data from the PVGIS (Photovoltaic Geographical Information System) API for the Sceaux, France station. The hourly series includes global irradiance on the inclined plane requested below (`G(i)`, not horizontal irradiance, because the request sets a tilt angle) and 2 m air temperature (`T2m`), both of which are useful for solar forecasting research.

**Note:** The file `EUPVGIS_sceaux_2006-2016.csv` distributed with this repository was originally downloaded using PVGIS API version 5.1, which is no longer available. The current code uses version 5.2 of the API, which may return data with slight differences.

In [1]:
import pandas as pd
import urllib.parse

In [2]:
pvgis_entrypoint = "https://re.jrc.ec.europa.eu/api/v5_2/seriescalc?"
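params = {
    "lat": 48.779,                  # latitude, decimal degrees
    "lon": 2.288,                   # longitude, decimal degrees
    "startyear": 2006,
    "endyear": 2016,
    "raddatabase": "PVGIS-SARAH2",  # radiation database
    "angle": 38,                    # tilt of the plane from horizontal, degrees
    "aspect": -4,                   # azimuth, degrees (0 = south, 90 = west, -90 = east)
}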


In [3]:
url = pvgis_entrypoint + urllib.parse.urlencode(params)
url

'https://re.jrc.ec.europa.eu/api/v5_2/seriescalc?lat=48.779&lon=2.288&startyear=2006&endyear=2016&raddatabase=PVGIS-SARAH2&angle=38&aspect=-4'
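
As a quick sanity check on the URL construction, the query string can be decoded back into its parameters. This is only a sketch: it repeats the `params` dictionary from above so that it runs on its own, and the `roundtrip` name is introduced here for illustration.

```python
import urllib.parse

pvgis_entrypoint = "https://re.jrc.ec.europa.eu/api/v5_2/seriescalc?"
params = {
    "lat": 48.779,
    "lon": 2.288,
    "startyear": 2006,
    "endyear": 2016,
    "raddatabase": "PVGIS-SARAH2",
    "angle": 38,
    "aspect": -4,
}
url = pvgis_entrypoint + urllib.parse.urlencode(params)

# Split off the query string and decode it again.
query = urllib.parse.urlsplit(url).query
decoded = urllib.parse.parse_qs(query)

# parse_qs returns a list of strings per key; keep the single value of each.
roundtrip = {key: values[0] for key, values in decoded.items()}
print(roundtrip)
```

Every value comes back as a string, so a comparison against `params` needs `str(value)` on the original side.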

In [4]:
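# Skip the 8 lines before the column header and the last 10 lines of the
# response; skipfooter is only supported by the Python parsing engine.
pvgis_df = pd.read_csv(url, sep=",", skiprows=8,
                       skipfooter=10, engine="python",
                       date_format="%Y%m%d:%H%M",  # e.g. 20060101:0010
                       parse_dates=True,
                       index_col=0)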
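
To see what `skiprows`, `skipfooter` and `date_format` do without calling the API, the same `read_csv` arguments can be applied to a small in-memory text. The sketch below uses a made-up stand-in, not a real PVGIS response: the metadata and footer lines are invented placeholders, and only the column header and timestamp format follow the output shown in this notebook. It assumes pandas 2.0 or later, where the `date_format` argument is available.

```python
import io

import pandas as pd

# Hypothetical stand-in for the response layout: 8 leading lines, a header
# row, a few data rows, then 10 trailing lines. Placeholder text is invented.
leading = [f"metadata line {i}" for i in range(1, 9)]
table = [
    "time,G(i),H_sun,T2m,WS10m,Int",
    "20060101:0010,0.0,0.0,3.39,4.28,0.0",
    "20060101:0110,0.0,0.0,2.93,4.21,0.0",
    "20060101:0210,0.0,0.0,2.65,4.28,0.0",
]
trailing = [f"footer line {i}" for i in range(1, 11)]
sample_text = "\n".join(leading + table + trailing)

# Same parsing arguments as the download cell, applied to the in-memory text.
sample_df = pd.read_csv(io.StringIO(sample_text), sep=",", skiprows=8,
                        skipfooter=10, engine="python",
                        date_format="%Y%m%d:%H%M",
                        parse_dates=True,
                        index_col=0)
print(sample_df)
```

If the API ever changes the number of leading or trailing lines, the header row would no longer line up, so checking `sample_df.columns` (or `pvgis_df.columns`) after parsing is a cheap safeguard.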


In [5]:
pvgis_df.head()

time,G(i),H_sun,T2m,WS10m,Int
2006-01-01 00:10:00,0.0,0.0,3.39,4.28,0.0
2006-01-01 01:10:00,0.0,0.0,2.93,4.21,0.0
2006-01-01 02:10:00,0.0,0.0,2.65,4.28,0.0
2006-01-01 03:10:00,0.0,0.0,2.49,4.28,0.0
2006-01-01 04:10:00,0.0,0.0,2.12,4.21,0.0
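
A few first checks that are often useful on a series like this can be sketched on the five rows shown above. The frame is rebuilt by hand here so the snippet runs without a download; the names `sample_df`, `is_hourly`, `night_irradiance_ok` and `mean_temp` are illustrative, and the same expressions should apply to `pvgis_df`.

```python
import pandas as pd

# Rebuild the five rows printed by pvgis_df.head().
timestamps = pd.to_datetime([
    "2006-01-01 00:10:00",
    "2006-01-01 01:10:00",
    "2006-01-01 02:10:00",
    "2006-01-01 03:10:00",
    "2006-01-01 04:10:00",
])
sample_df = pd.DataFrame(
    {
        "G(i)": [0.0, 0.0, 0.0, 0.0, 0.0],        # in-plane irradiance
        "H_sun": [0.0, 0.0, 0.0, 0.0, 0.0],       # sun height
        "T2m": [3.39, 2.93, 2.65, 2.49, 2.12],    # 2 m air temperature
        "WS10m": [4.28, 4.21, 4.28, 4.28, 4.21],  # 10 m wind speed
        "Int": [0.0, 0.0, 0.0, 0.0, 0.0],
    },
    index=pd.DatetimeIndex(timestamps, name="time"),
)

# Check that consecutive timestamps are exactly one hour apart.
steps = sample_df.index.to_series().diff().dropna()
is_hourly = (steps == pd.Timedelta(hours=1)).all()

# Night-time rows (sun not above the horizon) should carry zero irradiance.
is_night = sample_df["H_sun"] <= 0
night_irradiance_ok = (sample_df.loc[is_night, "G(i)"] == 0).all()

# Simple summary statistic on the temperature column.
mean_temp = sample_df["T2m"].mean()
print(is_hourly, night_irradiance_ok, mean_temp)
```

Note that the timestamps fall at 10 minutes past each hour rather than on the hour, which matters when aligning this series with other hourly data.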
