# Original LPIS View

This Jupyter notebook reads Land Parcel Identification System (LPIS) GeoPackage files for several areas of interest (AOIs) and years, and reports the number of rows in each file. The counts give a first look at how land parcels are distributed across these regions and how their number changes between years.

## GeoPackage Files

The notebook expects one folder per AOI, each holding one GeoPackage per year named `{aoi}{year}.gpkg` (for example, `Austria/Austria2019.gpkg`). Each file is loaded and its row count is printed, which gives a quick view of how the number of land parcels changes over time.
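Before loading anything, it can help to confirm that every expected file is present. The sketch below assumes the layout used by the loading loop later in this notebook, `<root>/<AOI>/<AOI><year>.gpkg`. The helpers `expected_gpkg_paths` and `missing_files` are hypothetical names introduced for illustration and are not part of the original notebook.

```python
from pathlib import Path


def expected_gpkg_paths(root, aois, years):
    """Return the GeoPackage path for every (AOI, year) pair.

    Hypothetical helper: assumes the layout <root>/<AOI>/<AOI><year>.gpkg.
    """
    root = Path(root)
    return [root / aoi / f"{aoi}{year}.gpkg" for aoi in aois for year in years]


def missing_files(paths):
    """Return the paths that do not exist as files on disk."""
    return [p for p in paths if not p.is_file()]


paths = expected_gpkg_paths(
    "AgriSen-COG/intermediate_outputs/output1_2_lpis_gpkg",
    ["Austria", "Belgium", "Catalonia", "Denmark", "Netherlands"],
    [2019, 2020],
)

# Report any file the loading loop would fail on.
for path in missing_files(paths):
    print(f"Missing: {path}")
```

Running this check first avoids a failure partway through the loop, which matters because the larger files take a while to read.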


In [14]:
from pathlib import Path
import geopandas as gpd
import dask_geopandas

In [15]:
aois = ['Austria', 'Belgium', 'Catalonia', 'Denmark', 'Netherlands']
years = [2019, 2020]

In [8]:
local_path_dir = 'AgriSen-COG/intermediate_outputs/output1_2_lpis_gpkg'

In [9]:
local_path = Path(local_path_dir)

In [12]:
for aoi in aois:
    for year in years:
        # Files follow the layout <local_path>/<AOI>/<AOI><year>.gpkg
        file_name = local_path / aoi / f'{aoi}{year}.gpkg'
        df_gpd = gpd.read_file(file_name)
        print(f'{aoi} {year} has {df_gpd.shape[0]} rows.')

Austria 2019 has 2529896 rows.
Austria 2020 has 2614636 rows.
Belgium 2019 has 521133 rows.
Belgium 2020 has 525592 rows.
Catalonia 2019 has 644376 rows.
Catalonia 2020 has 639444 rows.
Denmark 2019 has 587473 rows.
Denmark 2020 has 583936 rows.
Netherlands 2019 has 819146 rows.
Netherlands 2020 has 773139 rows.
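To make the change between years explicit, the counts printed above can be compared directly. This is a minimal sketch: the numbers are copied by hand from the output above, and `year_over_year` is a hypothetical helper, not part of the original notebook.

```python
# Row counts copied from the output above: {aoi: (rows_2019, rows_2020)}
row_counts = {
    "Austria": (2529896, 2614636),
    "Belgium": (521133, 525592),
    "Catalonia": (644376, 639444),
    "Denmark": (587473, 583936),
    "Netherlands": (819146, 773139),
}


def year_over_year(counts):
    """Return {aoi: (absolute_change, percent_change)} from 2019 to 2020."""
    changes = {}
    for aoi, (before, after) in counts.items():
        diff = after - before
        changes[aoi] = (diff, 100.0 * diff / before)
    return changes


changes = year_over_year(row_counts)
for aoi, (diff, pct) in changes.items():
    print(f"{aoi}: {diff:+d} rows ({pct:+.2f}%)")
```

Austria and Belgium gain rows between 2019 and 2020 (84740 and 4459 respectively), while Catalonia, Denmark and the Netherlands lose rows (4932, 3537 and 46007).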


## Local Partitioned Parquet files

## S3 bucket Partitioned Parquet files

In [19]:
# Anonymous access to the S3-compatible endpoint that hosts the bucket
storage_options = {
    "anon": True,
    "client_kwargs": {"endpoint_url": 'https://s3-3.services.tselea.info.uvt.ro'},
}

for aoi in aois:
    for year in years:
        file_name = f's3://agrisen-cog-v1/LPIS_processing/original_files/{aoi}_{year}_distrib.parquet'
        df_dgpd = dask_geopandas.read_parquet(file_name, storage_options=storage_options)
        # shape[0] is lazy in Dask; compute() triggers the actual row count
        print(f'{aoi} {year} has {df_dgpd.shape[0].compute()} rows.')

Austria 2019 has 2529896 rows.
Austria 2020 has 2614636 rows.
Belgium 2019 has 521133 rows.
Belgium 2020 has 525592 rows.
Catalonia 2019 has 644376 rows.
Catalonia 2020 has 639444 rows.
Denmark 2019 has 587473 rows.
Denmark 2020 has 583936 rows.
Netherlands 2019 has 819146 rows.
Netherlands 2020 has 773139 rows.
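The Parquet counts can be checked against the GeoPackage counts to confirm that no rows were lost in the conversion. The sketch below parses the two printed outputs from this notebook and reports any differences. `parse_counts` and `mismatches` are hypothetical helpers, and the line pattern assumes the exact print format used above.

```python
import re

# Matches lines such as "Austria 2019 has 2529896 rows."
LINE_PATTERN = re.compile(r"^(\w+) (\d{4}) has (\d+) rows\.$")


def parse_counts(text):
    """Parse printed lines into {(aoi, year): rows}; other lines are ignored."""
    counts = {}
    for line in text.strip().splitlines():
        match = LINE_PATTERN.match(line.strip())
        if match:
            aoi, year, rows = match.groups()
            counts[(aoi, int(year))] = int(rows)
    return counts


def mismatches(left, right):
    """Return {(aoi, year): (left_rows, right_rows)} wherever the counts differ."""
    keys = sorted(set(left) | set(right))
    return {k: (left.get(k), right.get(k)) for k in keys if left.get(k) != right.get(k)}


gpkg_output = """
Austria 2019 has 2529896 rows.
Austria 2020 has 2614636 rows.
Belgium 2019 has 521133 rows.
Belgium 2020 has 525592 rows.
Catalonia 2019 has 644376 rows.
Catalonia 2020 has 639444 rows.
Denmark 2019 has 587473 rows.
Denmark 2020 has 583936 rows.
Netherlands 2019 has 819146 rows.
Netherlands 2020 has 773139 rows.
"""

# The Parquet loop printed the same ten lines in this notebook.
parquet_output = gpkg_output

gpkg_counts = parse_counts(gpkg_output)
parquet_counts = parse_counts(parquet_output)
print(mismatches(gpkg_counts, parquet_counts) or "All row counts match.")
```

For the outputs shown in this notebook the two sets of counts are identical, so no mismatches are reported.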
