# Water Productivity through Open-access of Remotely sensed derived data (WaPOR)

**Date modified:** 21 December 2024

## Product Overview

### Background

The **Wa**ter **P**roductivity through **O**pen access of **R**emotely sensed derived data **(WaPOR)** monitors and reports on agricultural water productivity through biophysical measures, with a focus on Africa and the Near East. This information helps partner countries improve land and water productivity in both rainfed and irrigated agriculture. WaPOR is developed and produced by the Food and Agriculture Organization (FAO) of the United Nations.

WaPOR provides numerous datasets related to vegetation productivity and water consumption, and associated meteorological and physical conditions such as soil moisture and precipitation. These datasets can be combined with Digital Earth Africa products, services, and workflows for numerous applications including:

* Monitoring drought conditions
* Monitoring the water use efficiency of crops
* Mapping irrigated areas
* Estimating crop water requirements
* Irrigation scheduling and budgeting

The applications of WaPOR data go beyond water productivity and agriculture. Any application that would benefit from hydrological information and/or vegetation productivity could use WaPOR data.

### Specifications

#### Spatial and temporal coverage

WaPOR data has three levels:

1. Global 300m resolution
2. National 100m resolution (note that this now covers the African continent)
3. Sub-national 20m resolution

The table below covers L1 and L2 datasets. L3 datasets can be viewed in the [WaPOR maps platform](https://data.apps.fao.org/wapor/?lang=en), which is built with the same software as [Digital Earth Africa Maps](https://maps.digitalearth.africa/). L3 datasets cover several regions of interest in northern and eastern Africa. This notebook loads level 3 20m data for Egypt. We recommend checking the [WaPOR maps platform](https://data.apps.fao.org/wapor) for the availability of level, variable, and temporal frequency combinations in your area of interest. The maps platform also shows mapset codes in each data description.

Mapset codes are structured as `level-variable-temporal frequency`, as shown below. The temporal frequencies available are:

* A - annual
* M - monthly
* D - dekadal (10 days)

So, for level 3 net primary productivity at dekadal intervals the code is `L3-NPP-D`.
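
As a rough illustration of the naming scheme above, the following Python sketch splits a mapset code into its three parts. The function `parse_mapset_code` and the lookup dictionaries are hypothetical helpers written for this page and are not part of any WaPOR client library. The daily frequency letter `E` is taken from the codes listed in Table 1.

```python
# Illustrative helper only: not part of any official WaPOR tooling.
LEVELS = {"L1": "Global", "L2": "National", "L3": "Sub-national"}
FREQUENCIES = {"A": "annual", "M": "monthly", "D": "dekadal", "E": "daily"}


def parse_mapset_code(code):
    """Split a code such as 'L3-NPP-D' into level, variable and frequency."""
    parts = code.split("-")
    if len(parts) < 3:
        raise ValueError(f"Expected level-variable-frequency, got {code!r}")
    level, frequency = parts[0], parts[-1]
    # Variables may themselves contain hyphens, e.g. 'QUAL-NDVI'.
    variable = "-".join(parts[1:-1])
    if level not in LEVELS or frequency not in FREQUENCIES:
        raise ValueError(f"Unrecognised level or frequency in {code!r}")
    return {
        "level": level,
        "variable": variable,
        "frequency": FREQUENCIES[frequency],
    }


print(parse_mapset_code("L3-NPP-D"))
print(parse_mapset_code("L1-QUAL-NDVI-D"))
```

Taking the variable as everything between the first and last parts keeps multi-part variables such as `QUAL-NDVI` intact.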

**Table 1: L1 and L2 dataset details.**

|Mapset Code|Mapset Description|Units|Scale Factor|
|---|---|--------------|----|
|L1-AETI-A|Actual EvapoTranspiration and Interception (Global - Annual - 300m)|mm/year|0.1|
|L1-AETI-D|Actual EvapoTranspiration and Interception (Global - Dekadal - 300m)|mm/day|0.1|
|L1-AETI-M|Actual EvapoTranspiration and Interception (Global - Monthly - 300m)|mm/month|0.1|
|L1-E-A|Evaporation (Global - Annual - 300m)|mm/year|0.1|
|L1-E-D|Evaporation (Global - Dekadal - 300m)|mm/day|0.1|
|L1-GBWP-A|Gross biomass water productivity (Annual - 300m)|kg/m$^3$|0.001|
|L1-I-A|Interception (Global - Annual - 300m)|mm/year|0.1|
|L1-I-D|Interception (Global - Dekadal - 300m)|mm/day|0.1|
|L1-NBWP-A|Net biomass water productivity (Annual - 300m)|kg/m$^3$|0.001|
|L1-NPP-D|Net Primary Production (Global - Dekadal - 300m)|gC/m$^2$/day|0.001|
|L1-NPP-M|Net Primary Production (Global - Monthly - 300m)|gC/m$^2$/month|0.001|
|L1-PCP-A|Precipitation (Global - Annual - Approximately 5km)|mm/year|0.1|
|L1-PCP-D|Precipitation (Global - Dekadal - Approximately 5km)|mm/day|0.1|
|L1-PCP-E|Precipitation (Global - Daily - Approximately 5km)|mm|0.1|
|L1-PCP-M|Precipitation (Global - Monthly - Approximately 5km)|mm/month|0.1|
|L1-QUAL-LST-D|Quality land surface temperature (Global - Dekadal - 300m)|d|1|
|L1-QUAL-NDVI-D|Quality of Normalized Difference Vegetation Index (Global - Dekadal - 300m)|Number of dekads (D)||
|L1-RET-A|Reference Evapotranspiration (Global - Annual - Approximately 30km)|mm/year|0.1|
|L1-RET-D|Reference Evapotranspiration (Global - Dekadal - Approximately 30km)|mm/day|0.1|
|L1-RET-E|Reference Evapotranspiration (Global - Daily - Approximately 30km)|mm/day|0.1|
|L1-RET-M|Reference Evapotranspiration (Global - Monthly - Approximately 30km)|mm/month|0.1|
|L1-RSM-D|Relative Soil Moisture (Global - Dekadal - 300m)|%|0.001|
|L1-T-A|Transpiration (Global - Annual - 300m)|mm/year|0.1|
|L1-T-D|Transpiration (Global - Dekadal - 300m)|mm/day|0.1|
|L1-TBP-A|Total Biomass Production (Global - Annual - 300m)|kg/ha|1|
|L2-AETI-A|Actual EvapoTranspiration and Interception (National - Annual - 100m)|mm/year|0.1|
|L2-AETI-D|Actual EvapoTranspiration and Interception (National - Dekadal - 100m)|mm/day|0.1|
|L2-AETI-M|Actual EvapoTranspiration and Interception (National - Monthly - 100m)|mm/month|0.1|
|L2-E-A|Evaporation (National - Annual - 100m)|mm/year|0.1|
|L2-E-D|Evaporation (National - Dekadal - 100m)|mm/day|0.1|
|L2-GBWP-A|Gross biomass water productivity (Annual - 100m)|kg/m$^3$|0.001|
|L2-I-A|Interception (National - Annual - 100m)|mm/year|0.1|
|L2-I-D|Interception (National - Dekadal - 100m)|mm/day|0.1|
|L2-NBWP-A|Net biomass water productivity (Annual - 100m)|kg/m$^3$|0.001|
|L2-NPP-D|Net Primary Production (National - Dekadal - 100m)|gC/m$^2$/day|0.001|
|L2-NPP-M|Net Primary Production (National - Monthly - 100m)|gC/m$^2$/month|0.001|
|L2-QUAL-NDVI-D|Quality of normalized difference vegetation index (National - Dekadal - 100m)|d|1|
|L2-RSM-D|Relative Soil Moisture (National - Dekadal - 100m)|%|0.001|
|L2-T-A|Transpiration (National - Annual - 100m)|mm/year|0.1|
|L2-T-D|Transpiration (National - Dekadal - 100m)|mm/day|0.1|
|L2-TBP-A|Total Biomass Production (National - Annual - 100m)|kg/ha|1|
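
Scale factors such as those in Table 1 are commonly applied by multiplying the stored value by the factor to obtain the physical quantity. The sketch below assumes that convention. The no-data value of `-9999` is a placeholder assumption, so read the actual value from the metadata of the file you are working with. The function name `to_physical` is hypothetical.

```python
# Subset of Table 1; extend with the mapsets you need.
SCALE_FACTORS = {"L1-AETI-D": 0.1, "L2-NPP-D": 0.001, "L2-TBP-A": 1}

NODATA = -9999  # assumed placeholder; read the real value from the file metadata


def to_physical(raw_values, mapset_code, nodata=NODATA):
    """Convert stored values to physical units, mapping no-data to None."""
    factor = SCALE_FACTORS[mapset_code]
    return [None if value == nodata else value * factor for value in raw_values]


# Three raw L1-AETI-D pixels, converted to mm/day.
print(to_physical([25, 40, -9999], "L1-AETI-D"))
```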

Data is available for the region shaded in blue.

**Figure 1: World Settlement Footprint product geographic extent**

<img src="../_static/data_specs/ESA_WorldCover_specs/esa_worldcover_geographic_extent.png" alt="ESA WorldCover Geographic Extent" width="300" align="left"/>

#### Measurements

**Table 2: World Settlement Footprint product measurements**

| Band ID | Description | Units | Data type | No data$^\dagger$ |
|:------------------------------|:--------------|:--------|:--------|---------:|
| wsf_2015 | World Settlement Footprint 2015 | 1 | uint8 | 0.0 |
| wsf_2019 | World Settlement Footprint 2019 | 1 | uint8 | 0.0 |
| wsfevolution | World Settlement Footprint Evolution | 1 | uint8 | 0.0 |
| idcscore | Input Data Consistency score | 1 | uint8 | 0.0 |

### Processing

The **World Settlement Footprint WSF 2015 version 2** (WSF2015 v2) is a 10m resolution binary mask outlining the extent of human settlements globally for the year 2015. Specifically, the WSF2015 v2 is a pilot product generated by combining multiple datasets:

* The **World Settlement Footprint (WSF) 2015**, derived at 10m spatial resolution from 2014-2015 multitemporal Landsat-8 and Sentinel-1 imagery (of which ~217K and ~107K scenes were processed, respectively).
* The High Resolution Settlement Layer (HRSL), generated by the Connectivity Lab team at Facebook using 2016 DigitalGlobe VHR satellite imagery and publicly released at 30m spatial resolution for 214 countries (Marconcini et al., 2020).

The **World Settlement Footprint (WSF) 2019** is a 10m resolution binary mask outlining the extent of human settlements globally, derived from 2019 multitemporal Sentinel-1 (S1) and Sentinel-2 (S2) imagery. Based on the hypothesis that settlements generally show more stable behavior than most land-cover classes, temporal statistics are calculated for both S1- and S2-based indices. In particular, a comprehensive analysis was performed using a number of reference building outlines to identify the most suitable set of temporal features (ultimately including 6 from S1 and 25 from S2). Training points for the settlement and non-settlement classes are then generated by thresholding specific features, with thresholds that vary depending on the 30 climate types of the well-established Köppen Geiger scheme. Next, binary classification based on Random Forest is applied and, finally, a dedicated post-processing step uses ancillary datasets to further reduce omission and commission errors. The whole classification process was carried out within the Google Earth Engine platform. To assess the accuracy and reliability of the WSF2019, two independent crowdsourcing-based validation exercises were carried out with the support of Google and Mapswipe, respectively, in which overall 1M reference labels were collected based on photointerpretation of very high-resolution optical imagery (Marconcini et al., 2021).

The **World Settlement Footprint (WSF) Evolution** is a 30m resolution dataset outlining the global settlement extent on a yearly basis from 1985 to 2015. Based on the assumption that settlement growth occurred over time, all pixels categorized as non-settlement in the WSF2015 (Marconcini et al., 2020) are excluded a priori from the analysis. Next, for each target year in the past, all available Landsat-5/7 scenes acquired over the given area of interest are gathered, and key temporal statistics (i.e., temporal mean, minimum, maximum, etc.) are extracted for different spectral indices. Among others, these include the normalized difference built-up index (NDBI), the normalized difference vegetation index (NDVI) and the modified normalized difference water index (MNDWI). Temporal features proved generally robust if computed over at least 7 clear (cloud-free and cloud-shadow-free) observations; accordingly, if this constraint is not satisfied for a given pixel in the target year, the time frame is enlarged backwards (in 1-year steps) until the condition is met.
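
To make the indices and temporal statistics mentioned above more concrete, here is a minimal Python sketch using the standard normalized-difference definitions of NDVI, NDBI and MNDWI. It is not the WSF production code: the function names and the reflectance inputs (`green`, `red`, `nir`, `swir1`) are assumptions for illustration, and only the minimum of 7 clear observations comes from the text.

```python
from statistics import mean

MIN_CLEAR_OBS = 7  # minimum number of clear observations quoted in the text


def normalized_difference(a, b):
    """Generic normalized difference (a - b) / (a + b); None if undefined."""
    total = a + b
    return (a - b) / total if total != 0 else None


def spectral_indices(green, red, nir, swir1):
    """Standard index definitions computed from reflectance values."""
    return {
        "NDVI": normalized_difference(nir, red),
        "NDBI": normalized_difference(swir1, nir),
        "MNDWI": normalized_difference(green, swir1),
    }


def temporal_stats(values):
    """Temporal mean, minimum and maximum, or None with too few observations."""
    clear = [v for v in values if v is not None]  # None marks a cloudy date
    if len(clear) < MIN_CLEAR_OBS:
        return None  # the text describes widening the time window in this case
    return {"mean": mean(clear), "min": min(clear), "max": max(clear)}


# One pixel's NDVI over a year, with two cloudy acquisitions.
ndvi_series = [0.21, None, 0.25, 0.30, 0.28, None, 0.24, 0.22, 0.26]
print(temporal_stats(ndvi_series))
```

Returning `None` when fewer than 7 clear values are available leaves the decision about extending the time window to the caller.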

Starting backwards from the year 2015, for which the WSF2015 is used as a reference, settlement and non-settlement training samples for the given target year t are iteratively extracted by applying morphological filtering to the settlement mask derived for the year t+1, and by excluding potentially mislabeled samples through adaptive thresholding of the temporal mean NDBI, MNDWI and NDVI. Finally, binary Random Forest classification is performed.

To quantitatively assess the high accuracy and reliability of the dataset, an extensive campaign based on crowdsourcing photointerpretation of very high-resolution airborne and satellite historical imagery has been performed with the support of Google. In particular, for the years 1990, 1995, 2000, 2005, 2010 and 2015, ~200K reference cells of 30x30m size distributed over 100 sites around the world have been labelled, hence summing up to overall ~1.2M validation samples.

It is worth noting that past Landsat-5/7 availability considerably varies across the world and over time. Independently from the implemented approach, this might then result in a lower quality of the final product where few/no scenes have been collected. Accordingly, to provide the users with a suitable and intuitive measure that accounts for the goodness of the Landsat imagery, we conceived the Input Data Consistency (IDC) score, which ranges from 6 to 1 with: 6) very good; 5) good; 4) fair; 3) moderate; 2) low; 1) very low. The IDC score is available on a yearly basis between 1985 and 2015 and supports a proper interpretation of the WSF evolution product.

The WSF evolution and IDC score datasets are organized in 5138 GeoTIFF files (EPSG:4326 projection), each covering a 2 x 2 degree tile (~222 x 222 km) on the ground. WSF evolution values range between 1985 and 2015, corresponding to the estimated year of settlement detection, whereas 0 is no data. A comprehensive publication with all technical details and accuracy figures is currently being finalized. For the time being, please refer to [Marconcini et al., 2021](https://austriaca.at/0xc1aa5576%200x003c9b4c.pdf).
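
The value encoding described above (detection year between 1985 and 2015, 0 for no data, IDC score from 1 to 6) can be interpreted along the following lines. This is a hedged sketch: the helper names and the `min_idc=4` threshold are assumptions chosen for illustration, not recommendations from the data providers.

```python
NODATA = 0  # no-data value stated for the WSF evolution product

IDC_LABELS = {
    6: "very good", 5: "good", 4: "fair",
    3: "moderate", 2: "low", 1: "very low",
}


def settled_by(detection_year, target_year):
    """True if the pixel was detected as settlement in or before target_year."""
    return detection_year != NODATA and detection_year <= target_year


def reliable_settlement_mask(evolution, idc, target_year, min_idc=4):
    """Settlement mask for target_year, keeping pixels with IDC >= min_idc."""
    return [
        settled_by(year, target_year) and score >= min_idc
        for year, score in zip(evolution, idc)
    ]


evolution = [0, 1990, 2010, 1985]  # detection year per pixel
idc = [6, 5, 6, 2]                 # IDC score for the same pixels
print(reliable_settlement_mask(evolution, idc, target_year=2000))
print(IDC_LABELS[5])
```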

### Media and example images

**Figure 2: World Settlement Footprint over Kumasi, Ghana**

<img src="../_static/data_specs/WSF/WorldSettlementFootprint_1.png" alt="wsf 2015 and 2019" width="600" align="left"/>

**Figure 3: World Settlement Footprint over Harare, Zimbabwe**

<img src="../_static/data_specs/WSF/WorldSettlementFootprint_2.png" alt="wsf 2015 and 2019" width="600" align="left"/>

**Figure 4: World Settlement Footprint Evolution over Mansoura, Egypt with IDC score for the image**

<img src="../_static/data_specs/WSF/WorldSettlementFootprinEvolution.png" alt="World Settlement Footprint Evolution" width="600" align="left"/>

### References

Mapping our human footprint from space (Accessed on 2023 August). ESA - Mapping Our Human Footprint From Space. https://www.esa.int/Applications/Observing_the_Earth/Mapping_our_human_footprint_from_space

Marconcini, M., Metz-Marconcini, A., Üreyen, S. et al. Outlining where humans live, the World Settlement Footprint 2015. Sci Data 7, 242 (2020). https://doi.org/10.1038/s41597-020-00580-5

Marconcini, M., Metz-Marconcini, A., Esch, T. and Gorelick, N. Understanding Current Trends in Global Urbanisation - The World Settlement Footprint Suite. GI_Forum 2021, Issue 1, 33-38 (2021). https://austriaca.at/0xc1aa5576%200x003c9b4c.pdf

G.D.Team (Accessed on 2023 August). EOC Geoservice Map Contexts. https://geoservice.dlr.de/web/maps

### License and Acknowledgements

The World Settlement Footprint is provided free of charge, without restriction of use. For the full license information see the [Creative Commons Attribution 4.0 International License](https://creativecommons.org/licenses/by/4.0/).

## Data Access

### Amazon Web Services

The World Settlement Footprint 10m 2015, 2019 and Evolution products are available in AWS S3.

**Table 3: AWS data access details.**

|AWS S3 details | |
|----------|-------------|
|Bucket ARN | `arn:aws:s3:::wsf_{year}`|
|Bucket | `deafrica-input-datasets` |

The products are hosted under the `wsf_{year}` prefix in the `deafrica-input-datasets` AWS S3 bucket.

The file paths follow the format:
`s3://deafrica-input-datasets/wsf_{year}/`
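
As a small illustration of the path format above, the sketch below builds the prefix for a given year. The helper `wsf_s3_prefix` is hypothetical, and the list of years is limited to those named in this document. Listing or downloading objects would additionally need an S3 client, which is not shown here.

```python
BUCKET = "deafrica-input-datasets"
YEARS = (2015, 2019)  # years named in this document


def wsf_s3_prefix(year):
    """Return the S3 prefix for the yearly WSF product."""
    if year not in YEARS:
        raise ValueError(f"No WSF product listed for {year}")
    return f"s3://{BUCKET}/wsf_{year}/"


print(wsf_s3_prefix(2019))
```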

### OGC Web Services (OWS)

The World Settlement Footprint products `wsf_2015`, `wsf_2019` and `wsf_evolution` are available through Digital Earth Africa's OWS.

**Table 4: OWS data access details.**

|OWS details | |
|----------|-------------|
|Name | `DE Africa Services` |
|Web Map Services (WMS) URL | `https://ows.digitalearth.africa/wms` |
| Web Coverage Service (WCS) URL | `https://ows.digitalearth.africa/wcs`|
| Layer name | `wsf_{year}` |

Digital Earth Africa OWS details can be found at [https://ows.digitalearth.africa/](https://ows.digitalearth.africa/).

For instructions on how to connect to OWS, see [this tutorial](../web_services/index.ipynb).
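
For readers who prefer to request a map image directly, the following sketch builds a standard WMS 1.3.0 `GetMap` URL against the endpoint in Table 4. It only constructs the URL and does not contact the server. The function name, image size and the approximate bounding box around Kumasi are assumptions for illustration, and the layer name `wsf_2019` follows the `wsf_{year}` pattern from the table.

```python
from urllib.parse import urlencode

WMS_URL = "https://ows.digitalearth.africa/wms"


def wms_getmap_url(layer, bbox, width=512, height=512):
    """Build a GetMap URL; bbox is (min_lon, min_lat, max_lon, max_lat)."""
    min_lon, min_lat, max_lon, max_lat = bbox
    params = {
        "service": "WMS",
        "version": "1.3.0",
        "request": "GetMap",
        "layers": layer,
        "styles": "",
        "crs": "EPSG:4326",
        # WMS 1.3.0 expects latitude,longitude axis order for EPSG:4326.
        "bbox": f"{min_lat},{min_lon},{max_lat},{max_lon}",
        "width": width,
        "height": height,
        "format": "image/png",
    }
    return f"{WMS_URL}?{urlencode(params)}"


# Approximate bounding box around Kumasi, Ghana (illustrative values).
print(wms_getmap_url("wsf_2019", (-1.75, 6.60, -1.50, 6.80)))
```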

### Open Data Cube (ODC)

The World Settlement Footprint product can be accessed through the Digital Earth Africa ODC API, which is available through the [Digital Earth Africa Sandbox](https://sandbox.digitalearth.africa/hub/login).

**ODC product name:** `wsf_{year}` and `wsf_evolution`

The `wsf_{year}` product contains a single band, which can be loaded using its default name, `wsf{year}`, as listed in the table below.
ODC `Datacube.load` commands without specified bands will load the `wsf_{year}` band.

**Table 5: ODC product World Settlement Footprint band names.**

|Band name| Alternative names| Fill value |
| :-: | :-: | :-: |
| wsf{year}                | NaN        |      `0.0` |
| wsfevolution                | NaN        |      `0.0` |
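
A minimal sketch of loading the product through the ODC API is given below. It assumes an environment where the `datacube` package is installed and configured, such as the Digital Earth Africa Sandbox. The helper names, the area of interest, and the `output_crs` and `resolution` values are assumptions for illustration; adjust them to your needs or omit them if the product defines suitable defaults.

```python
def build_wsf_query(year, lon_range, lat_range):
    """Assemble keyword arguments for Datacube.load (illustrative values)."""
    return {
        "product": f"wsf_{year}",
        "x": lon_range,            # longitude range in degrees
        "y": lat_range,            # latitude range in degrees
        "output_crs": "EPSG:6933", # assumed equal-area CRS for the output
        "resolution": (-10, 10),   # assumed 10m pixels in output CRS units
    }


def load_wsf(year, lon_range, lat_range):
    """Load the WSF product; requires a configured ODC environment."""
    import datacube  # imported here so the query helper works without ODC

    dc = datacube.Datacube(app="wsf_example")
    return dc.load(**build_wsf_query(year, lon_range, lat_range))


# Inspect the query for an illustrative area near Kumasi, Ghana.
print(build_wsf_query(2019, (-1.75, -1.50), (6.60, 6.80)))
```

Calling `load_wsf(2019, (-1.75, -1.50), (6.60, 6.80))` in the Sandbox would then return the data as an `xarray.Dataset`.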