# Introduction to products and measurements

- **Products used:**
[ls8_usgs_sr_scene](https://explorer.digitalearth.africa/ls8_usgs_sr_scene)

- **Prerequisites:** Users of this notebook should have a basic understanding of:
  - How to run a [Jupyter notebook](01_Jupyter_notebooks.ipynb)


## Background

A "datacube" is a digital information architecture that specialises in hosting and cataloguing spatial information.
[Digital Earth Africa (DE Africa)] is based on the [Open Data Cube] infrastructure, and specialises in storing
remotely sensed data, particularly from Earth Observation satellites such as [Landsat] and [Sentinel].

The DE Africa datacube contains both raw satellite data and derivative data "products".
These data products are often composed of a range of "measurements", such as the suite of remote sensing band values or
statistical product summaries. Before running a query to load data from the datacube, it is useful to know what it
contains.

This notebook demonstrates several straightforward ways to inspect the product and measurement contents of a datacube.

[Digital Earth Africa (DE Africa)]: https://www.digitalearthafrica.org/
[Open Data Cube]: https://www.opendatacube.org/
[Landsat]: https://landsat.gsfc.nasa.gov/
[Sentinel]: http://www.esa.int/Applications/Observing_the_Earth/Copernicus/Overview4


## Description

This notebook demonstrates how to connect to the DE Africa datacube and interrogate the available products
and measurements stored within.

Topics covered include:

- How to connect to a datacube
- How to list all the products
- How to list a selected product's measurements
- How to interactively visualise data in the datacube


## Getting started

To run this introduction to products and measurements, run all the cells in the notebook starting with the
"Load packages" cell. For help with running notebook cells, refer back to the [Jupyter Notebooks notebook](01_Jupyter_notebooks.ipynb).


### Load packages

The `datacube` package is required to access and work with available data.
The `pandas` package is required to format tables.
The `DcViewer` utility will allow us to interactively explore the products available in the datacube.

In [None]:
import datacube
import pandas as pd
from odc.ui import DcViewer

# Set some configurations for displaying tables nicely
pd.set_option('display.max_colwidth', 200)
pd.set_option('display.max_rows', None)
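
The `pd.set_option` calls above change the display settings for the rest of the session. As a minimal sketch of what `display.max_colwidth` does, the example below uses `pd.option_context` to apply the setting temporarily to a small hand-built table. The table and its text are hypothetical and exist only for illustration.

```python
import pandas as pd

# Hypothetical one-row table; the description text is illustrative only
long_text = "Illustrative product description that exceeds twenty characters"
table = pd.DataFrame({"description": [long_text]})

# With a narrow column width, long cell values are truncated with "..."
with pd.option_context("display.max_colwidth", 20):
    short_view = repr(table)

# With a wide column width, the full text is shown
with pd.option_context("display.max_colwidth", 200):
    full_view = repr(table)

print(short_view)
print(full_view)
```

Outside the `with` blocks the previous setting is restored, which can be convenient when only one table needs wide columns.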

### Connect to the datacube

After importing the `datacube` package, users need to specify a name for their session, known as the app name.

This name is chosen by the user and is used to track down issues with database queries. It has no effect
on the analysis. Use a short name that reflects the purpose of your notebook; this notebook uses
`02_Products_and_measurements` as its app name.

The `env` argument selects which datacube configuration environment to connect to (here, `sandbox-eo3`).

The resulting `dc` object is what we use to access all the data contained within the Digital Earth Africa datacube.

In [None]:
dc = datacube.Datacube(
    app="02_Products_and_measurements",
    env="sandbox-eo3",
)
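
One simple way to keep the app name consistent with the notebook is to derive it from the notebook's file name. The sketch below assumes the file name is known as a string; `app_name_from_notebook` is a hypothetical helper written for this example and is not part of the `datacube` package.

```python
from pathlib import Path


def app_name_from_notebook(notebook_path):
    """Return the notebook file name without its extension (hypothetical helper)."""
    return Path(notebook_path).stem


app_name = app_name_from_notebook("02_Products_and_measurements.ipynb")
print(app_name)

# The resulting string could then be passed as the app name, for example:
# dc = datacube.Datacube(app=app_name)
```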

## List products

Once a datacube instance has been created, users can explore the products and measurements stored within.

The following cell lists all product attributes currently available in the DE Africa datacube by reading the
`columns` attribute of the table returned by `dc.list_products()`.

In [None]:
dc.list_products().columns

Any of these columns can be used to customise the product information shown from the `dc.list_products()` table, as
demonstrated in the next cell.

The next cell also lists all products that are currently available in the DE Africa datacube by
using the `dc.list_products()` function.

Products listed under **name** in the following table represent the product options available when querying the
datacube. The table below provides some useful information about each product, including a brief product
**description**, the **instrument** and **platform** the data originated from (e.g. Landsat 8 OLI), and the product's
default **crs** (coordinate reference system) and **resolution** if applicable.

In [None]:
products = dc.list_products()

display_columns = [
    "name",
    "description",
    "platform",
    "instrument",
    "crs",
    "resolution"
]

products[display_columns].sort_index()
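
Because the product listing is a `pandas` table, ordinary `pandas` operations can be used to narrow it down. The sketch below runs on a small hand-built stand-in table so that it works without a database connection. Every row in it is hypothetical, and `filter_products` is an illustrative helper, not a `datacube` function. It first keeps only the requested columns that actually exist (indexing with a missing column name raises a `KeyError`), then filters rows by a text match.

```python
import pandas as pd

# Hypothetical stand-in for the table returned by dc.list_products();
# these rows are illustrative only and are not real catalogue entries.
products = pd.DataFrame(
    {
        "name": ["example_landsat_sr", "example_sentinel_msi", "example_landsat_wofs"],
        "description": ["Example reflectance", "Example imagery", "Example water summary"],
        "platform": ["LANDSAT_8", "SENTINEL_2", "LANDSAT_8"],
    }
)

# Keep only the requested columns that are present in the table
wanted_columns = ["name", "description", "platform", "resolution"]
available_columns = [col for col in wanted_columns if col in products.columns]


def filter_products(table, column, text):
    """Return rows whose `column` contains `text`, ignoring case (hypothetical helper)."""
    mask = table[column].str.contains(text, case=False, na=False, regex=False)
    return table[mask]


landsat_products = filter_products(products[available_columns], "platform", "landsat")
print(landsat_products)
```

With a live connection, the same steps could be applied to the table returned by `dc.list_products()` instead of the stand-in.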

## List measurements

Most products are associated with a range of available measurements.
These can be individual satellite bands (e.g. Landsat's near-infrared band) or statistical product summaries.

Using the **name** column of products listed above, let's interrogate the measurements associated with the
`ls8_usgs_sr_scene` product using the `dc.list_measurements()` function.

This product name refers to the US Geological Survey's Landsat 8 Analysis-ready data product.

The table below includes a range of technical information about each band in the dataset, including any **aliases**
which can be used to load the data, the data type or **dtype**, any **flags_definition** that are associated with the
measurement (this information is used for tasks like cloud masking), and the measurement's **nodata** value.

Change the `product` name below and re-run the following cell to explore available measurements associated with other
products.

In [None]:
# product = "spot6_ard_scene"
product = "test_spot6_gauteng_old_eo3"

measurements = dc.list_measurements()
measurements.loc[product]
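
The `measurements.loc[product]` step works because the measurements table is indexed by product name and measurement name. The sketch below reproduces that layout with a small hand-built stand-in table so it can run without a database connection. All product names, measurement names, and values in it are hypothetical.

```python
import pandas as pd

# Hypothetical stand-in for dc.list_measurements(): rows are indexed by
# (product, measurement). All names and values are illustrative only.
index = pd.MultiIndex.from_tuples(
    [
        ("example_product_a", "red"),
        ("example_product_a", "nir"),
        ("example_product_b", "water"),
    ],
    names=["product", "measurement"],
)
measurements = pd.DataFrame(
    {
        "dtype": ["int16", "int16", "uint8"],
        "units": ["reflectance", "reflectance", "1"],
        "nodata": [-9999, -9999, 255],
    },
    index=index,
)

product = "example_product_a"

# Selecting on the first index level leaves a table indexed by measurement name
product_measurements = measurements.loc[product]

# The measurement names can then be collected as a plain list
band_names = product_measurements.index.tolist()
print(band_names)
print(product_measurements[["dtype", "nodata"]])
```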

## Visualising available data

For a more visual way of exploring the data that is available within the Digital Earth Africa datacube, we can use the
interactive `DcViewer` utility or the online [DE Africa Explorer] website.

We will use the `DcViewer` utility in this exercise.

Select a product from the drop-down menu on the top-left of the map to show, in blue, the areas for which data is
available. You can also use the back and forward buttons above the map to step through time.

The utility is only able to visualise a limited number of datasets at one time.
If the available data footprints do not appear, either press the "show" button on the top right, or zoom further in on
the map.

[DE Africa Explorer]: https://explorer.digitalearth.africa/ls8_usgs_sr_scene

In [None]:
DcViewer(
    dc=dc,
    time='2016',
    products=[product],
    center=(-26.050, 28.190),
    zoom=7
)

## Recommended next steps

For more advanced information about working with Jupyter Notebooks or JupyterLab, you can explore the
[JupyterLab documentation page](https://jupyterlab.readthedocs.io/en/stable/user/notebook.html).

To continue with this beginner's guide, work through the notebooks in the following order:

1. [Jupyter Notebooks](01_Jupyter_notebooks.ipynb)
2. **Products and measurements (this notebook)**
3. [Loading data](03_Loading_data.ipynb)
4. [Plotting](04_Plotting.ipynb)
5. [Performing a basic analysis](05_Basic_analysis.ipynb)
6. [Introduction to numpy](06_Intro_to_numpy.ipynb)
7. [Introduction to xarray](07_Intro_to_xarray.ipynb)
8. [Parallel processing with Dask](08_Parallel_processing_with_dask.ipynb)

Once you have completed the above eight tutorials, join advanced users in exploring:

- The "Datasets" directory in the repository, where you can explore DE Africa products in depth.
- The "Frequently used code" directory, which contains a recipe book of common techniques and methods for analysing
  DE Africa data.
- The "Real-world examples" directory, which provides more complex workflows and analysis case studies.