# Introduction to products and measurements <img align="right" src="../Supplementary_data/dea_logo.jpg">

* **Acknowledgement**: This notebook was originally created by [Digital Earth Australia (DEA)](https://www.ga.gov.au/about/projects/geographic/digital-earth-australia) and has been modified for use in the EY Data Science Program.
* **Products used:** 
[ga_s2a_ard_nbar_granule](https://explorer.sandbox.dea.ga.gov.au/ga_s2a_ard_nbar_granule)
* **Prerequisites:** Users of this notebook should have a basic understanding of:
    * How to run a [Jupyter notebook](01_Jupyter_notebooks.ipynb)
    * The basic structure of the DEA [satellite datasets](02_DEA.ipynb)

## Background
A "datacube" is a digital information architecture that specialises in hosting and cataloguing spatial information.
[Digital Earth Australia (DEA)](https://www.ga.gov.au/dea) is based on the [Open Data Cube](https://www.opendatacube.org/) infrastructure, and specialises in storing remotely sensed data, particularly from Earth Observation satellites such as [Landsat](https://landsat.gsfc.nasa.gov/) and [Sentinel-2](https://www.copernicus.eu/en/about-copernicus/infrastructure/discover-our-satellites).

The DEA datacube contains both raw satellite data and derivative data "products".
These data products are often composed of a range of "measurements" such as the suite of remote sensing band values or statistical product summaries. Before running a query to load data from the datacube, it is useful to know what it contains.
This notebook demonstrates several straightforward ways to inspect the product and measurement contents of a datacube.

## Description
This notebook demonstrates how to connect to a datacube and interrogate the available products and measurements stored within.
Topics covered include:

* How to connect to a datacube
* How to list all the products
* How to list all the product measurements
* How to interactively visualise data in the datacube 

***

## Getting started
To run this introduction to products and measurements, run all the cells in the notebook starting with the "Load packages" cell. For help with running notebook cells, refer back to the [Jupyter Notebooks notebook](01_Jupyter_notebooks.ipynb).

### Load packages
The `datacube` package is required to access and work with available data.
The `pandas` package is required to format tables.
The `DcViewer` utility provides an interface for interactively exploring the products available in the datacube.

In [1]:
import datacube
import pandas as pd
from odc.ui import DcViewer

# Set some configurations for displaying tables nicely
pd.set_option('display.max_colwidth', 200)
pd.set_option('display.max_rows', None)
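
The `pd.set_option` calls above change the display settings for the rest of the session. If you would rather widen tables for a single cell only, a minimal sketch using `pd.option_context` (a standard pandas context manager) might look like the following. The small `demo` table is made up purely for illustration.

```python
import pandas as pd

# Illustrative table only; not real datacube output
demo = pd.DataFrame({"name": ["example_product"],
                     "description": ["A long description " * 10]})

width_before = pd.get_option("display.max_colwidth")

# Options set here apply only inside the `with` block
with pd.option_context("display.max_colwidth", 200,
                       "display.max_rows", None):
    width_inside = pd.get_option("display.max_colwidth")
    print(demo)

# On exit, the previous setting is restored
width_after = pd.get_option("display.max_colwidth")
```

This keeps wide output local to the cell that needs it, at the cost of repeating the `with` block wherever a wide table is printed.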

### Connect to the datacube

After importing the `datacube` package, users need to specify a name for their session, known as the app name.

This name is chosen by the user and is used to track down issues with database queries.
It does not have any effect on the analysis.
Use a short name that reflects the purpose of your notebook; this notebook uses `03_Products_and_measurements` as its app name.

The resulting `dc` object provides access to all the data contained within the Digital Earth Australia datacube.

In [2]:
dc = datacube.Datacube(app="03_Products_and_measurements")
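
Because the app name is only a label, any short descriptive string works. One possible convention, sketched below, is to derive it from the notebook's file name. The helper `app_name_from_path` is hypothetical (it is not part of `datacube`), and the commented-out line shows how its result would be passed to `datacube.Datacube`.

```python
from pathlib import Path


def app_name_from_path(notebook_path):
    """Return the file name without its extension, for use as an app name.

    Hypothetical helper for illustration; not part of the datacube API.
    """
    return Path(notebook_path).stem


app_name = app_name_from_path("03_Products_and_measurements.ipynb")
print(app_name)

# With a configured datacube environment, the name would be used like this:
# dc = datacube.Datacube(app=app_name)
```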

## List products

Once a datacube instance has been created, users can explore the products and measurements stored within.

The following cell lists all products that are currently available in the DEA datacube by using the `dc.list_products()` function. 

Products listed under **name** in the following table represent the product options available when querying the datacube. 
The table below provides some useful information about each product, including a brief product **description**, the **instrument** and **platform** the data originated from (e.g. Landsat 8 OLI), and the product's default **crs** (coordinate reference system) and **resolution** if applicable.

> For a comprehensive product description and access to complete product metadata, see the Geoscience Australia [Content Management Interface](https://cmi.ga.gov.au).



In [3]:
products = dc.list_products()

display_columns = ["name",
                   "description",
                   "platform",
                   "instrument",
                   "crs",
                   "resolution"]

products[display_columns].sort_index()

| id | name | description | platform | instrument | crs | resolution |
|---|---|---|---|---|---|---|
| 1 | ga_s2a_ard_nbar_granule | Sentinel-2A MSI Definitive ARD - NBAR and Pixel Quality | SENTINEL_2A | MSI | | |
| 2 | ga_s2b_ard_nbar_granule | Sentinel-2B MSI Definitive ARD - NBAR and Pixel Quality | SENTINEL_2B | MSI | | |
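
When a datacube holds many products, it can help to search the product table rather than read it in full. The sketch below assumes the table has `name` and `description` columns, as in the output above. `find_products` is a hypothetical helper, and `products_demo` is a stand-in built from the two rows shown above so the example runs without a database connection.

```python
import pandas as pd


def find_products(products, keyword):
    """Return rows whose name or description contains `keyword`.

    Matching is case-insensitive and literal (no regular expressions).
    Hypothetical helper; `products` is a table like dc.list_products().
    """
    in_name = products["name"].str.contains(
        keyword, case=False, regex=False, na=False)
    in_description = products["description"].str.contains(
        keyword, case=False, regex=False, na=False)
    return products[in_name | in_description]


# Stand-in for dc.list_products(), copied from the table above
products_demo = pd.DataFrame(
    {
        "name": ["ga_s2a_ard_nbar_granule", "ga_s2b_ard_nbar_granule"],
        "description": [
            "Sentinel-2A MSI Definitive ARD - NBAR and Pixel Quality",
            "Sentinel-2B MSI Definitive ARD - NBAR and Pixel Quality",
        ],
        "platform": ["SENTINEL_2A", "SENTINEL_2B"],
        "instrument": ["MSI", "MSI"],
    },
    index=pd.Index([1, 2], name="id"),
)

matches = find_products(products_demo, "sentinel-2b")
print(matches["name"].tolist())
```

In the notebook itself, the same helper could be called on the `products` table from the cell above, for example `find_products(products, "sentinel")`.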


## List measurements


Most products are associated with a range of available measurements.
These can be individual satellite bands (e.g. Landsat's near-infrared band) or statistical product summaries.

The `dc.list_measurements()` function can be used to interrogate the measurements associated with a given product (specified by the **name** column from the table above).
For example, `ga_s2a_ard_nbar_granule` refers to the Geoscience Australia Sentinel-2A MSI Definitive Analysis-ready data product.

The table below includes a range of technical information about each measurement in the `ga_s2a_ard_nbar_granule` product, including any **aliases** that can be used to load the data, the data type (**dtype**), any **flags_definition** associated with the measurement (this information is used for tasks like cloud masking), and the measurement's **nodata** value.

Change the `product` name below and re-run the following cell to explore available measurements associated with other products.

In [4]:
product = "ga_s2a_ard_nbar_granule"

measurements = dc.list_measurements()
measurements.loc[product]

| measurement | name | dtype | units | nodata | aliases | flags_definition | spectral_definition |
|---|---|---|---|---|---|---|---|
| azimuthal_exiting | azimuthal_exiting | float32 | 1 | -999 | [azimuthal_exiting] | | |
| azimuthal_incident | azimuthal_incident | float32 | 1 | -999 | [azimuthal_incident] | | |
| exiting | exiting | float32 | 1 | -999 | [exiting] | | |
| incident | incident | float32 | 1 | -999 | [incident] | | |
| relative_azimuth | relative_azimuth | float32 | 1 | -999 | [relative_azimuth] | | |
| relative_slope | relative_slope | float32 | 1 | -999 | [relative_slope] | | |
| satellite_azimuth | satellite_azimuth | float32 | 1 | -999 | [satellite_azimuth] | | |
| satellite_view | satellite_view | float32 | 1 | -999 | [satellite_view] | | |
| solar_azimuth | solar_azimuth | float32 | 1 | -999 | [solar_azimuth] | | |
| solar_zenith | solar_zenith | float32 | 1 | -999 | [solar_zenith] | | |
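
Note that `measurements.loc[product]` raises a `KeyError` if the product name is misspelled or not indexed. A slightly more forgiving lookup is sketched below. It assumes the measurements table is indexed by two levels named `product` and `measurement`, which is what the `.loc[product]` call above implies. `measurements_for` is a hypothetical helper, and `measurements_demo` reuses two rows from the table above as a stand-in.

```python
import pandas as pd


def measurements_for(measurements, product):
    """Return the measurement rows for `product`, or an empty table if unknown.

    Hypothetical helper; assumes `measurements` has a MultiIndex with
    levels named "product" and "measurement".
    """
    known = measurements.index.get_level_values("product")
    if product not in known:
        # Same columns, zero rows, indexed by measurement only
        return measurements.iloc[0:0].droplevel("product")
    return measurements.loc[product]


# Stand-in for dc.list_measurements(), using two rows from the table above
index = pd.MultiIndex.from_tuples(
    [("ga_s2a_ard_nbar_granule", "solar_azimuth"),
     ("ga_s2a_ard_nbar_granule", "solar_zenith")],
    names=["product", "measurement"],
)
measurements_demo = pd.DataFrame(
    {"name": ["solar_azimuth", "solar_zenith"],
     "dtype": ["float32", "float32"],
     "units": ["1", "1"],
     "nodata": [-999, -999]},
    index=index,
)

found = measurements_for(measurements_demo, "ga_s2a_ard_nbar_granule")
print(found.index.tolist())

missing = measurements_for(measurements_demo, "not_a_product")
print(len(missing))
```

Returning an empty table rather than raising is a design choice for interactive exploration; in a processing script, letting the `KeyError` surface may be preferable.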


## Visualising available data
The interactive `DcViewer` utility provides a more visual way of exploring the data that is available within the Digital Earth Australia datacube. 

After running the cell below, select a product from the drop-down menu on the top-right of the map to show the areas where data are available in blue.
Use the back and forward buttons above the map to toggle through time.

The utility is only able to visualise a limited number of datasets at one time.
If the available data footprints do not appear, either press the "show" button on the top right, or zoom in on the map.

In [5]:
DcViewer(dc=dc, 
         time='2018', 
         width='800px',
         center=(-37.60, 144.90),
         zoom=6)

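
As a non-interactive complement to the map, the number of datasets indexed for each product in a given period can be counted. The sketch below relies on `dc.find_datasets`, which searches the index and returns a list of matching datasets without loading pixel data. `dataset_counts` is a hypothetical helper, and the commented-out call assumes the `dc` object created earlier.

```python
def dataset_counts(dc, product_names, time):
    """Map each product name to the number of datasets indexed within `time`.

    Hypothetical helper; `dc` is expected to provide a `find_datasets`
    method that accepts `product` and `time` search terms.
    """
    return {
        name: len(dc.find_datasets(product=name, time=time))
        for name in product_names
    }


# With the `dc` object created earlier, usage would look like:
# dataset_counts(dc, ["ga_s2a_ard_nbar_granule", "ga_s2b_ard_nbar_granule"], "2018")
```

A count of zero for a product and period suggests the map will show no footprints for that selection either.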

## Recommended next steps

The notebooks in this beginner's guide are designed to be worked through in the following order:

1. [Jupyter Notebooks](01_Jupyter_notebooks.ipynb)
2. [Digital Earth Australia](02_DEA.ipynb)
3. **Products and measurements (this notebook)**
4. [Loading data](04_Loading_data.ipynb)
5. [Plotting](05_Plotting.ipynb)
6. [Performing a basic analysis](06_Basic_analysis.ipynb)
7. [Introduction to Numpy](07_Intro_to_numpy.ipynb)
8. [Introduction to Xarray](08_Intro_to_xarray.ipynb)
9. [Parallel processing with Dask](09_Parallel_processing_with_dask.ipynb)

***
## Additional information

**License:** The code in this notebook is licensed under the [Apache License, Version 2.0](https://www.apache.org/licenses/LICENSE-2.0). 
Digital Earth Australia data is licensed under the [Creative Commons by Attribution 4.0](https://creativecommons.org/licenses/by/4.0/) license.

**Contact:** If you need assistance, please review the FAQ section and support options on the [EY Data Science platform](https://datascience.ey.com/).