# DEA Coastlines generation using command line tools <img align="right" src="https://github.com/GeoscienceAustralia/dea-notebooks/raw/develop/Supplementary_data/dea_logo.jpg">
This notebook demonstrates how to run a DEA Coastlines analysis using command line tools.

### Setup

Set the working directory to the top level of the repo so that relative paths (e.g. `configs/` and `data/`) resolve correctly:

In [1]:
cd ..

/home/jovyan/Robbi/dea-coastlines


Update required packages:

In [2]:
pip install -r requirements.in --quiet

You should consider upgrading via the '/env/bin/python -m pip install --upgrade pip' command.
Note: you may need to restart the kernel to use updated packages.


Set analysis parameters:

In [7]:
config_path = 'configs/dea_coastlines_config.yaml'
study_area = 7904
raster_version = 'testing'
vector_version = 'testing'
continental_version = 'testing'

### Run DEA Coastlines analysis
#### Run tidally-constrained raster generation

In [3]:
!python -m coastlines.raster --help

Usage: python -m coastlines.raster [OPTIONS]

Options:
  --config_path TEXT              Path to the YAML config file defining inputs
                                  to use for this analysis. These are
                                  typically located in the `dea-
                                  coastlines/configs/` directory.  [required]
  --study_area TEXT               A string providing a unique ID of an
                                  analysis gridcell that will be used to run
                                  the analysis. This should match a row in the
                                  "id" column of the provided analysis
                                  gridcell vector file.  [required]
  --raster_version TEXT           A unique string proving a name that will be
                                  used for output raster directories and
                                  files. This can be used to version different
                                  analysis outputs.  [req

Example analysis:

In [14]:
!python -m coastlines.raster --config_path {config_path} --study_area {study_area} --raster_version {raster_version} --start_year 1988 --end_year 2021

Perhaps you already have a cluster running?
Hosting the HTTP server on port 38017 instead
<Client: 'tcp://127.0.0.1:44699' processes=1 threads=31, memory=254.70 GB>
2022-08-02 07:05:42 INFO Study area 7931: Loaded study area grid
2022-08-02 07:05:53 INFO Study area 7931: Loaded virtual product
Creating reduced resolution tide modelling array
Modelling tides using FES2014 tide model
Reprojecting tides into original array
100%|███████████████████████████████████████| 1988/1988 [00:52<00:00, 37.57it/s]
2022-08-02 07:08:02 INFO Study area 7931: Finished modelling tide heights
2022-08-02 07:08:03 INFO Study area 7931: Calculating low and high tide cutoffs for each pixel
2022-08-02 07:08:03 INFO Study area 7931: Started exporting raster data
2022-08-02 07:30:32 INFO Study area 7931: Completed exporting raster data
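
The `{config_path}`-style placeholders in the example command above are expanded by IPython from the notebook variables, so the command only works as written inside a notebook or IPython session. As a minimal sketch of the same call from a plain Python script, the argument list can be built explicitly. The helper name `build_raster_command` is hypothetical (it is not part of DEA Coastlines), and only the flags used in the example above are included:

```python
import shlex
import sys


def build_raster_command(config_path, study_area, raster_version, start_year, end_year):
    """Return the argument list for one `coastlines.raster` run.

    Hypothetical helper: it only mirrors the flags used in the notebook example.
    """
    return [
        sys.executable, "-m", "coastlines.raster",
        "--config_path", str(config_path),
        "--study_area", str(study_area),
        "--raster_version", str(raster_version),
        "--start_year", str(start_year),
        "--end_year", str(end_year),
    ]


cmd = build_raster_command(
    config_path="configs/dea_coastlines_config.yaml",
    study_area=7904,
    raster_version="testing",
    start_year=1988,
    end_year=2021,
)

# Print the command as a shell-quoted string for inspection; nothing is executed here
print(shlex.join(cmd))
```

The resulting list can then be passed to `subprocess.run(cmd, check=True)` to run the analysis from the top level of the repo.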


#### Run vector annual shoreline and rates of change statistics generation

In [5]:
!python -m coastlines.vector --help

Usage: python -m coastlines.vector [OPTIONS]

Options:
  --config_path TEXT            Path to the YAML config file defining inputs
                                to use for this analysis. These are typically
                                located in the `dea-coastlines/configs/`
                                directory.  [required]
  --study_area TEXT             A string providing a unique ID of an analysis
                                gridcell that was previously used to generate
                                raster files. This is used to identify the
                                raster files that will be used as inputs for
                                shoreline extraction, and should match a row
                                in the "id" column of the provided analysis
                                gridcell vector file.  [required]
  --raster_version TEXT         A unique string providing a name that was used
                                to generate raster files

Example analysis:

In [8]:
!python -m coastlines.vector --config_path {config_path} --study_area {study_area} --raster_version {raster_version} --vector_version {vector_version} --start_year 1988 --end_year 2021 --baseline_year 2021

2022-08-15 07:40:28 INFO Study area 7904: Starting vector generation
2022-08-15 07:41:01 INFO Study area 7904: Loaded rasters
2022-08-15 07:41:02 INFO Study area 7904: Loaded tide modelling points
2022-08-15 07:41:58 INFO Study area 7904: Extracted annual shorelines
2022-08-15 07:42:00 INFO Study area 7904: Extracted rates of change points
2022-08-15 07:43:16 INFO Study area 7904: Calculated distances to each annual shoreline
2022-08-15 07:43:22 INFO Study area 7904: Calculated rates of change regressions
2022-08-15 07:43:34 INFO Study area 7904: Calculated all of time statistics
2022-08-15 07:43:34 INFO Study area 7904: Calculated rate of change certainty flags
2022-08-15 07:43:36 INFO Study area 7904: Added region attributes and geohash UID
2022-08-15 07:43:53 INFO Study area 7904: Output vector files written to data/interim/vector/testing/7904_testing


#### Run continental-scale layer generation

In [6]:
!python -m coastlines.continental --help

Usage: python -m coastlines.continental [OPTIONS]

Options:
  --vector_version TEXT           A unique string proving a name that was used
                                  for output vector directories and files.
                                  This is used to identify the tiled annual
                                  shoreline and rates of change layers that
                                  will be combined into continental-scale
                                  layers.  [required]
  --continental_version TEXT      A unique string proving a name that will be
                                  used for output continental-scale layers.
                                  This allows multiple versions of
                                  continental-scale layers to be generated
                                  from the same input vector data, e.g. for
                                  testing different hotspot of coastal change
                                  summary layers. If not

Example analysis:

In [9]:
!python -m coastlines.continental --vector_version {vector_version} --continental_version {continental_version} --shorelines True --ratesofchange True --hotspots True --baseline_year 2021

2022-08-03 04:27:21 INFO Writing data to data/processed/testing
2022-08-03 04:27:23 INFO Merging annual shorelines complete
2022-08-03 04:27:25 INFO Merging rates of change points complete
2022-08-03 04:27:25 INFO Generating continental hotspots
2022-08-03 04:27:27 INFO Calculating 10000 m hotspots
2022-08-03 04:27:27 INFO Calculating 5000 m hotspots
2022-08-03 04:27:28 INFO Calculating 1000 m hotspots
2022-08-03 04:27:30 INFO Writing hotspots complete
2022-08-03 04:27:30 INFO Writing styles in the GeoPackage file


## Example combined analysis
This example shows how the three components of DEA Coastlines (raster generation, vector generation and continental layer generation) can be run in sequence over a list of study area grid cells.

In [9]:
# Study areas
study_areas = [3131, 4208, 6727]

In [10]:
# Run raster and vector generation for each study area
# (the raster step is commented out here; uncomment it to regenerate rasters first)
for study_area in study_areas:
    print(study_area)
    # !python -m coastlines.raster --config_path {config_path} --study_area {study_area} --raster_version {raster_version} --start_year 1988 --end_year 2021
    !python -m coastlines.vector --config_path {config_path} --study_area {study_area} --raster_version {raster_version} --vector_version {vector_version} --start_year 1988 --end_year 2021 --baseline_year 2021

# When complete, uncomment the line below to combine into single continental outputs
# !python -m coastlines.continental --vector_version {vector_version} --continental_version {continental_version} --shorelines True --ratesofchange True --hotspots True --baseline_year 2021

3131
2022-08-15 07:45:06 INFO Study area 3131: Starting vector generation
2022-08-15 07:45:40 INFO Study area 3131: Loaded rasters
2022-08-15 07:45:41 INFO Study area 3131: Loaded tide modelling points
2022-08-15 07:46:30 INFO Study area 3131: Extracted annual shorelines
2022-08-15 07:46:30 INFO Study area 3131: Extracted rates of change points
2022-08-15 07:46:53 INFO Study area 3131: Calculated distances to each annual shoreline
2022-08-15 07:46:54 INFO Study area 3131: Calculated rates of change regressions
2022-08-15 07:46:58 INFO Study area 3131: Calculated all of time statistics
2022-08-15 07:46:58 INFO Study area 3131: Calculated rate of change certainty flags
2022-08-15 07:46:58 INFO Study area 3131: Added region attributes and geohash UIDs
2022-08-15 07:47:03 INFO Study area 3131: Output vector files written to data/interim/vector/testing/3131_testing
4208
2022-08-15 07:47:06 INFO Study area 4208: Starting vector generation
2022-08-15 07:47:31 INFO Study area 4208: Loaded rast
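
The loop above keeps going even if one study area fails, and a failure can be easy to miss in long log output. The following is a minimal sketch of the same loop using `subprocess`, which records study areas whose command exits with a non-zero status. It assumes that a failed `coastlines.vector` run exits non-zero (true of any Python program that ends in an uncaught exception, but not verified here for every failure mode). The names `vector_command`, `run_study_areas` and `dry_run` are hypothetical, and `dry_run` only prints each command so that the sketch can be tried without running an analysis:

```python
import subprocess
import sys


def vector_command(study_area, config_path="configs/dea_coastlines_config.yaml",
                   raster_version="testing", vector_version="testing"):
    """Argument list mirroring the notebook's `coastlines.vector` call."""
    return [
        sys.executable, "-m", "coastlines.vector",
        "--config_path", config_path,
        "--study_area", str(study_area),
        "--raster_version", raster_version,
        "--vector_version", vector_version,
        "--start_year", "1988",
        "--end_year", "2021",
        "--baseline_year", "2021",
    ]


def run_study_areas(study_areas, runner=subprocess.run):
    """Run vector generation per study area; return the IDs that exited non-zero."""
    failed = []
    for study_area in study_areas:
        result = runner(vector_command(study_area))
        if result.returncode != 0:
            failed.append(study_area)
    return failed


def dry_run(cmd):
    """Hypothetical stand-in runner: print the command instead of executing it."""
    print(" ".join(cmd))
    return subprocess.CompletedProcess(cmd, returncode=0)


# Swap `runner=dry_run` for the default to actually run each analysis
failed = run_study_areas([3131, 4208, 6727], runner=dry_run)
print("Failed study areas:", failed)
```

Passing the runner as an argument keeps the loop logic separate from process execution, so the commands can be inspected before committing to a long run.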

### Clean up and remove files
Warning: this will delete all data processed with a matching unique version name. Only run these cells if you no longer need these processed outputs!

In [None]:
# rm -rf data/interim/raster/{raster_version}

In [None]:
# rm -rf data/interim/vector/{vector_version}

In [None]:
# rm -rf data/processed/{continental_version}
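
As a more cautious alternative to `rm -rf`, the clean-up can be sketched in Python with a dry-run default that only lists what would be deleted. The function name `remove_version_outputs` and its `root` and `dry_run` parameters are hypothetical; the directory layout is taken from the three commands above. The guard against empty or path-like version names is an added safety assumption, since an empty version would otherwise resolve to the parent directory containing every version:

```python
import shutil
from pathlib import Path


def remove_version_outputs(raster_version, vector_version, continental_version,
                           root=".", dry_run=True):
    """Delete (or, by default, only list) the output directories for the given versions."""
    for version in (raster_version, vector_version, continental_version):
        # Refuse names that would resolve to a parent or sibling directory
        if version in ("", ".", "..") or "/" in version or "\\" in version:
            raise ValueError(f"Unsafe version name: {version!r}")

    root = Path(root)
    targets = [
        root / "data" / "interim" / "raster" / raster_version,
        root / "data" / "interim" / "vector" / vector_version,
        root / "data" / "processed" / continental_version,
    ]

    # Only act on directories that actually exist
    existing = [path for path in targets if path.is_dir()]
    for path in existing:
        if dry_run:
            print(f"Would remove {path}")
        else:
            shutil.rmtree(path)
            print(f"Removed {path}")
    return existing


# Dry run: lists matching directories without deleting anything
remove_version_outputs("testing", "testing", "testing")
```

Pass `dry_run=False` only after checking the listed paths, because the deletion cannot be undone.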

***

## Additional information

**License:** The code in this notebook is licensed under the [Apache License, Version 2.0](https://www.apache.org/licenses/LICENSE-2.0). 
Digital Earth Australia data is licensed under the [Creative Commons by Attribution 4.0](https://creativecommons.org/licenses/by/4.0/) license.

**Contact:** For assistance with any of the Python code or Jupyter Notebooks in this repository, please post a [GitHub issue](https://github.com/GeoscienceAustralia/dea-coastlines/issues/new).

**Last modified:** July 2022