
Add IRI #3

Open · t-downing wants to merge 20 commits into main
Conversation

t-downing (Collaborator):

Key processing steps:

  • Processing IRI seasonal forecasts globally. Currently this is done as a notebook in the exploration folder, but it can be moved to src if we decide we're happy with this method.
  • Processing ACAPS seasonal calendars (in src)
  • Aggregating all CODABs for which we have a country_config in AnticiPy and which are included in the ACAPS dataset (in src)

Also includes approx_mask_raster() to up-sample Xarray datasets.
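The implementation of approx_mask_raster() isn't shown in this thread; as a conceptual sketch only, nearest-neighbour up-sampling of a 2-D grid can be done by repeating cells along both spatial axes (upsample_nearest is a hypothetical stand-in, not the function from this PR):

```python
import numpy as np

def upsample_nearest(arr: np.ndarray, factor: int) -> np.ndarray:
    """Nearest-neighbour up-sampling: repeat each cell `factor` times
    along both spatial axes."""
    return np.repeat(np.repeat(arr, factor, axis=0), factor, axis=1)

coarse = np.array([[1, 2], [3, 4]])
fine = upsample_nearest(coarse, 2)
# each original cell becomes a 2x2 block in the 4x4 output
```

The real function presumably also handles coordinates and masking on the xarray side.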

Also includes notebooks in exploration for exploring:

  • ACAPS seasonal calendars
  • ASAP seasonal calendar
  • FEWSNET livelihood zones (just loading)

All very much preliminary stuff, but I imagine we'll be using the seasonal calendars soon, so I thought you'd want to get a look at the processing.

Also, the branch name is definitely too broad in scope! Sorry

@t-downing t-downing requested a review from zackarno October 4, 2023 17:46
@t-downing t-downing requested a review from caldwellst October 20, 2023 20:44
t-downing (Collaborator, Author):

I go through the steps for processing the ASAP phenology in exploration/asap_phenology. The slowest step is process_asap_phenology_dekads(), which has to iterate over longitude due to memory constraints.

I'm also using the outputs (in Data/public/processed/glb/asap/season) for the Sahel regional framework analysis.
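The longitude-chunked pattern described above can be sketched in plain numpy; the per-pixel computation here is a placeholder, and the real process_asap_phenology_dekads() is not reproduced:

```python
import numpy as np

def process_by_lon_chunks(data: np.ndarray, chunk_size: int) -> np.ndarray:
    """Process a (lat, lon) array one longitude slab at a time to keep
    peak memory low, then stitch the results back together."""
    out_chunks = []
    for start in range(0, data.shape[1], chunk_size):
        slab = data[:, start:start + chunk_size]
        out_chunks.append(slab * 2)  # placeholder per-pixel computation
        del slab  # slab can be freed before the next iteration
    return np.concatenate(out_chunks, axis=1)

data = np.arange(12.0).reshape(3, 4)
result = process_by_lon_chunks(data, chunk_size=2)
```

With xarray the same effect is usually achieved by slicing along the longitude coordinate, but the memory trade-off is identical: only one slab's intermediate results live at a time.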

caldwellst (Collaborator) left a comment:

Okay, not so many comments here since this obviously overlaps a lot with the Sahel codebase, we aren't going to use the livelihood zones or other admin stuff for now, and the full processing code will be ported into the global monitoring repository.

I put a few comments in the code, but I think these are the overall comments:

  1. Just general comments in the files, to explain what's being looked at and anything interesting or useful found there. This can help in the future when returning to these, for other people (or even yourself) trying to parse exactly the reasoning for doing certain things.
  2. In this vein, I think a simple README.md explaining what was explored here would be good, mentioning the files that are there and giving a quick explanation of what was done or what to look at.
  3. For this, I think we should just have top-level folders for the different datasets being explored; otherwise, this repo will get way too cluttered. Each folder should be self-contained and require nothing from outside. Basically a way to put everything in one location without requiring these modules to work together. So it would be asap/src, asap/exploration, and the README would sit under the folder as asap/README.md. Then the overall README.md can just have a line or two explaining what is in the asap folder, with a link to the folder itself (so that asap/README.md is displayed on the GitHub platform).
  4. In the future, it might be better to split out the utilities. It doesn't really matter here, but for instance in the Sahel repo it would make it cleaner to read through and see which parts work together and which provide disparate functionality.
  5. No requirements file. Not a big deal, but probably best to provide one at least. You could use pipreqs to generate the file for the asap folder and store it under there.

"""
filepath = (
DATA_DIR
/ "public/processed/glb/acaps/seasonal-events-calendar_processed.csv"

Likely never an issue, but if you separate all folder and file names as individual strings, the paths will be fully system-independent via pathlib.Path; currently this still assumes the / path separator.
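A minimal sketch of the suggestion (the base directory and the short filename are placeholders, not the real DATA_DIR or CSV name): passing each folder name as its own string lets pathlib join components with the correct separator for the platform:

```python
from pathlib import Path

DATA_DIR = Path("Data")  # hypothetical base dir standing in for the real DATA_DIR

# current style: a single string that assumes "/" separators
filepath_mixed = DATA_DIR / "public/processed/glb/acaps/file.csv"

# fully component-wise: each folder and file name is its own string
filepath_parts = DATA_DIR / "public" / "processed" / "glb" / "acaps" / "file.csv"
```

In practice pathlib also splits the "/"-joined string correctly on most platforms, but the component-wise form makes the intent explicit and avoids relying on that behaviour.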

src/utils.py


```python
def process_drought_codabs():
    """
```

For exploration this is fine, but when eventually bringing this into global monitoring we won't want to restrict to countries with AnticiPy config files. If we need admin1 at the country level, it's much easier to use something like https://fieldmaps.io/data, which is already aggregated globally and was based off the original CODAB files, so in most cases the names should match.

```python
# join with CODAB
# note - some asap1_ids don't match up,
# hence will be missing from plot
cod_crop = cod_asap.merge(
```

cod_asap is not defined in this notebook.


```python
lon, lat = 0, 0
da.where(da < 251).sel(x=slice(lon, lon + 40), y=slice(lat + 40, lat)).plot()
```

I know these are just autoplots, but it would be good here to have a single color bar, which would make it easier to see where the intensity lies. And is it the case that this is the # of dekads in 3 months that are in season? If so, there seems to be a large swath with just 1?

```python
# 253 = hard to tell, barely used
# 252 = hard to tell
# 251 = no season (desert but also rainforest?)
lon, lat = -68, -14
```

Maybe just add a comment on where we're looking.


```python
# plot cumulative distribution of probability (reversed)
df.hist("prob", cumulative=-1, bins=100, density=1)
```

Lovely plot.
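For reference, the reversed cumulative distribution that df.hist(..., cumulative=-1) draws is just the exceedance fraction at each threshold; a small numpy sketch with made-up probabilities (the values stand in for df["prob"]):

```python
import numpy as np

# hypothetical forecast probabilities standing in for df["prob"]
prob = np.array([0.1, 0.2, 0.2, 0.4, 0.5, 0.7, 0.9])

# fraction of values >= each threshold (the "reversed" cumulative distribution)
thresholds = np.array([0.0, 0.25, 0.5, 0.75])
exceedance = np.array([(prob >= t).mean() for t in thresholds])
# exceedance[0] is 1.0: every value meets a threshold of 0
```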

```python
geobb = GeoBoundingBox.from_shape(adm0)

ds_adm0 = ds_f.rio.clip(adm0["geometry"], all_touched=True)
# resample to 0.01 degrees
```

Question: what's the point of the resampling here instead of just using the base IRI output? Might be good to make that clear in the analysis files. Is this something we want to do in the full implementation?

```python
da_d = da_d.rio.set_spatial_dims("x", "y", inplace=True)
# da_d = da_d.astype("uint8")
for dekad in dekads:
    da_d.loc[{"dekad": dekad}] = (
```

Not sure if it's more efficient, but you might be able to use the modulo operator I referenced in the other PR to reduce this to just the 2 conditions connected by | for seasons 1 and 2.
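The modulo trick from the other PR isn't shown in this thread, but here is a sketch of how it could work: with 36 dekads per year, a single modulo comparison handles a season even when it wraps past dekad 36, so the loop body could reduce to two such conditions joined by |. All names and season bounds below are illustrative:

```python
import numpy as np

N_DEKADS = 36  # dekads per year

def in_season(dekad, start, end):
    """True if `dekad` falls in the season running from `start` to `end`
    (inclusive), correctly handling seasons that wrap past dekad 36."""
    return (dekad - start) % N_DEKADS <= (end - start) % N_DEKADS

dekads = np.arange(1, N_DEKADS + 1)
# season 1 within the year, season 2 wrapping over the new year
mask = in_season(dekads, 10, 15) | in_season(dekads, 30, 3)
```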


```python
# save
# File too big to save as 36-band raster, so must save as multiple files
# Note that saving as an actual Boolean in a NetCDF is even bigger somehow.
```

That's so weird haha.
