One step zarr fill values #223

brianhenn · 2020-04-07T22:58:15Z

This accomplishes two things: 1) preventing true model 0s from being cast to NaNs in the one-step big zarr output, and 2) initializing big zarr arrays with NaNs via full so that if they are not filled in due to a failed timestep or other reason, it is more apparent than using empty which produces arbitrary output.

AnnaKwa · 2020-04-09T15:39:31Z

workflows/one_step_jobs/runfile.py

@@ -193,7 +193,9 @@ def _convert_time_delta_to_float_seconds(a):


 def _merge_monitor_data(paths: Mapping[str, str]) -> xr.Dataset:
-    datasets = {key: xr.open_zarr(val) for key, val in paths.items()}
+    datasets = {
+        key: xr.open_zarr(val, mask_and_scale=False) for key, val in paths.items()


What does the mask and scale argument do?

See the xarray docs: http://xarray.pydata.org/en/stable/generated/xarray.open_zarr.html it allows for turning on and off automatic masking/rescaling based on dataset attributes

AnnaKwa

Thanks for fixing this!

* Feature/one step save baseline (#193) This adds several features to the one-step pipeline - big zarr. Everything is stored as one zarr file - saves physics outputs - some refactoring of the job submission. Sample output: https://gist.github.com/nbren12/84536018dafef01ba5eac0354869fb67 * save lat/lon grid variables from sfc_dt_atmos (#204) * save lat/lon grid variables from sfc_dt_atmos * Feature/use onestep zarr train data (#207) Use the big zarr from the one step workflow as input to the create training data pipeline * One-step sfc variables time alignment (#214) This makes the diagnostics variables appended to the big zarr have the appropriate step and forecast_time dimensions, just as the variables extracted by the wrapper do. * One step zarr fill values (#223) This accomplishes two things: 1) preventing true model 0s from being cast to NaNs in the one-step big zarr output, and 2) initializing big zarr arrays with NaNs via full so that if they are not filled in due to a failed timestep or other reason, it is more apparent than using empty which produces arbitrary output. * adjustments to be able to run workflows in dev branch (#218) Remove references to hard coded dims and data variables or imports from vcm.cubedsphere.constants, replace with arguments. Can provide coords and dims as args for mappable var * One steps start index (#231) Allows for starting the one-step jobs at the specified index in the timestep list to allow for testing/avoiding spinup timesteps * Dev fix/integration tests (#234) * change order of required args so output is last * fix arg for onestep input to be dir containing big zarr * update end to end integration test ymls * prognostic run adjustments * Improved fv3 logging (#225) This PR introduces several improvements to the logging capability of our prognostic run image - include upstream changes to disable output capturing in `fv3config.fv3run` - Add `capture_fv3gfs_func` function. When called this capture the raw fv3gfs outputs and re-emit it as DEBUG level logging statements that can more easily be filtered. - Refactor `runtime` to `external/runtime/runtime`. This was easy since it did not depend on any other module in fv3net. (except implicitly the code in `fv3net.regression` which is imported when loading the sklearn model with pickle). - updates fv3config to master * manually merge in the refactor from master while keeping new names from develop (#237) * lint * remove logging from testing * Dev fix/arg order (#238) * update history * fix positional args * fix function args * update history * linting Co-authored-by: Anna Kwa <annak@vulcan.com> Co-authored-by: brianhenn <brianhenn@gmail.com>

brianhenn added 3 commits April 7, 2020 21:00

switched array creation to full

91e7153

set runfile postprocessing to open zarr without masking

9b6ff4c

linting

76f88be

AnnaKwa mentioned this pull request Apr 8, 2020

adjustments to be able to run workflows in dev branch #218

Merged

AnnaKwa reviewed Apr 9, 2020

View reviewed changes

AnnaKwa approved these changes Apr 9, 2020

View reviewed changes

brianhenn merged commit 522f695 into develop-one-steps Apr 9, 2020

brianhenn deleted the feature/one-step-zarr-fill-value branch April 9, 2020 15:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

One step zarr fill values #223

One step zarr fill values #223

brianhenn commented Apr 7, 2020 •

edited

Loading

AnnaKwa Apr 9, 2020

brianhenn Apr 9, 2020

AnnaKwa left a comment

One step zarr fill values #223

One step zarr fill values #223

Conversation

brianhenn commented Apr 7, 2020 • edited Loading

AnnaKwa Apr 9, 2020

Choose a reason for hiding this comment

brianhenn Apr 9, 2020

Choose a reason for hiding this comment

AnnaKwa left a comment

Choose a reason for hiding this comment

brianhenn commented Apr 7, 2020 •

edited

Loading