baroclinic initialization #31

rheacangeo · 2021-11-23T07:28:27Z

Purpose

To add the computation of the baroclinic perturbation test case (Jablonowski & Williamson JRMS 2006) to enable running performance scripts without reading serialized data.

This PR:

adds validating code (against data generated with the feature/serialize_init branch of fv3gfs-fortran, which will in parallel be PRed into master as data version 7.2.7) that creates the baroclinic initial state in fv3core/fv3core/initialization
Two files distinguish the test case -- baroclinic_jablonowski_williamson.py has the base equations found in the paper and DCMIP2016 operating on numpy arrays and baroclinic.py, which calls the functions from the other file, and does the cubed sphere transformation and data shaping sizing to get it in the format we can use in the Dycore.
Adds code to test the initial state values when run with baroclinic namelists (not currently blocking PRs, but does or will when it works, get tested with the cache generating cron plan).
adds a DycoreState specification using a dataclass and replaces the ArgSpec and get_namespace functionality of fv3core/fv3ocre/stencils/fv_dynamics.py
Updated the fv_dynamics test to use the DycoreState
updates the dynamics runfile for performance testing to create a DycoreState using the baroclinic initialization and replaced reading the serialized data as inputs.

…d_data.ks

…tive to the fortran code

…e into feature/use_new_grid_in_dycore

…hrough the grid_data object instead. A future PR may orgnaize these to be computed at a different stage

… launching simulations that are not baroclinic

…in_dycore

… on the MetricTerms class. To avoid a circular import with MirrorGrid, changed RIGHT_HAND_GRID to be an argument in the context of mirroring, set to to a constant in out calls of it

…k, but are still wrong on a few points

…ther objects

…ridData because the match to the variables used is better and it simplifies the code.

…rom fv_dynamics

…pace into feature/baroclinic-initialization

fv3core/fv3core/stencils/fv_dynamics.py

fv3core/examples/standalone/runfile/dynamics.py

mcgibbon · 2021-11-23T20:32:56Z

fv3core/fv3core/initialization/baroclinic.py

+    lat2, lat3, lat4, lat5 = compute_grid_edge_midpoint_latitude_components(lon, lat)
+    slice_3d = (slice(0, nx), slice(0, ny), slice(None))
+    slice_2d = (slice(0, nx), slice(0, ny))
+    # initialize temperature
+    t_mean = jablo_init.horizontally_averaged_temperature(eta)
+    pt[slice_3d] = cell_average_nine_components(
+        jablo_init.temperature,
+        [eta, eta_v, t_mean],
+        lat,
+        lat_agrid,
+        lat2,
+        lat3,
+        lat4,
+        lat5,
+        slice_2d,
+    )
+
+    # initialize surface geopotential
+    phis[slice_2d] = cell_average_nine_components(
+        jablo_init.surface_geopotential_perturbation,
+        [],
+        lat,
+        lat_agrid,
+        lat2,
+        lat3,
+        lat4,
+        lat5,
+        slice_2d,
+    )


cell_average_nine_components and compute_grid_edge_midpoint_latitude_components are doing something low level and pretty complex - I can't tell what they are doing. compute_grid_edge_midpoint_latitude_components is also only used for this second function.

Is there any change you can make to have it be more clear what's happening at this function's level, by pushing the low-level logic down one call level? For example, even if it's less efficient, by pushing the call to compute_grid_edge_midpoint_latitude_components down one level (removing many low-level call arguments from this function's level), and by pulling the slicing operation (which is a higher level operation you're already using here in the assignment slice) up to this level (e.g. by passing lat[slice_2d] and lat_agrid[slice_2d])?

sure thing. this duplicates the calculation of these latitude factors, but does make it more understandable. I'll implement the suggestion. Pulling the slicing up is a little tricker, as the resulting shapes of the 9 points being averaged are not by default the same, as they are shifted in different directions. I can do this if you prefer, but want to double check before I iterate on it.

mcgibbon · 2021-11-23T20:37:14Z

fv3core/fv3core/initialization/baroclinic.py

+    assert not adjust_dry_mass
+    assert not hydrostatic


Does it make sense for us to just not implement these as keyword arguments? I don't think we have any plans to implement these, the code itself is easier to understand than the rest of the dycore in terms of if we were to implement a new option, and if we did implement a new option we'd probably do it using new functions rather than if flags?

ok sure. The fortran code has a lot of if conditions for different test cases, so it could be potentially confusing to understand and implement the other options. But I am not worried about this as much as I was for the dycore, since this is just an idealized initial state.

I'm also already not following this p_var subroutine to the letter, since it was recomputing pressure variables in the exact same way they were already computed.

do you mean just for p_var, or for the whole module?

mcgibbon · 2021-11-23T20:38:30Z

fv3core/fv3core/initialization/baroclinic.py

+    # TODO: when the dycore state is updated to only include
+    # quantities and no storages, remove the "_quantity" from phis, u and v


You could remove this TODO here, when I go to do it this will be using vscode's refactor functionality so it will update everywhere at once (thanks to your use of a dataclass!). As it stands, this comment would probably remain after the refactor unless a reviewer catches it or I've thought long enough about this particular comment location while writing this message.

Suggested change

# TODO: when the dycore state is updated to only include

# quantities and no storages, remove the "_quantity" from phis, u and v

mcgibbon · 2021-11-23T20:50:29Z

fv3core/fv3core/initialization/dycore_state.py

+    def init_from_serialized_data(cls, serializer, grid, quantity_factory):
+        savepoint_in = serializer.get_savepoint("FVDynamics-In")[0]
+        translate_object = fv3core.testing.TranslateFVDynamics([grid])
+        input_data = translate_object.collect_input_data(serializer, savepoint_in)
+        # making just storages for the moment, revisit when making them all
+        # quantities (maybe use state_from_inputs)
+        translate_object._base.make_storage_data_input_vars(input_data)
+        # used for the translate test as inputs, but are generated by the
+        # MetricsTerms class and are not part of this data class
+        for delvar in ["ak", "bk", "ptop", "ks"]:
+            del input_data[delvar]
+        return cls(**input_data, quantity_factory=quantity_factory)


This method should only be getting used by the tests, and currently fv3core doesn't depend on serialbox other than for testing. Can this get moved to the testing code?

It can. I was thinking it might be useful for performance testing if we end up wanting to test non-baroclinic namelists before we implement reading initial state from disk. But someone wanting to do that can probably figure it out.

fv3core/fv3core/initialization/dycore_state.py

mcgibbon · 2021-11-23T20:54:33Z

fv3core/fv3core/testing/translate.py

@@ -293,6 +296,12 @@ def make_grid_storage(self, pygrid):
            if k in self.data:
                self.make_composite_var_storage(k, self.data[k], shape)
                del self.data[k]
+        for k in TranslateGrid.ee_vars:


nit: This was pre-existing, but can you replace k here and below with key? When I read these my mind doesn't stop telling me it's indexing on the vertical direction.

mcgibbon · 2021-11-23T20:54:52Z

fv3core/fv3core/testing/translate_fvdynamics.py

@@ -282,15 +284,27 @@ def compute_parallel(self, inputs, communicator):
            grid_data.ptop = inputs["ptop"]
            grid_data.ks = inputs["ks"]

-        state = self.state_from_inputs(inputs)
+        input_storages = self.state_from_inputs(inputs)
+        # making sure we init DyCOreState with the exact set of variables


Suggested change

# making sure we init DyCOreState with the exact set of variables

# making sure we init DyCoreState with the exact set of variables

fv3gfs-util/fv3gfs/util/quantity.py

…king one. we will revisit in a future PR to pull this up to the driver

… dycore state

rheacangeo

Thanks for the quick turnaround on feedback!

rheacangeo · 2021-11-23T21:33:55Z

fv3core/fv3core/initialization/baroclinic.py

+    lat2, lat3, lat4, lat5 = compute_grid_edge_midpoint_latitude_components(lon, lat)
+    slice_3d = (slice(0, nx), slice(0, ny), slice(None))
+    slice_2d = (slice(0, nx), slice(0, ny))
+    # initialize temperature
+    t_mean = jablo_init.horizontally_averaged_temperature(eta)
+    pt[slice_3d] = cell_average_nine_components(
+        jablo_init.temperature,
+        [eta, eta_v, t_mean],
+        lat,
+        lat_agrid,
+        lat2,
+        lat3,
+        lat4,
+        lat5,
+        slice_2d,
+    )
+
+    # initialize surface geopotential
+    phis[slice_2d] = cell_average_nine_components(
+        jablo_init.surface_geopotential_perturbation,
+        [],
+        lat,
+        lat_agrid,
+        lat2,
+        lat3,
+        lat4,
+        lat5,
+        slice_2d,
+    )


sure thing. this duplicates the calculation of these latitude factors, but does make it more understandable. I'll implement the suggestion. Pulling the slicing up is a little tricker, as the resulting shapes of the 9 points being averaged are not by default the same, as they are shifted in different directions. I can do this if you prefer, but want to double check before I iterate on it.

rheacangeo · 2021-11-23T21:50:40Z

fv3core/fv3core/initialization/baroclinic.py

+    assert not adjust_dry_mass
+    assert not hydrostatic


ok sure. The fortran code has a lot of if conditions for different test cases, so it could be potentially confusing to understand and implement the other options. But I am not worried about this as much as I was for the dycore, since this is just an idealized initial state.

rheacangeo · 2021-11-23T21:52:43Z

fv3core/fv3core/initialization/baroclinic.py

+    assert not adjust_dry_mass
+    assert not hydrostatic


I'm also already not following this p_var subroutine to the letter, since it was recomputing pressure variables in the exact same way they were already computed.

rheacangeo · 2021-11-23T21:54:00Z

fv3core/fv3core/initialization/baroclinic.py

+    assert not adjust_dry_mass
+    assert not hydrostatic


do you mean just for p_var, or for the whole module?

rheacangeo · 2021-11-23T21:58:08Z

fv3core/fv3core/initialization/baroclinic.py

+    islice, jslice, slice_3d, slice_2d = compute_slices(nx, ny)
+    # Slices with extra buffer points in the horizontal dimension
+    # to accomodate averaging over shifted calculations on the grid
+    isliceb, jsliceb, slice_3db, slice_2db = compute_slices(nx + 1, ny + 1)


sure thing, good point.

rheacangeo · 2021-11-23T22:01:44Z

fv3core/fv3core/initialization/baroclinic_jablonowski_williamson.py

+eta_s = 1.0  # surface level
+eta_t = 0.2  # tropopause


rheacangeo · 2021-11-23T22:02:38Z

fv3core/fv3core/initialization/baroclinic_jablonowski_williamson.py

+# maximum windspeed amplitude - close to windspeed of zonal-mean time-mean
+# jet stream in troposphere
+u0 = 35.0  # From Table VI of DCMIP2016
+# [lon, lat] of zonal wind perturbation centerpoint at 20E, 40N
+pcen = [math.pi / 9.0, 2.0 * math.pi / 9.0]  # From Table VI of DCMIP2016
+u1 = 1.0
+pt0 = 0.0
+eta_0 = 0.252
+eta_s = 1.0  # surface level
+eta_t = 0.2  # tropopause
+t_0 = 288.0
+delta_t = 480000.0
+lapse_rate = 0.005  # From Table VI of DCMIP2016
+surface_pressure = 1.0e5  # units of (Pa), from Table VI of DCMIP2016
+# NOTE RADIUS = 6.3712e6 in FV3 vs Jabowski paper 6.371229e6
+R = constants.RADIUS / 10.0  # Perturbation radiusfor test case 13


thanks! paid close attention to the paper on this one, which thankfully matches pretty well to what the fortran code does!

rheacangeo · 2021-11-23T22:03:46Z

fv3core/fv3core/initialization/dycore_state.py

+    def init_from_serialized_data(cls, serializer, grid, quantity_factory):
+        savepoint_in = serializer.get_savepoint("FVDynamics-In")[0]
+        translate_object = fv3core.testing.TranslateFVDynamics([grid])
+        input_data = translate_object.collect_input_data(serializer, savepoint_in)
+        # making just storages for the moment, revisit when making them all
+        # quantities (maybe use state_from_inputs)
+        translate_object._base.make_storage_data_input_vars(input_data)
+        # used for the translate test as inputs, but are generated by the
+        # MetricsTerms class and are not part of this data class
+        for delvar in ["ak", "bk", "ptop", "ks"]:
+            del input_data[delvar]
+        return cls(**input_data, quantity_factory=quantity_factory)


It can. I was thinking it might be useful for performance testing if we end up wanting to test non-baroclinic namelists before we implement reading initial state from disk. But someone wanting to do that can probably figure it out.

fv3core/fv3core/initialization/dycore_state.py

rheacangeo · 2021-11-23T22:06:22Z

fv3core/fv3core/testing/translate.py

@@ -293,6 +296,12 @@ def make_grid_storage(self, pygrid):
            if k in self.data:
                self.make_composite_var_storage(k, self.data[k], shape)
                del self.data[k]
+        for k in TranslateGrid.ee_vars:


mcgibbon · 2021-11-23T22:35:51Z

fv3core/fv3core/initialization/baroclinic.py

+    lon,
+    lat,
+    lat_agrid,
+    grid_slice,


The type of this argument is non-intuitive and would be particularly helpful to type hint.

Suggested change

grid_slice,

grid_slice: Tuple[slice, slice],

If you do still want to pull grid_slice up one level, the way you would do it is by passing in lon and lat sliced on their own compute domains - that is, up to nx+1 and ny+1 since they're defined on cell corners. Then below, every access to lat or lon should be clipping off 1 point. pt6 becomes lat[:-1, :-1] while pt9 becomes lat[:-1, 1:].

fv3core/fv3core/initialization/baroclinic.py

Co-authored-by: Jeremy McGibbon <jeremym@allenai.org>

…slice as needed, slicing at a level up

mcgibbon

Looks great! Looking forward to making use of this.

rheacangeo and others added 30 commits November 16, 2021 00:32

changes moved over from the feature/use_grid branch of fv3core

594b934

replace state.ptop and state.ks with self.grid_data.ptop and self.gri…

abf235b

…d_data.ks

update fortran changelof to reflect naming changes in the dycore rela…

ae1e500

…tive to the fortran code

linting bites again

693b10e

Merge branch 'main' into feature/use_new_grid_in_dycore

8341c1c

baroclinic initialization moved over from fv3core

05d3488

use lon/lat

5d94665

specific humidity moved to jablonowski file

e4bebdd

test all the init stte variables

0c2cc6f

some renaming

a5f9315

adjusting slicing

f0fb0b5

more slice adjustments

42ed9a1

pulling up slicing

27ad293

improve slicing of jablo method

68447d8

fixing merge with master issues

5ac36ba

Merge branch 'feature/use_new_grid_in_dycore' of github.com:ai2cm/pac…

3d78614

…e into feature/use_new_grid_in_dycore

reverting computing edge_factors as A2B class methods, passing them t…

3414fb2

…hrough the grid_data object instead. A future PR may orgnaize these to be computed at a different stage

adding comments to other local grid variables

208619f

keeping the acoustics read grid method, which could come in handy for…

881dac6

… launching simulations that are not baroclinic

remove redendant comments

8db3f36

removing the np proprty from StencilConfig

5f8309c

adding docstrings for the coriolis parameters

fb706ce

fixes in response to review comments

2eacf63

Merge remote-tracking branch 'origin/main' into feature/use_new_grid_…

21ef672

…in_dycore

moving grid constants out of global_constant and adding as attributes…

5c6910d

… on the MetricTerms class. To avoid a circular import with MirrorGrid, changed RIGHT_HAND_GRID to be an argument in the context of mirroring, set to to a constant in out calls of it

Merge branch 'main' into feature/use_new_grid_in_dycore

d7f011a

fix bug in error message

7d8847e

use quantities for baroclinic initialization. halo updates almost wor…

d9d0ad4

…k, but are still wrong on a few points

some reorg

18b1fa7

linting

3040618

rheacangeo and others added 13 commits November 21, 2021 20:36

merged updates to grid pr

a6c0e43

move initialization into its own directory

231468a

make the quantity factory the MetricTerms uses available for use by o…

096ce5f

…ther objects

use an instance of MetricTerms for state initialization rather than G…

5878aef

…ridData because the match to the variables used is better and it simplifies the code.

update the performance script to use state initialization

3e89e69

merged in main

3adf1d1

add back TRACER_DIM

bcde5cd

adding DycoreState

61d9b16

linting

beade16

adding initialization init file

7d4f5f9

the test for FVDynamics uses the DycoreState and ArgSpec is removed f…

d037da6

…rom fv_dynamics

Merge branch 'feature/baroclinic-initialization' of github.com:ai2cm/…

fce5d77

…pace into feature/baroclinic-initialization

linting the merge

ad378d7

rheacangeo requested a review from mcgibbon November 23, 2021 07:28

mcgibbon reviewed Nov 23, 2021

View reviewed changes

rheacangeo added 2 commits November 23, 2021 13:24

for this PR, have Physics call be passed a DycoreState rather than ma…

e481b29

…king one. we will revisit in a future PR to pull this up to the driver

changes in response to review comments and fixing the Physics call to…

2dd86ca

… dycore state

rheacangeo commented Nov 23, 2021

View reviewed changes

mcgibbon reviewed Nov 23, 2021

View reviewed changes

rheacangeo and others added 2 commits November 23, 2021 14:42

Update fv3core/fv3core/initialization/baroclinic.py

674f281

Co-authored-by: Jeremy McGibbon <jeremym@allenai.org>

pull the slicing of every lat variable in the cell average, and only …

7007ca0

…slice as needed, slicing at a level up

mcgibbon approved these changes Nov 23, 2021

View reviewed changes

Merge branch 'main' into feature/baroclinic-initialization

273549f

rheacangeo enabled auto-merge (squash) November 23, 2021 23:01

rheacangeo merged commit 8021a8e into main Nov 24, 2021

rheacangeo deleted the feature/baroclinic-initialization branch November 24, 2021 00:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

baroclinic initialization #31

baroclinic initialization #31

rheacangeo commented Nov 23, 2021

mcgibbon Nov 23, 2021

rheacangeo Nov 23, 2021

mcgibbon Nov 23, 2021

rheacangeo Nov 23, 2021

rheacangeo Nov 23, 2021

rheacangeo Nov 23, 2021

mcgibbon Nov 23, 2021

rheacangeo Nov 23, 2021

mcgibbon Nov 23, 2021

rheacangeo Nov 23, 2021

mcgibbon Nov 23, 2021

rheacangeo Nov 23, 2021

mcgibbon Nov 23, 2021

rheacangeo left a comment

rheacangeo Nov 23, 2021

rheacangeo Nov 23, 2021

rheacangeo Nov 23, 2021

rheacangeo Nov 23, 2021

rheacangeo Nov 23, 2021

rheacangeo Nov 23, 2021

rheacangeo Nov 23, 2021

rheacangeo Nov 23, 2021

rheacangeo Nov 23, 2021

mcgibbon Nov 23, 2021

mcgibbon Nov 23, 2021

mcgibbon left a comment

		# TODO: when the dycore state is updated to only include
		# quantities and no storages, remove the "_quantity" from phis, u and v

	# making sure we init DyCOreState with the exact set of variables
	# making sure we init DyCoreState with the exact set of variables

baroclinic initialization #31

baroclinic initialization #31

Conversation

rheacangeo commented Nov 23, 2021

Purpose

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rheacangeo left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mcgibbon left a comment

Choose a reason for hiding this comment