
4D (time evolving) depth S-grids#660

Merged
erikvansebille merged 32 commits into master from 4d_depth_sgrids
Apr 8, 2020

Conversation

@delandmeterp
Contributor

In this PR, we provide a `Field.from_netcdf` that allows reading vertical grids that evolve in time. The depth dimension must be set to `Field.depthfield`, where `depthfield` is the name of the field providing the depth data, such that `U.grid.depth = depthfield.data`.
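The idea can be illustrated with a small, self-contained NumPy sketch (this is not the Parcels API; the array layout and function names here are hypothetical): the depth coordinate becomes a 4D array `depth[t, z, y, x]` taken from a field's data, and the vertical grid at an arbitrary time is obtained by linear interpolation between the two bracketing snapshots.

```python
import numpy as np

# Snapshot times (seconds) and grid sizes (illustrative values only)
times = np.array([0.0, 3600.0, 7200.0])
nz, ny, nx = 5, 3, 4

# A 4D depth array depth[t, z, y, x] whose levels shoal over time
depth = np.empty((len(times), nz, ny, nx))
for it, t in enumerate(times):
    levels = np.linspace(0.0, 100.0 - 10.0 * t / 3600.0, nz)
    depth[it] = levels[:, None, None]  # broadcast over (ny, nx)

def depth_at(t):
    """Linearly interpolate the depth grid to an arbitrary time t."""
    it = np.searchsorted(times, t, side='right') - 1
    it = np.clip(it, 0, len(times) - 2)
    w = (t - times[it]) / (times[it + 1] - times[it])
    return (1.0 - w) * depth[it] + w * depth[it + 1]

# Halfway between the 100 m and 90 m snapshots, the bottom level sits at 95 m
print(depth_at(1800.0)[-1, 0, 0])
```

This is only a conceptual sketch of what "time-evolving depth" means; in the PR itself the 4D depth array is read from the NetCDF field named by `Field.depthfield`.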

parcels/field.py Outdated
kwargs['depth_field'] = dimensions['depth'][6:]
else:
depth = filebuffer.read_depth
print('Temporary print rather than error: Time varying depth data cannot be read in netcdf files yet')
Member


Does this temporary print need to be here?

Member


Also, would this require a tutorial to explain to users how to use this new 4D S-grid functionality?



Happy to help create one. I have plenty of (light) simulations that could be cleaned up and used for it.

parcels/field.py Outdated
return np.zeros(1)

@property
def read_depth_dim(self):
Member


Confusingly named method? Would this be better read_depth_4d or so? And the other one read_depth_1d?

@delandmeterp
Contributor Author

A small issue:
Since we work with `dimensions['depth'] = 'not_yet_set'`, we can't compare dimensions anymore to check whether fields are on the same grid. The quick check performed in the fieldset therefore no longer works, and the dimensions need to be loaded for every field carrying the `not_yet_set` flag (this wasn't occurring with the `Field.XXX` dimension syntax). So we have some cost here, but we gained a lot in clarity.
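The trade-off above can be shown with a minimal sketch (illustrative Python, not the actual fieldset code; the dict keys and helper are assumptions): with a placeholder depth entry, equal dimension dicts no longer imply equal grids, so the check has to fall back to loading and comparing the actual depth data.

```python
import numpy as np

# Two fields whose dimension dicts are identical, including the placeholder
dims_u = {'lon': 'nav_lon', 'lat': 'nav_lat', 'depth': 'not_yet_set'}
dims_v = {'lon': 'nav_lon', 'lat': 'nav_lat', 'depth': 'not_yet_set'}

def same_grid(dims_a, dims_b, depth_a, depth_b):
    # Cheap check: dict comparison is only conclusive without a placeholder
    if ('not_yet_set' not in dims_a.values()
            and 'not_yet_set' not in dims_b.values()):
        return dims_a == dims_b
    # Placeholder present: must load and compare the actual depth data
    return np.array_equal(depth_a, depth_b)

depth_u = np.linspace(0.0, 100.0, 5)
depth_v = np.linspace(0.0, 200.0, 5)

# The dicts are equal, yet the grids differ: only the data comparison sees it
print(dims_u == dims_v)                               # True
print(same_grid(dims_u, dims_v, depth_u, depth_v))    # False
```

This is why the placeholder approach costs a data load per field where the old `Field.XXX` dimension comparison was free.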

@cjongedijk

I don't seem to be authorised to merge master into origin/4d_depth_sgrids, but locally I have been merging it since the memory-leak update. There is still the same memory problem as discussed in #703, #711 and #668; even with master as it was this morning, I still seem to have it (within 1h I ran out of RAM). It seemed solved and tested for other users, so I was wondering, @erikvansebille or @CKehl, could you merge master into this branch? There are no conflicts (yet), and then, once we're all on the same page, we can hopefully fix where the memory issue still seems to persist.

@erikvansebille
Member

Hi @cjongedijk. I have just merged master into this PR. To be honest, I'm not really sure what the status of this PR is; it was mostly a discussion between you and @delandmeterp, right?

We have done (and are still doing) quite some optimisations of memory usage, so it may be that this updated version fixes your RAM issues. See also http://oceanparcels.org/faq.html#performance

Let us know how it goes!

@CKehl
Contributor

CKehl commented Feb 7, 2020

Hi - so, I also still encounter some problems with long-running simulations, though not due to memory but because of file access conflicts, which I intend to resolve soon. Can you confirm (via the procedure in http://oceanparcels.org/faq.html#performance) that your error actually comes from a memory overflow?

@CKehl
Contributor

CKehl commented Feb 7, 2020

Also: I am right now encountering some issues with 4D data myself when compiling the MPI documentation notebook for 4D simulations, so we're working very much on the same issue, I believe.

@cjongedijk

@erikvansebille thanks for the merge. @CKehl, are you using this branch to simulate with 4D data? I haven't run into any problems yet. @delandmeterp and I did indeed work on it, but nothing has changed since, as I have not had any issues, and I could not really check any performance differences against 3D fields with memory running out so fast on my longer sims. I should have flagged earlier that I cannot merge myself; I have been merging locally all the time and, understanding that I was the only user of this branch, didn't bother pushing back to the repository. I use 4D fields for all of my simulations, but I only run into memory problems for the ones with large fields, not for the ones with small fields that run for a long time. I have monitored my memory over two simulations I did this week: one that has run (and is still running) with small fields, and one that ran out of memory within an hour with large fields (similar pset sizes). I will post here in a minute how it looks and we can continue from there.

@cjongedijk

So this is how it looks for me. I exported the memory at random moments during the last few days (sorry for the intermittent signal compared to the graphs @CKehl showed in the other threads; I'm working on an automated memory tracker for Python processes only, but haven't finished it yet). Approximately 2 days into my slowly aggregating first (light) simulation, I had started one that gives me lots of trouble. I started swapping after about 1 hour and then killed that simulation. I kept the long-running one in the background on purpose, because I had seen a memory drop earlier while running sims simultaneously. This indeed seems to be the case around 48h into the sim, where I kill the troublesome simulation and the other one continues on a lower memory load.

[image: memory_usage]

@cjongedijk

with 'one field size' I mean the size of the netcdf file containing all variables/field information for one time step.

@cjongedijk

Update: after the weekend, my 'light' simulation linearly increased its memory uptake as expected. With netdata I tracked yesterday's run, where my local system was only doing the Parcels run, with similar memory behaviour as in #703 and https://github.com/OceanParcels/parcels/issues/668#issuecomment-576667920. Tomorrow morning I'll have to kill it to free up space for other projects.

Fig 1: [image: memory_python3]
Fig 2: [image: netdata_commited_memory_snapshot] (netdata output of my total RAM used for 24h)

Also, following the conversation in #740 today: is that testing specifically for `from_nemo`, or also generally for `from_netcdf`? It seems like the NEMO data format (sigma grids 3D+time, C-grid) is generally very similar to mine.

@CKehl
Contributor

CKehl commented Feb 12, 2020

Hi Cleo - sorry to come back to you this late. Indeed this memory issue kept me busy the last 2 weeks, because when I fixed it initially I ran all tests with 2D+time data only. In 3D+time, I ran into massive issues because I wasn't converting the field_chunksize settings properly for those cases - they are more 'involved'. Also, a separate issue popped up very often when running Erik's Galapagos case - conflicting file access, because with lazy loading now actually implemented properly, the locking mechanism needs to be managed. As you already saw in #740, those issues should be resolved now. I tested #740 by successfully running the full Galapagos case (the backward simulation) as it is in the repository (https://github.com/OceanParcels/GalapagosBasinPlastic) on the cluster in a job system without crashes, using auto-chunking. Thus, if we can rebase this branch (update it to the current master), it would be worth testing again. I am also running the Python notebook (https://nbviewer.jupyter.org/github/OceanParcels/parcels/blob/master/parcels/examples/documentation_MPI.ipynb) on a cluster with the 3D data now, regenerating the images. It would be very kind if you could run your simulation with the updates again. Conversely, I'll now browse through your changes in this branch, make a review, and get some inspiration from your changes where I may still need to adapt things.

@CKehl
Contributor

CKehl commented Feb 12, 2020

BTW: if you want to see how to define field_chunksize for 3D fields, have a look at #731, at the file parcels/examples/documentation_MPI.ipynb. There, in the second part of the file, you have the function set_cmems_fieldset_3D(cs), where cs = field_chunksize. That shows how to do it for general Field[set].from_netcdf() access - for NEMO and the others, that step is done automatically (though only tested in detail with NEMO, via the Galapagos case).
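For illustration, here is a small sketch of what per-dimension chunking amounts to at the dask level (the array shape, dtype, and dimension order are assumptions for the example, not taken from the notebook): one time slice per chunk, the depth column kept whole, and 64x64 horizontal tiles.

```python
import numpy as np
import dask.array as da

# A hypothetical 3D+time field laid out as (time, depth, lat, lon)
data = np.zeros((4, 25, 128, 256), dtype=np.float32)

# Chunk: 1 time slice per chunk, full depth column, 64x64 horizontal tiles
chunked = da.from_array(data, chunks=(1, 25, 64, 64))

print(chunked.numblocks)  # number of chunks along each dimension
print(chunked.chunks[0])  # per-chunk sizes along the time dimension
```

With this layout, reading a single time step only touches the chunks of that slice, which is the point of tuning field_chunksize for large 3D+time fields.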

Contributor

@CKehl CKehl left a comment


Looks good so far functionality-wise - I also didn't see anything that could conflict with other functionality. Still, the tests from @cjongedijk will be very important: if you work with a high-resolution depth column, so that you actually prefer to also chunk the depth, then all the computations asking for field.grid.depth[0] and field.grid.depth[-1] may cause conflicts (but that's just a hypothesis).
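As a small illustration of the hypothesis above (plain dask, not the Parcels internals; sizes are made up): once the depth column itself is chunked, even reading the first and last level goes through dask's chunked indexing, touching different chunks of the array.

```python
import numpy as np
import dask.array as da

# A hypothetical high-resolution depth column, chunked along depth
depth = da.from_array(np.linspace(0.0, 500.0, 50), chunks=10)

# Requests for the top and bottom levels hit the first and last chunks
top = depth[0].compute()
bottom = depth[-1].compute()
print(top, bottom)
```

Whether such boundary accesses actually conflict with concurrent chunk loading in Parcels is exactly what the tests would need to show.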

dask.config.set({'array.chunk-size': '128MiB'})
field_set = fieldset_from_swash(chunk_mode)
npart = 20
# lonp = [i for i in np.arange(start=9.1, stop=11.3, step=0.1)[0:20]]
Member


Stray comment?

npart = 20
# lonp = [i for i in np.arange(start=9.1, stop=11.3, step=0.1)[0:20]]
lonp = [i for i in 9.5 + (-0.2 + np.random.rand(npart) * 2.0 * 0.2)]
# latp = [i for i in 12.7+(-0.25+np.random.rand(npart)*2.0*0.25)]
Member


another stray debug comment?

if chunk_mode == 'auto':
chs = 'auto'
elif chunk_mode == 'specific':
# chs = {'x': 4, 'j': 4, 'z': 7, 'z_u': 6, 't': 1}
Member


stray debug comment?


def fieldset_from_swash(chunk_mode):
filenames = path.join(path.join(path.dirname(__file__), 'SWASH_data'), 'field_*.nc')
# filenames = []
Contributor


to-be-removed comment

@erikvansebille erikvansebille merged commit d9184d4 into master Apr 8, 2020
@erikvansebille erikvansebille deleted the 4d_depth_sgrids branch June 23, 2023 12:47