Update to have the 'p' array written to the same file as 'q' and 'aux' #509

weslowrie · 2015-03-05T16:19:40Z

I was playing around with the option to have derived variables written out as described here: http://www.clawpack.org/pyclaw/classes.html#outputting-derived-quantities and noticed that it has two separate calls to the write function, which is different to how the 'aux' array is treated. I decided to combine it all into one call, and make it such that when using the 'compute_p' option, it would add a new dataset to the hdf5 file, or in the ascii case create a file with the name 'fort.pxxxx' to make it consistent with writing out the aux array.

This seems cleaner, but maybe there are some other factors I missed?

I didn't look into 'binary.py' or 'netcdf.py' or into the 'petclaw' directory. Also at first look, it does not look like the ability to read 'p' variables would be necessary?

ketch · 2015-03-06T02:46:41Z

This is part of a number of IO issues that we have never fully sorted out.

I think the current thinking is that writing q and optionally aux is what you need in order to restart the simulation from that point. Thus these files form a checkpoint. The p values, on the other hand, are purely for post-processing purposes. Thus one might want to output p much more frequently than q, or even turn off q altogether. That is not possible at present but would be easy to add under the current code. It has been important in the past to have p in a separate file also because of file sizes and the cost of transferring huge files from a supercomputer or cluster to another file system.

So I appreciate the cleanliness of the code (very much!), but I think this change runs contrary to what we want, long-term.

weslowrie · 2015-03-09T14:35:00Z

@ketch, thanks for the feedback, and that makes sense. I actually frequently have the issue of transferring these sometimes huge files over a network.

With that said, having a single call to the output routine with flags turning on/off output might still be useful? But for the output at different times/frequencies, it might make sense to keep the calls separate?

ketch · 2015-03-10T13:45:56Z

With that said, having a single call to the output routine with flags turning on/off output might still be useful? But for the output at different times/frequencies, it might make sense to keep the calls separate?

I'm not sure I understand -- seting controller.output_format = None does turn off all output.

This part of the code could definitely use some love. If you want to propose your vision for how it should work, that would be wonderful.

weslowrie · 2015-03-11T12:36:11Z

I'm not sure I understand -- seting controller.output_format = None does turn off all output.

I was thinking more along the lines of turning on/off each component of the output (q, aux, and p) even if they are going to be written to different files. The code currently has this capability, but makes two calls to the output routine.

This part of the code could definitely use some love. If you want to propose your vision for how it should work, that would be wonderful.

I'll think some more about possible ways to modify this part of the code.

ahmadia · 2015-03-11T15:06:20Z

I was thinking more along the lines of turning on/off each component of the output (q, aux, and p) even if they are going to be written to different files. The code currently has this capability, but makes two calls to the output routine.

This makes sense to me. @weslowrie - will you be at SIAM this week?

weslowrie · 2015-03-12T00:54:33Z

@ahmadia I was really hoping to make it to SIAM this week, but I couldn't make it work. Maybe the next similar conference.

mandli · 2015-10-24T14:24:13Z

What's the status on this PR?

weslowrie · 2015-10-26T16:49:13Z

I have abandoned this idea, as it probably does not make sense to combine the output files for primary variables and the derived variables. Maybe these routines can still be improved/optimized?

ketch · 2015-10-28T12:05:24Z

I agree -- we'd like to move to a model where we have checkpoint files (infrequent) and separately output files with whatever is needed for post-processing (more frequent). General improvements to this part of the code would certainly be welcome.

Based on Wes' comments, I'm going to close this PR for now.

Update to have the 'p' array written to the same file as 'q' and 'aux'

2316a22

ketch closed this Oct 28, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update to have the 'p' array written to the same file as 'q' and 'aux' #509

Update to have the 'p' array written to the same file as 'q' and 'aux' #509

weslowrie commented Mar 5, 2015

ketch commented Mar 6, 2015

weslowrie commented Mar 9, 2015

ketch commented Mar 10, 2015

weslowrie commented Mar 11, 2015

ahmadia commented Mar 11, 2015

weslowrie commented Mar 12, 2015

mandli commented Oct 24, 2015

weslowrie commented Oct 26, 2015

ketch commented Oct 28, 2015

Update to have the 'p' array written to the same file as 'q' and 'aux' #509

Update to have the 'p' array written to the same file as 'q' and 'aux' #509

Conversation

weslowrie commented Mar 5, 2015

ketch commented Mar 6, 2015

weslowrie commented Mar 9, 2015

ketch commented Mar 10, 2015

weslowrie commented Mar 11, 2015

ahmadia commented Mar 11, 2015

weslowrie commented Mar 12, 2015

mandli commented Oct 24, 2015

weslowrie commented Oct 26, 2015

ketch commented Oct 28, 2015