New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Checksum errors keep popping up #190
Comments
Also Is it something I did? Seems like it's impossible to restart my experiments unless I play dirty and go in the restart |
was the restart created on the same machine? could be a little/big endian issue. I think FMS just writes out the bytes via a Probably something simpler, but thats all that comes to mind at the moment. I agree that usually this only happens when one manually manipulates the fields in, say, another program. |
there is a flag that disables checksums btw, if you cant find it then I'll have a look a bit later. Obviously not recommended in general... |
Navid had this problem before and we did the checksum thing bit it didn't work. It was only that one variable too.
Navid, might be time to make a MOM6 issue.
|
Seems more of at FMS issue but probably best to start with MOM6. Guessing you have confirmed that this happens from a manual execution, and is not a payu problem? |
@aidanheerdegen, sure I can do. I just want to make sure that this is not payu related. [Either way possibly it's @marshallward who will be attending the mom6 issue anyway... ha :)] |
@marshallward no I haven't done that because, actually, I don't know how to do a manual restart without |
Oh! Did you do a code update? I think the If this is what's happening, then I do think the checksum will be different. |
I have recompiled my executable 2 months ago (early Apr...) Did the |
I could have swore that I had seen the method in the latest FMS. But now that I'm looking, all I can see the old version, so I guess I must have been mistaken. Really sorry about that. The only other idea I can see after looking at the code is that there is a sensitivity to the expected fill value. If, say, the CF fill value (1e20) got changed to somethig like the netCDF fill value (something like 9.96e34) then it could cause problems. But I'm really reaching here. Probably best to just
and to do both of these independent of Payu. |
How about "Hallcroft method"? |
@adcroft gave the very wise suggestion of searching for older errors, which came up with this: https://github.com/NOAA-GFDL/MOM6/issues/824 It seems this might be a bug with checksums on 1d arrays, such as a dynamic Anyway I will ask @MJHarrison-GFDL when he comes in, since he seems to have figured it out last time. (If it's still happening then we might want to reopen this in MOM6, but lets confirm first) In the meantime, it seems ok for you to do |
@marshallward, yes I probably changed number of nodes since I'm systematically changing from submitting to |
We tried this: &fms_io_nml
checksum_required=.false.
/ but it still complained about that field (no others). I couldn't find That is more of an FMS issue I guess. |
@aidanheerdegen This looks like a MOM6 config rather than an FMS config. MOM6 actually overrides Try adding this setting to |
@navidcy yes it's useful info, and matches the other issue. I haven't noticed it myself but it looks like the original issue was never resolved and may stillb be present. Hopefully will get some time soon to look into it. |
@marshallward , adding |
Sounds good, thanks for letting me know. I guess we can close this, but can you keep aside the experiment somewhere on raijin? I will grab it and try to reproduce the problem and hopefully find a fix for it this time. |
I keep getting errors of this sort:
See, e.g., job 9363751 in
/home/552/nc3020/SOchanBcBtEddySat/layer2/layer2_tau5e-0_manyshortridges
. (Possibly the logs are archived because I swept.)The text was updated successfully, but these errors were encountered: