Print NetCDF error message if pFIO_NetCDF4_FileFormatterMod::open() fails #962
pfio/NetCDF4_FileFormatter.F90
Outdated
end if
!$omp end critical
A thought from @tclune as we looked at this: Should the !$omp end critical
come after the write?
Thanks for the suggestion. That makes sense. I fixed it.
The reason being that |
b840f72 to ac2b378
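For readers following the review thread, the pattern under discussion can be sketched roughly as below. This is a minimal standalone illustration, not the code from this PR: the program name, the file path, the message wording, and the use of `error stop` in place of MAPL's `_VERIFY(status)` are all assumptions. It only shows the two points raised above: the NetCDF error string and the offending file name are written to stderr when `nf90_open()` fails, and the write happens before `!$omp end critical`.

```fortran
program open_with_error_message
   use, intrinsic :: iso_fortran_env, only: error_unit
   use netcdf, only: nf90_open, nf90_close, nf90_strerror, nf90_noerr, NF90_NOWRITE
   implicit none

   ! Hypothetical path, chosen only for illustration.
   character(len=*), parameter :: file = 'missing_or_unreadable.nc4'
   integer :: ncid, status

   ! The NetCDF call and the diagnostic write share one critical section,
   ! so the message is written before another thread can enter it.
   !$omp critical
   status = nf90_open(file, NF90_NOWRITE, ncid)
   if (status /= nf90_noerr) then
      write(error_unit, '(4a)') 'nf90_open() failed for "', trim(file), '": ', &
           trim(nf90_strerror(status))
   end if
   !$omp end critical

   ! Stand-in for _VERIFY(status): stop with a non-zero exit code on failure.
   if (status /= nf90_noerr) error stop 1

   status = nf90_close(ncid)
end program open_with_error_message
```

Assuming a netCDF-Fortran installation that provides `nf-config`, something like `gfortran -fopenmp sketch.f90 $(nf-config --fflags --flibs)` should build it; running it against a missing or unreadable file should print the file name together with the NetCDF error text before the program stops.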
Doing a test now with GEOS.
Well, I tested this and as far as I can see, it's zero-diff and doesn't do anything in a GEOS run.
I've added @tclune as a reviewer as well. Just so he sees it.
I am also facing this kind of issue in GCHP 13.4.0. Any solution?
@gopikrishnangs44 What version of MAPL is in GCHP 13.4.0? It's possible it has an older MAPL without the fix from @LiamBindle?
@mathomp4 MAPL 2.6.3
@gopikrishnangs44 Ooh. Yeah, that's before this was put into 2.8.3, and I'm not sure it was ever backported to a 2.6 release. Your best bet might be to try and put the changes from @LiamBindle into your code and rebuild. Then at least you can figure out what file is failing.
I have installed the model using spack. Could you please guide me through the steps necessary for a solution?
Ah. Spack. Now that is harder as you don't really have the ability to hand-edit the code and recompile. Hmm. Can you provide your
Please find the attached @mathomp4
@gopikrishnangs44 Okay. I know how to turn on the ExtData debug prints with GEOS, I suppose you can try it with yours. In my console:
class: streamhandler
formatter: basic
unit: OUTPUT_UNIT
level: DEBUG
...
CAP:
level: WARNING
root_level: DEBUG
and get things like:
in my run log. Now it looks like GCHP has extra stuff in their yaml, so maybe this won't do it? First thing to try I suppose. NOTE: This gets very verbose!
I have replaced my logging with and tried to run the model, but got the same error.
@gopikrishnangs44 And you didn't get a zillion prints from extdata in your log?
Dear @mathomp4, I tried installing GCHP 14.1 with updated MAPL. Still I am having the same issue. This is my jobscript for slurm:
I am facing the same issue at the same time step for the new version as well. Log file:
@gopikrishnangs44 Well the answer is one of your input files is missing a date or stopped being provided or something. That's what we need to find out. Maybe try running with this part of your CAP.EXTDATA:
handlers: [mpi_shared]
level: WARNING
root_level: DEBUG
propagate: false
as well as the other debugs for CAP and CONSOLE. Perhaps this is hijacking the debug messages?
Dear @mathomp4 I changed logging.yml to
But the error is still there.
@gopikrishnangs44 At this point I'm not sure what to do as I don't have access to your run or model or machine to try things out. Some file is missing or missing data. You might try asking the GCHP folks how to turn on the debug logger prints for ExtData in their model. I know how to do it in GEOS, but obviously it's different in GCHP. Beyond that, it's adding prints to the MAPL you are building with and re-building.
Description
Hi all, this is a trivial update to print nf90_strerror(status) if nf90_open() in pFIO_NetCDF4_FileFormatterMod::open() fails.
Motivation and Context
Currently there is no error handling if nf90_open() fails in pFIO_NetCDF4_FileFormatterMod::open(). As a result, a file missing read permissions will cause an error message that looks like this:
and the offending file isn't printed.
Now an error message like this is written to stderr
followed by the normal stack traces from the failed _VERIFY(status).
Types of changes
Checklist: