Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fixes #2657 #2658

Merged
merged 3 commits into from
Mar 20, 2024
Merged

fixes #2657 #2658

merged 3 commits into from
Mar 20, 2024

Conversation

bena-nasa
Copy link
Collaborator

Description

Related Issue

As the title says this fixes #2657
If the the binary output is bit shaved, I copy to a new state before writing.

In reality this is only necessary for instantaneous output but I was trying to avoid a whole slew of conditions and it doesn't matter in the end if we make a few extra copies

I've marked this as zero-diff because is is zero-diff for the GEOSgcm.x although it is not for the LDAS.

Motivation and Context

How Has This Been Tested?

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Trivial change (affects only documentation or cleanup)

Checklist:

  • I have tested this change with a run of GEOSgcm (if non-trivial)
  • I have added one of the required labels (0 diff, 0 diff trivial, 0 diff structural, non 0-diff)
  • I have updated the CHANGELOG.md accordingly following the style of Keep a Changelog

@bena-nasa bena-nasa added 0 Diff The changes in this pull request have verified to be zero-diff with the target branch. bugfix labels Mar 18, 2024
@bena-nasa bena-nasa requested a review from a team as a code owner March 18, 2024 20:21
atrayano
atrayano previously approved these changes Mar 18, 2024
weiyuan-jiang
weiyuan-jiang previously approved these changes Mar 18, 2024
@mathomp4 mathomp4 dismissed stale reviews from weiyuan-jiang and atrayano via 9c31ea7 March 19, 2024 12:35
@bena-nasa bena-nasa requested a review from a team as a code owner March 19, 2024 12:35
tclune
tclune previously approved these changes Mar 19, 2024
Copy link
Collaborator

@tclune tclune left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@bena-nasa I think this needs to be changed to nonzero diff. If I understood correctly it really can change answers.

Also ... does this possibly impact the results that the modeling team reported yesterday? Were we shaving one of their fields or only a diagnostic?

@mathomp4 mathomp4 marked this pull request as draft March 19, 2024 12:41
@tclune tclune added Non 0-diff and removed 0 Diff The changes in this pull request have verified to be zero-diff with the target branch. labels Mar 19, 2024
@mathomp4
Copy link
Member

mathomp4 commented Mar 19, 2024

Converting to draft. Per @biljanaorescanin the debug runs of the LDAS are throwing a failure:

GEOS-ESM/GEOSldas#734 (comment)

Intel:

forrtl: severe (408): fort: (3): Subscript #1 of the array DOIT has value -1 which is less than the lower bound of 1

Image PC Routine Line Source
libMAPL.base.so 00002B36FD528A78 biniomod_mp_mapl_ 1988 BinIO.F90
libMAPL.history.s 00002B36FAC560FA mapl_historygridc 3641 MAPL_HistoryGridComp.F90

GNU:

At line 1988 of file /discover/nobackup/borescan/SystemTests/builds/LDAS_GNUCONUS/CURRENT/GEOSldas/src/Shared/@MAPL/base/BinIO.F90
Fortran runtime error: Index '-1' of dimension 1 of array 'doit' below lower bound of 1

I suppose the "good" thing is it's the same error it seems.

@mathomp4
Copy link
Member

@bena-nasa I think this needs to be changed to nonzero diff. If I understood correctly it really can change answers.

Also ... does this possibly impact the results that the modeling team reported yesterday? Were we shaving one of their fields or only a diagnostic?

@tclune I don't think anyone in the AGCM world operates in such a way to trigger this. But I'll work with @bena-nasa to get specifics on when this could be triggered so we can let people know in the release notes and changelog.

@bena-nasa
Copy link
Collaborator Author

@biljanaorescanin
Matt showed me the error you were seeing in the debug run. I will look into this. Hang tight

need to copy whole state
@mathomp4
Copy link
Member

The CI failure looks to be some weird issue on their side. I'm re-launching

@biljanaorescanin
Copy link

With last commit all my land tests passed.

@bena-nasa
Copy link
Collaborator Author

bena-nasa commented Mar 20, 2024

I did some spot checking of the output files by using that converter, made sure that the shaved vs non-shaved output was right in that each variable was "right" in that they were basically the same modulo the bit shaving. I.E. I didn't do something like somehow write the variables in the wrong order in the binary file in say the shaved path.
So yeah, sounds like good to go

@tclune tclune marked this pull request as ready for review March 20, 2024 12:58
@tclune tclune self-requested a review March 20, 2024 12:58
@tclune tclune merged commit c5707a2 into main Mar 20, 2024
31 checks passed
@tclune tclune deleted the hotfix/bmauer/fixes-#2657 branch March 20, 2024 12:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Bit shaving and binary file output in History
6 participants