-
Notifications
You must be signed in to change notification settings - Fork 162
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Migrate to Rocky8 spack-stack installations on Jet #2377
Comments
FYI @souopgui |
There is a kJet maintenance downtime planned for 03/26; |
All UFS model tests pass with Rocky8 install of spack-stack on Jet; Log file attached. |
@InnocentSouopgui-NOAA Since it won't be possible to validate against CentOS on Jet, I have adjusted the acceptance criteria to being able to run cycled and forecast-only experiments. |
I have everything running smoothly on xjet and kjet partitions. The changes I made include:
@DavidHuber-NOAA, DO you want to have a look at those runs before I submit the pull requests for those changes? |
Cycled experiments (48+ hours) at resolutions
Forecast only experiment (48+ hours) at resolutions
|
- Update Jet module file to use Rocky8 installation of spack-stack; - Jet has been upgraded to the Rocky8 Linux OS and present module file no longer works Resolves #60 Refs NOAA-EMC/global-workflow#2377
@DavidHuber-NOAA How to run a forecast only experiment on Jet?
|
@InnocentSouopgui-NOAA If you run an S2SW forecast-only experiment, they should populate automatically from files stored on-site in @KateFriedman-NOAA @WalterKolczynski-NOAA Would one of you be able to run an rsync to check/update the P8 coupled IC files on Jet? |
I synced some of the prototype_ICs to Jet this morning (the Based on the settings in |
I have been trying to recompile and test the global workflow that I had compiled on Jet’s CentOS7 on Rocky8. The workflow was previously compiled using intel/18.0.5.274 and impi/2018.4.274, and this version is compiled using intel/2022.1.2 and impi/2022.1.2. I have recompiled the workflow, including the global model component and the same version of all previously utilized libraries. When I test the global model using the same initial conditions as previously run on the CentOS7 workflow, however, I get noticeable differences between the two versions. At 0 h, I am getting a difference of up to 1.5% the post-processed 250 hPa horizontal wind speed. By 72 h, this difference grows to up to 15.5%. To my knowledge, I am using the same model version, the same post-processor version (which when tested alone does not seem to have any issues in reproducibility), the same initial conditions, the same namelist file, the same version of the dependency libraries, the same fix files, and stochastic physics turned off in both versions. The only differences that I can tell are the compiler and MPI versions. Are differences this large, starting from 0h, expected from only changing the versions of compiler and/or MPI? Have others been facing a similar issue? Thank you for any help you can provide or if you can direct me to the best place to ask this question! |
@erinaj16 I would expect some differences, even using the same code and other libraries (assuming nothing else changed beyond intel and OS). This is a big jump in intel version and a new OS for the version you're trying to continue using. If we saw large differences between the version of the system immediately preceding the Rocky8 upgrade (CentOS7 spack-stack) and the system ported to Rocky8 spack-stack then I would be concerned. We can't support older versions (e.g. intel 2018) so I'm not sure how to help from the workflow side of things. From your information it seems like it's just the forecast model showing differences? If so, you could show the above to the ufs-weather-model folks and see if they have some thoughts on the differences. They likely can't support older versions either though, just a caution. :-/ |
@InnocentSouopgui-NOAA I merged the Fit2Obs PR, cut a new tag (wflow.1.1.1) and installed it on Jet: I will install this new tag on all supported platforms. You can update the |
I have installed the Fit2Obs tag on Hera and both WCOSS2s. @WalterKolczynski-NOAA please install this new Fit2Obs tag on Orion, thanks!
|
@InnocentSouopgui-NOAA I have installed the updated prepobs Please update this line in the Jet workflow modulefile: https://github.com/NOAA-EMC/global-workflow/blob/develop/modulefiles/module_base.jet.lua#L46. Change "feature-GFSv17_com_reorg_log_update" to "dev/gfsv17". |
@KateFriedman-NOAA, @DavidHuber-NOAA, |
@InnocentSouopgui-NOAA I have approved your TC_tracker PR but will wait and give the TC_tracker CM a chance to review, comment, and/or approve before I merge and update our installs. |
@InnocentSouopgui-NOAA Since the update within Fit2Obs is only for building on Jet, the other systems shouldn't be impacted by going to this new version. We can run CI tests for the PR branch on the other platforms to check for any impacts. |
It looks like the runtime version for Fit2Obs is pulled from |
Yep! If we hit issues with Fit2Obs in the CI testing before merge then we can change it just for Jet in |
@KateFriedman-NOAA, |
My bad, sorry, forgot we needed to update that within the |
@DavidHuber-NOAA, we are missing two things for the migration to Rocky8 on Jet.
Shall I open the pull request for the migration of global workflow for review while waiting for those two, or wait a little? |
@InnocentSouopgui-NOAA Yes, I think it's fine if you do that. I will ping Russ and Jiayi on the GSI and TC_trracker PRs, respectively, to see if we can get them merged. EDIT: |
# Description Migrates Global Workflow to Rocky8 spack-stack installations on Jet. Jet has moved from CentOS7 to Rocky8. Resolves #2377 Refs NOAA-EMC/UPP#919 Refs NOAA-EMC/gfs-utils#60 Refs NOAA-EMC/GSI#732 Refs NOAA-EMC/GSI-Monitor#130 Refs NOAA-EMC/GSI-utils#33
What new functionality do you need?
Jet is upgrading to the Rocky8 Linux OS, which requires a new spack-stack installation on the platform. The global workflow, subcomponents, and external dependencies will need to be recompiled and tested on the platform before full transition in mid-April.
What are the requirements for the new functionality?
/lfs4/HFIP/hfv3gfs/role.epic/spack-stack/spack-stack-1.6.0/envs/gsi-addon-dev-rocky8
/lfs4/HFIP/hfv3gfs/role.epic/spack-stack/spack-stack-1.5.1/envs/unified-env-rocky8
Acceptance Criteria
The global workflow is able to run in both cycled and forecast-only modes at resolutions up to C384 with all subcomponents and external dependencies running successfully.
Suggest a solution (optional)
No response
The text was updated successfully, but these errors were encountered: