Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improved station sampler #2838

Merged
merged 29 commits into from
Jun 24, 2024
Merged

Conversation

metdyn
Copy link
Contributor

@metdyn metdyn commented May 21, 2024

Types of change(s)

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Trivial change (affects only documentation or cleanup)

Checklist

  • Tested this change with a run of GEOSgcm
  • Ran the Unit Tests (make tests)

Description

The previous version for station sampler which is the first (and the simplest) sampler we implemented a year ago shows significant slowness for NOAA GHCNd (124954 stations). The previous version used LocStream on root only. Now we have added intermediate steps: LS(uniformly distributed on cores), LS (w CS bkg), MPI_gatherV. Running ExtDatadirver.x on discover with 8640 cores for C1440-L137 shows 1.39 s for GHCNd for 1 write statement with 10 3d-variables.

Summary: 490 s runtime is reduced to 6.9 s for station sampler with GHCNd input

@metdyn metdyn requested a review from a team as a code owner May 21, 2024 17:35
@metdyn metdyn added the 0 Diff Trivial The changes in this pull request are trivially zero-diff (documentation, build failure, &c.) label May 21, 2024
tclune
tclune previously requested changes May 21, 2024
Copy link
Collaborator

@tclune tclune left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

See inline comments.

@metdyn
Copy link
Contributor Author

metdyn commented May 23, 2024

Thank you, Tom. I need to do a few control experiment to examine the effect on netCDF write and how much regridding time has changed. I am closing this PR to work offline.

@metdyn metdyn closed this May 23, 2024
@metdyn metdyn reopened this May 23, 2024
@metdyn
Copy link
Contributor Author

metdyn commented Jun 13, 2024

work offline for improvement

@metdyn metdyn closed this Jun 13, 2024
@metdyn metdyn reopened this Jun 17, 2024
@metdyn
Copy link
Contributor Author

metdyn commented Jun 24, 2024

See inline comments.

Major efficiency issues have been resolved.

@metdyn metdyn dismissed tclune’s stale review June 24, 2024 15:10

The code has been changed significantly since the last review.

@tclune tclune self-requested a review June 24, 2024 15:14
@mathomp4
Copy link
Member

490 s runtime is reduced to 6.9 s for station sampler with GHCNd input

🤯

@mathomp4
Copy link
Member

Something is wrong with CircleCI. It's not reporting the success. Merging with powers.

@mathomp4 mathomp4 merged commit d60d6f2 into develop Jun 24, 2024
36 of 38 checks passed
@mathomp4 mathomp4 deleted the feature/ygyu/revise_stataion_sampler branch June 24, 2024 16:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
0 Diff Trivial The changes in this pull request are trivially zero-diff (documentation, build failure, &c.)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants