Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MPI AMIP FINE memory issues #192

Closed
juliasloan25 opened this issue Dec 14, 2022 · 4 comments · Fixed by #279
Closed

MPI AMIP FINE memory issues #192

juliasloan25 opened this issue Dec 14, 2022 · 4 comments · Fixed by #279
Assignees
Labels
bug Something isn't working 🔥 Urgent

Comments

@juliasloan25
Copy link
Member

juliasloan25 commented Dec 14, 2022

Recently, the MPI AMIP FINE test in CI has begun failing nondeterministically, initially here. The error suggests that this is due to a lack of memory.

For now, this issue is being addressed by increasing the memory allocation for this test (see PR #195), but we should find and fix the root cause of this problem.

One task that may help enforce reproducibility is adding compat entries in the Project.toml files in this repo.

ClimaCore #1113 has resolved the memory usage issues. #279 removes the extra memory allocations we had added to the buildkite driver as a workaround.

Fixed by #279.

@juliasloan25 juliasloan25 added the bug Something isn't working label Dec 14, 2022
@LenkaNovak
Copy link
Collaborator

@charleskawczynski would you be able to help with this please? Does ClimaAtmos suffer from similar issues?

@charleskawczynski
Copy link
Member

I think I'd recommend first updating to the latest versions. If this job is failing consistently, then maybe change it to a soft fail. I have some ideas about what might help, I'll chat with @simonbyrne tomorrow

@juliasloan25
Copy link
Member Author

The memory issues seem to have originated in ClimaCore PR#978. ClimaCore has relevant issues #948 and #1064, which note the issue there. Hopefully once those are addressed, the memory usage issues in the coupler will be resolved as well.

@LenkaNovak
Copy link
Collaborator

Thanks for the update, @juliasloan25 ! :)

@juliasloan25 juliasloan25 self-assigned this Feb 7, 2023
@bors bors bot closed this as completed in #279 Apr 11, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working 🔥 Urgent
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants