Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support detecting compute nodes of E3SM machines #134

Closed
chengzhuzhang opened this issue Sep 21, 2023 · 7 comments · Fixed by #135
Closed

Support detecting compute nodes of E3SM machines #134

chengzhuzhang opened this issue Sep 21, 2023 · 7 comments · Fixed by #135

Comments

@chengzhuzhang
Copy link

chengzhuzhang commented Sep 21, 2023

There has been issues raised that when running on compute nodes, analysis tools won't auto detect machines: e.g. E3SM-Project/zppy#406
The suggested solution for now is to manually specify machine name, e.x

[default]
machine = chrysalis

or if running in E3SM-Unified, to use the environment variable, e.x.

machine = os.environ["E3SMU_MACHINE"]

It might be viable to detect machines based on compute node names if there is a pattern. Though it can be challenging based on comment from @xylar: E3SM-Project/zppy#511 (comment)

It seems that we can start to add support for certain machines that has patterns in their compute nodes names.

@xylar
Copy link
Collaborator

xylar commented Sep 21, 2023

Perfect, thanks! I will give it a try...

@xylar
Copy link
Collaborator

xylar commented Sep 21, 2023

@chengzhuzhang, we have a backup option already, where we supposedly create a file when mache gets installed and read the machine name from it:
https://github.com/E3SM-Project/mache/blob/main/mache/discover.py#L59-L67
I wonder why that's not working.

@xylar
Copy link
Collaborator

xylar commented Sep 21, 2023

The answer is that I clearly forgot to implement this on conda-forge.

@chengzhuzhang
Copy link
Author

@chengzhuzhang, we have a backup option already, where we supposedly create a file when mache gets installed and read the machine name from it: https://github.com/E3SM-Project/mache/blob/main/mache/discover.py#L59-L67

Smart!

@xylar
Copy link
Collaborator

xylar commented Sep 21, 2023

This should be fixed in different ways by conda-forge/mache-feedstock#54 and #135. Hopefully, we really clobber this one once and for all...

@xylar
Copy link
Collaborator

xylar commented Sep 21, 2023

(Sorry for getting a bit annoyed earlier. It's been a long, long day...)

@chengzhuzhang
Copy link
Author

Thank you for getting to it so quickly. (and the day is indeed too long, looking at current Berlin time...)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants