Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue on Quartz: HYDU_sock_connect unable to connect #148

Closed
spasmann opened this issue Jan 16, 2024 · 2 comments
Closed

Issue on Quartz: HYDU_sock_connect unable to connect #148

spasmann opened this issue Jan 16, 2024 · 2 comments
Labels
hpc Issues relating to HPC deployments

Comments

@spasmann
Copy link
Collaborator

Running into an issue on Quartz when I submit jobs from my PC (jobs run normally when submitting submitting from my Mac). All PC batch jobs fail with multiple errors like:

[proxy:0:23@quartz2664] HYDU_sock_connect (lib/utils/sock.c:140): unable to connect from "quartz2664" to "quartz79" (No route to host)
[proxy:0:23@quartz2664] main (proxy/pmip.c:105): unable to connect to server quartz79 at port 43377 (check for firewalls!)
srun: error: quartz2664: task 23: Exited with exit code 5

This is occurring regardless of branch or input deck. Based on similar bugs here and here it seems to be an issue either with my ssh or hostname issue.

I've submitted an LC-Hotline ticket and will report back what they say.

@jpmorgan98
Copy link
Collaborator

Are you sure you are using the MPI package from quartz and building off of that? Type which mpirun, which mpiexec and or which srun. Also check the module maybe? If you tell me exactly what to pull I can start trying out different versions/distributions of MPI

@jpmorgan98 jpmorgan98 added the hpc Issues relating to HPC deployments label Jan 19, 2024
@spasmann
Copy link
Collaborator Author

spasmann commented Feb 7, 2024

Never heard back from LC-Hotline, but the issue went away the next day. Closing the issue for now.

@spasmann spasmann closed this as completed Feb 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
hpc Issues relating to HPC deployments
Projects
None yet
Development

No branches or pull requests

2 participants