TACC Singularity container in RHEL getting Fatal error in PMPI_Init_thread: Other MPI error, error stack #5137
Comments
Hi @RobbieTheK, sorry for being slow to respond. Yes, both issues seem to be related to the interaction between the images and the system you are running the images on. To say more I would need to know more about the system you are using, but here is some general information about the images:
Re: the first error: correct, this is not a TACC system; I was just trying to see if it would work. This is a Bright Computing 9.1 cluster running RHEL 8 with Slurm 20 and openmpi/gcc/64/4.1.5a1. I'm using an interactive srun job with -c4 -n4 as options.
Same error:
I have not tried using
@RobbieTheK Can I assume that using a batch script solved the problem?
No, I ended up building and compiling candi and deal.II myself. I'd be happy to try again with sbatch, but as I mentioned, I did try with srun to no avail.
What I meant to ask is whether you found a way to make it work for you?
I don't know that I have anything to offer. I don't know much about Singularity (or containers in general) and I don't work on the TACC machines. I'm also not sure we have the resources as a project to really figure this out. @gassmoeller @tjhei Do you have anything to offer? Or should we just say "We'd love to provide this, but we can't" and close the issue?
Using geodynamics/aspect:latest-tacc with Singularity version 3.7.1 on RHEL 8 with OpenMPI 4.1.5a1.
I also took a shot at using the Docker image and pulling it into Singularity.
That's likely from not having the same version of OpenMPI, am I correct?