FDS6 6.5.3 - Fatal error in PMPI_init_thread Windows MPI runs #5162

Closed
rpas1231 opened this issue Jun 22, 2017 · 16 comments

@rpas1231

rpas1231 commented Jun 22, 2017

Hi,

I recently started using FDS6 at work and am trying to run mpiexec across multiple computers. I followed the FDS 6 User Guide to set up MPI runs on Windows. I have tested the commands below, and they seem to work fine.

mpiexec -n 4 fds CHID.fds test_mpi -> returns hello world
mpiexec -hosts 2 machine1 1 machine2 1 test_mpi -> returns hello world
mpiexec -hosts 1 machine1 2 -wdir \shared_directory_on_the_network\path fds CHID.fds
mpiexec -hosts 1 machine2 2 -wdir \shared_directory_on_the_network\path fds CHID.fds (from the other machine)

However, when I specify both the working directory and the two machines (shown below), I get the error in the attached image.

mpiexec -hosts 2 machine1 1 machine2 1 -wdir \shared_directory_on_the_network\path fds CHID.fds

I have tried doing this at home across two computers, and it works fine there as well. The only thing I can think of is that the issue is related to one of the following:

  • Work network restrictions
  • The fact that the shared directory is located in C:\program files... (which is not advised per the FDS6 user guide; however, this was the only option IT gave me at work)

I would appreciate any information or alternatives regarding this issue. This is my first time posting, so please let me know if you need me to upload any extra information. I am attaching the test file 2MeshSim.fds used for MPI testing and a screenshot of the error I am getting.
Thanks,
Roger

mpiexec_error

2MeshSim.fds.txt

@rpas1231 rpas1231 changed the title from "FDS6 6.5.3" to "FDS6 6.5.3 - Fatal error in PMPI_init_thread Windows MPI runs" Jun 22, 2017
@mcgratta mcgratta self-assigned this Jun 22, 2017
@mcgratta
Contributor

For one thing, you are trying to run a 2 mesh case with four CPUs; 2 on machine1 and 2 on machine2. Also, I do not understand what this command means:

mpiexec -n 4 fds CHID.fds test_mpi -> returns hello world

Are you trying to run fds or the test_mpi program?
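
As a rough illustration of the mismatch raised above, the sketch below counts `&MESH` records in an FDS input file and compares the result with the total process count in a `-hosts` specification of the form used in this thread. The helper names and the sample input are hypothetical (the sample is not the attached 2MeshSim.fds), and the check assumes one MPI process per mesh and ignores mesh replication via `MULT_ID`, so it is only a sanity check.

```python
import re

def count_meshes(fds_text):
    """Count namelist records that start with &MESH (ignores MULT_ID replication)."""
    return len(re.findall(r"^\s*&MESH\b", fds_text, flags=re.MULTILINE | re.IGNORECASE))

def total_processes(hosts_args):
    """Sum process counts from the arguments after -hosts: N host1 n1 host2 n2 ..."""
    n_hosts = int(hosts_args[0])
    pairs = hosts_args[1:1 + 2 * n_hosts]
    if len(pairs) != 2 * n_hosts:
        raise ValueError("expected a host name and a process count for each host")
    # Every second entry (index 1, 3, ...) is a process count.
    return sum(int(count) for count in pairs[1::2])

# Hypothetical two-mesh input, made up for illustration only.
sample_input = """
&HEAD CHID='example' /
&MESH IJK=10,10,10, XB=0,1,0,1,0,1 /
&MESH IJK=10,10,10, XB=1,2,0,1,0,1 /
&TAIL /
"""

meshes = count_meshes(sample_input)
procs = total_processes(["2", "machine1", "1", "machine2", "1"])
print(f"meshes={meshes} processes={procs} match={meshes == procs}")
```

With `-hosts 2 machine1 2 machine2 2`, the same check would report four processes against two meshes.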

@rpas1231
Author

rpas1231 commented Jun 22, 2017 via email

@mcgratta
Contributor

I cannot say what is wrong. If you say that you can run a case across two machines, I cannot say why you can or cannot run another case on another pair of machines. I do not understand the error message.

@rpas1231
Author

rpas1231 commented Jun 22, 2017 via email

@sbenkorichi
Collaborator

Hmm, I haven't had a chance to try it myself and dig into it, but this might be an administrator privileges issue on both computers you are trying to use.
I would suggest you follow these steps and let us know at which step you get stuck.

https://github.com/firemodels/fds/wiki/Installing-and-Running-FDS-on-a-Windows-PC

@rpas1231
Author

rpas1231 commented Jun 23, 2017 via email

@mcgratta
Contributor

Make sure that your computers are on a Windows Domain Network; that is, you can log in to each machine using the same name and password. But even then, my experience with running FDS across two Windows machines is that it sometimes fails with an error message I cannot understand. I am not an expert on Windows or networks, so I cannot say what your particular problem is.

Is FDS installed on both machines? Same version? Same location? Are the two machines identical in the version of the OS?

@rpas1231
Author

rpas1231 commented Jun 26, 2017 via email

@mcgratta
Contributor

Log in to each computer and see if you can access the shared directory EXACTLY as you have specified it in your mpiexec command. Do not use a directory within Program Files or other restricted parts of the computer.
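
One way to run that check on each machine is a small script like the sketch below. It is only an illustration: the function names are hypothetical, the "restricted" test is a simple heuristic that looks for a Program Files component in the path, and the path passed in should be the exact string given to `-wdir`.

```python
import os
import tempfile
from pathlib import PureWindowsPath

def in_restricted_location(path):
    """Heuristic: True if any path component is a Program Files folder."""
    parts = [p.lower() for p in PureWindowsPath(path).parts]
    return any(p in ("program files", "program files (x86)") for p in parts)

def can_write(path):
    """Try to create (and automatically delete) a temporary file in the directory."""
    try:
        with tempfile.TemporaryFile(dir=path):
            return True
    except OSError:
        return False

def check_workdir(path):
    """Report whether the directory exists, is writable, and looks restricted."""
    exists = os.path.isdir(path)
    return {
        "exists": exists,
        "writable": exists and can_write(path),
        "restricted": in_restricted_location(path),
    }

# Example call using the local temp directory as a stand-in;
# replace it with the exact path passed to -wdir.
print(check_workdir(tempfile.gettempdir()))
```

If the result differs between the two machines, the working directory is a likely suspect.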

@rpas1231
Author

rpas1231 commented Jun 27, 2017 via email

@mcgratta
Contributor

Any luck with this issue?

@rpas1231
Author

rpas1231 commented Aug 17, 2017 via email

@mcgratta
Contributor

This is why more and more users are switching to Linux clusters or cloud computing services. I cannot offer much advice about Windows.

@sbenkorichi
Collaborator

@rpas1231
We've just released FDS 6.6. See if that helps with your issue, and report back.

@rpas1231
Author

rpas1231 commented Nov 3, 2017 via email

@heyJ0w

heyJ0w commented Aug 6, 2018

I would like to know what the solution was, because I have the same problem.
