-
Notifications
You must be signed in to change notification settings - Fork 274
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MPICH2 fpi.exe hanging on Windows XP #36
Comments
Originally by Ayer, Timothy C. on 2008-08-04 09:32:36 -0500 This message has 1 attachment(s) |
Originally by Ayer, Timothy C. on 2008-08-04 09:32:36 -0500 Attachment added: |
Originally by Jayesh Krishna on 2008-08-04 09:58:49 -0500 Hi, (PS: Instead of the "-hosts" option you could try using the "-machinefile" Regards, -----------------------------------------------------------+------------ I am testing MPICH2 MPICH2-1.0.7 Windows XP (sp2). I have installed it The following tests work fine from hostA, both prompt for a number of mpiexec.exe -hosts 2 hostA hostA \hostA\temp\fpi.exe mpiexec.exe -hosts 2 hostB hostB \hostA\temp\fpi.exe The following test hangs when submitted from hostA (in MPI_Bcast). It mpiexec.exe -hosts 2 hostA hostB \hostA\temp\fpi.exe Any suggestions would be appreciated. Also let me know if you want me Thanks, Timothy C. Ayer <<fpi.f>> Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36 |
Originally by Jayesh Krishna on 2008-08-04 09:58:49 -0500 Attachment added: |
Originally by Jayesh Krishna on 2008-08-04 13:01:41 -0500 Attachment added: |
Originally by Jayesh Krishna on 2008-08-04 13:01:41 -0500 You should try, mpiexec.exe -map y:\hostA\temp -hosts 2 hostA hostB y:\fpi.exe Let us know if it works for you. (PS: The shared drive is accessible across machines because the drive is Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] The exe can be directly accessed from hostB by executing Note: I did try: mpiexec.exe -map y:\hostA\temp -hosts 2 hostA hostB The interesting part is that it gets through the initialization:
All execute. Thanks, From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] How (what mechanism) does hostB access data (exe) in hostA ? Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Thanks Jayesh for the quick reply. This is a network availabe UNC path - I am familiar with the machines file - I was just using the command line From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Hi, (PS: Instead of the "-hosts" option you could try using the "-machinefile" Regards, -----------------------------------------------------------+------------ I am testing MPICH2 MPICH2-1.0.7 Windows XP (sp2). I have installed it The following tests work fine from hostA, both prompt for a number of mpiexec.exe -hosts 2 hostA hostA \hostA\temp\fpi.exe mpiexec.exe -hosts 2 hostB hostB \hostA\temp\fpi.exe The following test hangs when submitted from hostA (in MPI_Bcast). It mpiexec.exe -hosts 2 hostA hostB \hostA\temp\fpi.exe Any suggestions would be appreciated. Also let me know if you want me Thanks, Timothy C. Ayer <<fpi.f>> Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36 |
Originally by Ayer, Timothy C. on 2008-08-13 13:26:47 -0500 Hello, Was there actually a bug that has been fixed? ...so I should download I had sent some smpd -d output to Jayesh Krishna on 8/5/2008 but did not Thanks for your help. -----Original Message----- ------------------------------------------------------------+---------------
closed ------------------------------------------------------------+--------------- Changes (by thakur):
Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment:4 |
Originally by Ayer, Timothy C. on 2008-08-13 13:32:48 -0500 Sorry, I did read that message I was just a little surprised. Thank you. Tim -----Original Message----- ------------------------------------------------------------+---------------
closed ------------------------------------------------------------+--------------- Comment (by Ayer, Timothy C.): Hello, Was there actually a bug that has been fixed? ...so I should download I had sent some smpd -d output to Jayesh Krishna on 8/5/2008 but did not Thanks for your help. -----Original Message----- ------------------------------------------------------------+---------------
closed ------------------------------------------------------------+--------------- Changes (by thakur):
Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment: |
Originally by Jayesh Krishna on 2008-08-13 13:58:26 -0500 Attachment added: |
Originally by Jayesh Krishna on 2008-08-13 13:58:26 -0500 Hi, Can you try compiling icpi.c (MPICH2\examples) and run the program inyour setup (Make sure that the problem is not related to fortran I have seen that some times that the uninstall/install of MPICH2 doesnot result in the dlls being updated correctly (This has lead to some
Send us the results for verification (Sanity check- they should have the Also when running fpi.exe using your setup try leaving the job (or maybe specify a timeout of 10 mins or so) for 10mins or so and see if it (PS: The MPICH2 1.1.0a1 release Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Please find attached the output from the smpd -d procs. Also, the output H:>mpiexec.exe -map v:\10.30.73.170\temp -hosts 2 10.30.73.170 From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] The socket/channel connection between the MPI processes take place during From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] The firewall has been disabled. The inputs were from me entering values for estimating pi...I wanted to I will send the other debug output a little later. Also, as an fyi, we have been running MPICH on thousands of PC's for From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Do you have windows firewall (or any firewall) running on these machines? Why do I see two inputs (10 & 100) in the mpiexec debug output ?Can you send us the debug output of smpd along with mpiexec ?Can you check the status of the remote smpd from each host ?--- On host A, run "smpd -status IPAddressOf_hostB" (PS: I just tried running fpi.exe in a shared drive across two 32-bit Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] This is the same fpi.f which comes with the installation with the The setup is homogenous (both 32-bit). The output is attached. Thanks for your help. Tim From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Are you running fpi.exe (fpi.f) provided with MPICH2 (Have you modifiedthe program ?)? I am assuming that the setup is not heterogeneous (MPICH2 currently doesnot support running jobs across machines with different data models eg: Please provide us with the debug/verbose output when running fpi.exe.Start smpd on both the machines in debug mode (1. Stop any instances of Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Thanks, here is the output (note: I have not included IP address or mpiexec.exe -map y:\IPAddressOf_hostA\temp -hosts 2 IPAddressOf_hostAIPAddressOf_hostB y:\fpi.exe OUTPUT: mpiexec.exe -map y:\IPAddressOf_hostA\temp hostnameXXXXXX (hostname of hostA) From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] The command hostname (c:\windows\system32\hostname.exe) Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] You have "hostname" at the end of the second line...what is that referring From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] What is the error message (output) that you get when you run mpiexec ? mpiexec.exe -map y:\IPAddressOf_hostA\temp -hosts 2 IPAddressOf_hostAIPAddressOf_hostB y:\fpi.exe mpiexec.exe -map y:\IPAddressOf_hostA\temp hostnameRegards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] No this does not work...the behavior is the same. The UNC's should/have From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] You should try, mpiexec.exe -map y:\hostA\temp -hosts 2 hostA hostB y:\fpi.exe Let us know if it works for you. (PS: The shared drive is accessible across machines because the drive is Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] The exe can be directly accessed from hostB by executing Note: I did try: mpiexec.exe -map y:\hostA\temp -hosts 2 hostA hostB The interesting part is that it gets through the initialization:
All execute. Thanks, From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] How (what mechanism) does hostB access data (exe) in hostA ? Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Thanks Jayesh for the quick reply. This is a network availabe UNC path - I am familiar with the machines file - I was just using the command line From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Hi, (PS: Instead of the "-hosts" option you could try using the "-machinefile" Regards, -----------------------------------------------------------+------------ I am testing MPICH2 MPICH2-1.0.7 Windows XP (sp2). I have installed it The following tests work fine from hostA, both prompt for a number of mpiexec.exe -hosts 2 hostA hostA \hostA\temp\fpi.exe mpiexec.exe -hosts 2 hostB hostB \hostA\temp\fpi.exe The following test hangs when submitted from hostA (in MPI_Bcast). It mpiexec.exe -hosts 2 hostA hostB \hostA\temp\fpi.exe Any suggestions would be appreciated. Also let me know if you want me Thanks, Timothy C. Ayer <<fpi.f>> Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36 |
Originally by Rajeev Thakur on 2008-08-13 14:10:11 -0500 Tim, Rajeev |
Originally by Ayer, Timothy C. on 2008-08-13 14:11:09 -0500 Hi Jayesh, Great to hear from you. I will try your suggestions (icpi.c and slow Also here is the output you requested. I have been wondering why the dates Thanks, C:\WINDOWS\system32>dir c:\windows\system32\mpe*.dll Directory of c:\windows\system32 04/04/2008 05:46 PM 135,168 mpe.dll C:\WINDOWS\system32> C:\WINDOWS\system32>dir dir c:\windows\system32\mpich2*.dll Directory of C:\WINDOWS\system32 Directory of C:\WINDOWS\system32 04/04/2008 05:28 PM 1,110,016 mpich2.dll -----Original Message----- ------------------------------------------------------------+--------------- ------------------------------------------------------------+--------------- Comment (by Jayesh Krishna): Hi, Can you try compiling icpi.c (MPICH2\examples) and run the program inyour setup (Make sure that the problem is not related to fortran I have seen that some times that the uninstall/install of MPICH2 doesnot result in the dlls being updated correctly (This has lead to some
Send us the results for verification (Sanity check- they should have the Also when running fpi.exe using your setup try leaving the job (or maybe specify a timeout of 10 mins or so) for 10mins or so and see if it (PS: The MPICH2 1.1.0a1 release Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Please find attached the output from the smpd -d procs. Also, the output H:>mpiexec.exe -map v:\10.30.73.170\temp -hosts 2 10.30.73.170 From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] The socket/channel connection between the MPI processes take place during From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] The firewall has been disabled. The inputs were from me entering values for estimating pi...I wanted to I will send the other debug output a little later. Also, as an fyi, we have been running MPICH on thousands of PC's for From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Do you have windows firewall (or any firewall) running on these machines? Why do I see two inputs (10 & 100) in the mpiexec debug output ?Can you send us the debug output of smpd along with mpiexec ?Can you check the status of the remote smpd from each host ?
(PS: I just tried running fpi.exe in a shared drive across two 32-bit Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] This is the same fpi.f which comes with the installation with the The setup is homogenous (both 32-bit). The output is attached. Thanks for your help. Tim From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Are you running fpi.exe (fpi.f) provided with MPICH2 (Have you modifiedthe program ?)? I am assuming that the setup is not heterogeneous (MPICH2 currently doesnot support running jobs across machines with different data models eg: Please provide us with the debug/verbose output when running fpi.exe.Start smpd on both the machines in debug mode (1. Stop any instances of Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Thanks, here is the output (note: I have not included IP address or mpiexec.exe -map y:\IPAddressOf_hostA\temp -hosts 2 IPAddressOf_hostAIPAddressOf_hostB y:\fpi.exe OUTPUT: mpiexec.exe -map y:\IPAddressOf_hostA\temp hostnameXXXXXX (hostname of hostA) From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] The command hostname (c:\windows\system32\hostname.exe) Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] You have "hostname" at the end of the second line...what is that referring From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] What is the error message (output) that you get when you run mpiexec ? mpiexec.exe -map y:\IPAddressOf_hostA\temp -hosts 2 IPAddressOf_hostAIPAddressOf_hostB y:\fpi.exe mpiexec.exe -map y:\IPAddressOf_hostA\temp hostnameRegards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] No this does not work...the behavior is the same. The UNC's should/have From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] You should try, mpiexec.exe -map y:\hostA\temp -hosts 2 hostA hostB y:\fpi.exe Let us know if it works for you. (PS: The shared drive is accessible across machines because the drive is Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] The exe can be directly accessed from hostB by executing Note: I did try: mpiexec.exe -map y:\hostA\temp -hosts 2 hostA hostB The interesting part is that it gets through the initialization:
All execute. Thanks, From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] How (what mechanism) does hostB access data (exe) in hostA ? Regards, From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Thanks Jayesh for the quick reply. This is a network availabe UNC path - I am familiar with the machines file - I was just using the command line From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Hi, (PS: Instead of the "-hosts" option you could try using the "-machinefile" Regards, -----------------------------------------------------------+------------ I am testing MPICH2 MPICH2-1.0.7 Windows XP (sp2). I have installed it The following tests work fine from hostA, both prompt for a number of mpiexec.exe -hosts 2 hostA hostA \hostA\temp\fpi.exe mpiexec.exe -hosts 2 hostB hostB \hostA\temp\fpi.exe The following test hangs when submitted from hostA (in MPI_Bcast). It mpiexec.exe -hosts 2 hostA hostB \hostA\temp\fpi.exe Any suggestions would be appreciated. Also let me know if you want me Thanks, Timothy C. Ayer <<fpi.f>> -- Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment: |
Originally by Ayer, Timothy C. on 2008-08-13 14:13:57 -0500 Thanks for letting me know. I knew something was up...this explains it :) Jayesh and I are currently "discussing" it. ;) -----Original Message----- ------------------------------------------------------------+--------------- ------------------------------------------------------------+--------------- Comment (by Rajeev Thakur): Tim, Rajeev Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment: |
Originally by Jayesh Krishna on 2008-08-13 14:37:57 -0500 Attachment added: |
Originally by Jayesh Krishna on 2008-08-13 14:37:57 -0500 Hi, Uninstall MPICH2 on the hosts involved in your job.Manually delete the MPICH2 dlls from windows\system32 directory (Pleasebe careful! Make sure that you delete only mpich2_.dll & mpe_.dll) Re-install MPICH2 1.0.7 (stable version) on the hosts/nodes .Re-compile cpi.c/fpi.c and try running your job.Let us know the results. Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Ayer, Timothy C.): Hi Jayesh, Great to hear from you. I will try your suggestions (icpi.c and slow Also here is the output you requested. I have been wondering why the Thanks, C:\WINDOWS\system32>dir c:\windows\system32\mpe*.dll Directory of c:\windows\system32 04/04/2008 05:46 PM 135,168 mpe.dll C:\WINDOWS\system32> C:\WINDOWS\system32>dir dir c:\windows\system32\mpich2*.dll Directory of C:\WINDOWS\system32 Directory of C:\WINDOWS\system32 04/04/2008 05:28 PM 1,110,016 mpich2.dll -----Original Message----- ------------------------------------------------------------+-------------Reporter: "Ayer, Timothy C." timothy.ayer@pw.utc.com | ------------------------------------------------------------+-------------Comment (by Jayesh Krishna): Hi, Can you try compiling icpi.c (MPICH2\examples) and run the program inyour setup (Make sure that the problem is not related to fortran I have seen that some times that the uninstall/install of MPICH2 doesnot result in the dlls being updated correctly (This has lead to some
the Also when running fpi.exe using your setup try leaving the job (or maybe specify a timeout of 10 mins or so) for 10mins or so and see if it (PS: The MPICH2 1.1.0a1 release (http://www.mcs.anl.gov/research/projects/mpich2/downloads/index.php?s=dow Regards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Please find attached the output from the smpd -d procs. Also, the H:>mpiexec.exe -map v:\10.30.73.170\temp -hosts 2 10.30.73.170
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] The socket/channel connection between the MPI processes take place
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] The firewall has been disabled. The inputs were from me entering values for estimating pi...I wanted to I will send the other debug output a little later. Also, as an fyi, we have been running MPICH on thousands of PC's for
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Do you have windows firewall (or any firewall) running on thesemachines Why do I see two inputs (10 & 100) in the mpiexec debug output ?Can you send us the debug output of smpd along with mpiexec ?Can you check the status of the remote smpd from each host ?
(PS: I just tried running fpi.exe in a shared drive across two 32-bit Regards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] This is the same fpi.f which comes with the installation with the The setup is homogenous (both 32-bit). The output is attached. Thanks for your help. Tim
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Are you running fpi.exe (fpi.f) provided with MPICH2 (Have youmodified I am assuming that the setup is not heterogeneous (MPICH2 currentlydoes Please provide us with the debug/verbose output when running fpi.exe.Start smpd on both the machines in debug mode (1. Stop any instances of Regards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Thanks, here is the output (note: I have not included IP address or mpiexec.exe -map y:\IPAddressOf_hostA\temp -hosts 2 IPAddressOf_hostAIPAddressOf_hostB y:\fpi.exe OUTPUT: mpiexec.exe -map y:\IPAddressOf_hostA\temp hostnameXXXXXX (hostname of hostA)
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] The command hostname (c:\windows\system32\hostname.exe) Regards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] You have "hostname" at the end of the second line...what is that
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] What is the error message (output) that you get when you run mpiexec ? mpiexec.exe -map y:\IPAddressOf_hostA\temp -hosts 2 IPAddressOf_hostAIPAddressOf_hostB y:\fpi.exe mpiexec.exe -map y:\IPAddressOf_hostA\temp hostnameRegards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] No this does not work...the behavior is the same. The UNC's should/have
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] You should try, mpiexec.exe -map y:\hostA\temp -hosts 2 hostA hostB y:\fpi.exe Let us know if it works for you. (PS: The shared drive is accessible across machines because the drive is Regards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] The exe can be directly accessed from hostB by executing Note: I did try: mpiexec.exe -map y:\hostA\temp -hosts 2 hostA hostB The interesting part is that it gets through the initialization:
All execute. Thanks,
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] How (what mechanism) does hostB access data (exe) in hostA ? Regards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Thanks Jayesh for the quick reply. This is a network availabe UNC pathwhy do I need to map a drive? I am familiar with the machines file - I was just using the command line
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Hi, (PS: Instead of the "-hosts" option you could try using the Regards, -----------------------------------------------------------+------------ I am testing MPICH2 MPICH2-1.0.7 Windows XP (sp2). I have installed it The following tests work fine from hostA, both prompt for a number of mpiexec.exe -hosts 2 hostA hostA \hostA\temp\fpi.exe mpiexec.exe -hosts 2 hostB hostB \hostA\temp\fpi.exe The following test hangs when submitted from hostA (in MPI_Bcast). It mpiexec.exe -hosts 2 hostA hostB \hostA\temp\fpi.exe Any suggestions would be appreciated. Also let me know if you want me Thanks, Timothy C. Ayer
-- -- Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment: |
Originally by Jayesh Krishna on 2008-08-13 14:50:00 -0500 Hi, Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Jayesh Krishna): Hi, Uninstall MPICH2 on the hosts involved in your job.Manually delete the MPICH2 dlls from windows\system32 directory (Pleasebe careful! Make sure that you delete only mpich2_.dll & mpe_.dll) # Re-compile cpi.c/fpi.c and try running your job.Let us know the results. Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Ayer, Timothy C.): Hi Jayesh, Great to hear from you. I will try your suggestions (icpi.c and slow Also here is the output you requested. I have been wondering why the Thanks, C:\WINDOWS\system32>dir c:\windows\system32\mpe*.dll Directory of c:\windows\system32 04/04/2008 05:46 PM 135,168 mpe.dll C:\WINDOWS\system32> C:\WINDOWS\system32>dir dir c:\windows\system32\mpich2*.dll Directory of C:\WINDOWS\system32 Directory of C:\WINDOWS\system32 04/04/2008 05:28 PM 1,110,016 mpich2.dll -----Original Message----- ------------------------------------------------------------+-------------
Owner: ------------------------------------------------------------+-------------Comment (by Jayesh Krishna): Hi, Can you try compiling icpi.c (MPICH2\examples) and run the program inyour setup (Make sure that the problem is not related to fortran I have seen that some times that the uninstall/install of MPICH2 doesnot result in the dlls being updated correctly (This has lead to some
the Also when running fpi.exe using your setup try leaving the job (ormay (PS: The MPICH2 1.1.0a1 release (http://www.mcs.anl.gov/research/projects/mpich2/downloads/index.php?s=dow Regards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Please find attached the output from the smpd -d procs. Also, the H:>mpiexec.exe -map v:\10.30.73.170\temp -hosts 2 10.30.73.170
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] The socket/channel connection between the MPI processes take place
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] The firewall has been disabled. The inputs were from me entering values for estimating pi...I wanted to I will send the other debug output a little later. Also, as an fyi, we have been running MPICH on thousands of PC's for
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Do you have windows firewall (or any firewall) running on thesemachines Why do I see two inputs (10 & 100) in the mpiexec debug output ?Can you send us the debug output of smpd along with mpiexec ?Can you check the status of the remote smpd from each host ?
(PS: I just tried running fpi.exe in a shared drive across two 32-bit Regards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] This is the same fpi.f which comes with the installation with the The setup is homogenous (both 32-bit). The output is attached. Thanks for your help. Tim
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Are you running fpi.exe (fpi.f) provided with MPICH2 (Have youmodified I am assuming that the setup is not heterogeneous (MPICH2 currentlydoes Please provide us with the debug/verbose output when running fpi.exe.Start smpd on both the machines in debug mode (1. Stop any instances of Regards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Thanks, here is the output (note: I have not included IP address or mpiexec.exe -map y:\IPAddressOf_hostA\temp -hosts 2IPAddressOf_hostA OUTPUT: mpiexec.exe -map y:\IPAddressOf_hostA\temp hostnameXXXXXX (hostname of hostA)
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] The command hostname (c:\windows\system32\hostname.exe) Regards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] You have "hostname" at the end of the second line...what is that
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov]
that mpiexec.exe -map y:\IPAddressOf_hostA\temp -hosts 2IPAddressOf_hostA mpiexec.exe -map y:\IPAddressOf_hostA\temp hostnameRegards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] No this does not work...the behavior is the same. The UNC's
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] You should try, mpiexec.exe -map y:\hostA\temp -hosts 2 hostA hostB y:\fpi.exe
(PS: The shared drive is accessible across machines because the drive Regards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] The exe can be directly accessed from hostB by executing Note: I did try: mpiexec.exe -map y:\hostA\temp -hosts 2 hostA hostB The interesting part is that it gets through the initialization:
All execute. Thanks,
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] How (what mechanism) does hostB access data (exe) in hostA ? Regards,
From: Ayer, Timothy C. [mailto:timothy.ayer@pw.utc.com] Thanks Jayesh for the quick reply. This is a network availabe UNC pathwhy do I need to map a drive? I am familiar with the machines file - I was just using the command
From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov]
need (PS: Instead of the "-hosts" option you could try using the Regards, -----------------------------------------------------------+------------ -----------------------------------------------------------+------------
it
intervals, accept input, and produce and estimate of PI
<\hostA\temp\fpi.exe>
<\hostA\temp\fpi.exe>
does prompt for input (number of intervals) but once entered it hangs.
<\hostA\temp\fpi.exe>
me
-- -- -- Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment: |
Originally by Jayesh Krishna on 2008-08-13 14:50:00 -0500 Attachment added: |
Originally by Ayer, Timothy C. on 2008-08-13 14:53:03 -0500 That's a bummer I thought for sure that must be it....oh well. I will Thanks, -----Original Message----- ------------------------------------------------------------+--------------- ------------------------------------------------------------+--------------- Comment (by Jayesh Krishna): Hi, Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Jayesh Krishna): Hi, Uninstall MPICH2 on the hosts involved in your job.Manually delete the MPICH2 dlls from windows\system32 directory (Pleasebe careful! Make sure that you delete only mpich2_.dll & mpe_.dll) # Re-compile cpi.c/fpi.c and try running your job.
Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Ayer, Timothy C.): Hi Jayesh, Great to hear from you. I will try your suggestions (icpi.c and slow Also here is the output you requested. I have been wondering why the Thanks, C:\WINDOWS\system32>dir c:\windows\system32\mpe*.dll
04/04/2008 05:46 PM 135,168 mpe.dll C:\WINDOWS\system32> C:\WINDOWS\system32>dir dir c:\windows\system32\mpich2*.dll
04/04/2008 05:28 PM 1,110,016 mpich2.dll -----Original Message----- ------------------------------------------------------------+-------------
Owner: ------------------------------------------------------------+-------------Comment (by Jayesh Krishna):
does
the
may
(http://www.mcs.anl.gov/research/projects/mpich2/downloads/index.php?s=dow
output
during
the
machines
modified
IPAddressOf_hostA
referring
that
IPAddressOf_hostA
should/have
is
command
line
need
"-machinefile"
mpich2-bugs@mcs.anl.gov] -----------------------------------------------------------+------------ -----------------------------------------------------------+------------
it
I
me
-- -- -- Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment: |
Originally by Jayesh Krishna on 2008-08-13 15:01:56 -0500 Attachment added: |
Originally by Jayesh Krishna on 2008-08-13 15:01:56 -0500 Hi, Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Ayer, Timothy C.): That's a bummer I thought for sure that must be it....oh well. I will Thanks, -----Original Message----- ------------------------------------------------------------+-------------Reporter: "Ayer, Timothy C." timothy.ayer@pw.utc.com | ------------------------------------------------------------+-------------Comment (by Jayesh Krishna): Hi, Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Jayesh Krishna): Hi, Uninstall MPICH2 on the hosts involved in your job.Manually delete the MPICH2 dlls from windows\system32 directory(Please Re-compile cpi.c/fpi.c and try running your job.
Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Ayer, Timothy C.):
response).
dates on mpich2sshm.dll and mpich2sshmp.dll seem so old (from 2005)???
<<<<<<<<<<<<<<<<
[mailto:owner-mpich2-bugs@mcs.anl.gov] ------------------------------------------------------------+-------------
Owner: ------------------------------------------------------------+-------------
in
have
may
(http://www.mcs.anl.gov/research/projects/mpich2/downloads/index.php?s=dow
output
during
to
the
machines
modified
IPAddressOf_hostA
referring
?
IPAddressOf_hostA
should/have
is
command
hostB
path
line
need
"-machinefile"
mpich2-bugs@mcs.anl.gov] -----------------------------------------------------------+------------ -----------------------------------------------------------+------------
it
of
It
me
https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment: -- -- -- Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment: |
Originally by Ayer, Timothy C. on 2008-08-13 15:08:53 -0500 Will do. From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Hi, Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Ayer, Timothy C.): That's a bummer I thought for sure that must be it....oh well. I will Thanks, -----Original Message----- ------------------------------------------------------------+--------------- ------------------------------------------------------------+--------------- Comment (by Jayesh Krishna): Hi, Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Jayesh Krishna): Hi, Uninstall MPICH2 on the hosts involved in your job.Manually delete the MPICH2 dlls from windows\system32 directory(Please Re-compile cpi.c/fpi.c and try running your job.
Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Ayer, Timothy C.):
response).
dates on mpich2sshm.dll and mpich2sshmp.dll seem so old (from 2005)???
[mailto:owner-mpich2-bugs@mcs.anl.gov ------------------------------------------------------------+-------------
Owner: ------------------------------------------------------------+-------------
in
the
may
(http://www.mcs.anl.gov/research/projects/mpich2/downloads/index.php?s=dow
mailto:timothy.ayer@pw.utc.com ]
output
mailto:jayesh@mcs.anl.gov ]
during
mailto:timothy.ayer@pw.utc.com ]
to
the
mailto:jayesh@mcs.anl.gov ]
machines
mailto:timothy.ayer@pw.utc.com ]
mailto:jayesh@mcs.anl.gov ]
modified
mailto:timothy.ayer@pw.utc.com ]
IPAddressOf_hostA
mailto:jayesh@mcs.anl.gov ]
mailto:timothy.ayer@pw.utc.com ]
referring
mailto:jayesh@mcs.anl.gov ]
?
IPAddressOf_hostA
mailto:timothy.ayer@pw.utc.com ]
should/have
mailto:jayesh@mcs.anl.gov ]
is
mailto:timothy.ayer@pw.utc.com ]
command
hostB
mailto:jayesh@mcs.anl.gov ]
mailto:timothy.ayer@pw.utc.com ]
path
line
mailto:jayesh@mcs.anl.gov ]
need
"-machinefile"
mpich2-bugs@mcs.anl.gov] -----------------------------------------------------------+------------ -----------------------------------------------------------+------------
it
It
me
https://trac.mcs.anl.gov/projects/mpich2/ticket/36 >
https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment: -- -- -- Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment: |
Originally by Ayer, Timothy C. on 2008-08-13 15:08:54 -0500 Attachment added: |
Originally by Ayer, Timothy C. on 2008-09-11 12:08:42 -0500 Jayesh, I apologize for the delay. I hope to get back to this soon but other items Thanks, From: Ayer, Timothy C. Will do. From: Jayesh Krishna [mailto:jayesh@mcs.anl.gov] Hi, Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Ayer, Timothy C.): That's a bummer I thought for sure that must be it....oh well. I will Thanks, -----Original Message----- ------------------------------------------------------------+--------------- ------------------------------------------------------------+--------------- Comment (by Jayesh Krishna): Hi, Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Jayesh Krishna): Hi, Uninstall MPICH2 on the hosts involved in your job.Manually delete the MPICH2 dlls from windows\system32 directory(Please Re-compile cpi.c/fpi.c and try running your job.
Regards, -----Original Message----- ------------------------------------------------------------+----------- ------------------------------------------------------------+----------- Comment (by Ayer, Timothy C.):
response).
dates on mpich2sshm.dll and mpich2sshmp.dll seem so old (from 2005)???
[mailto:owner-mpich2-bugs@mcs.anl.gov ------------------------------------------------------------+-------------
Owner: ------------------------------------------------------------+-------------
in
the
may
(http://www.mcs.anl.gov/research/projects/mpich2/downloads/index.php?s=dow
mailto:timothy.ayer@pw.utc.com ]
output
mailto:jayesh@mcs.anl.gov ]
during
mailto:timothy.ayer@pw.utc.com ]
to
the
mailto:jayesh@mcs.anl.gov ]
machines
mailto:timothy.ayer@pw.utc.com ]
mailto:jayesh@mcs.anl.gov ]
modified
mailto:timothy.ayer@pw.utc.com ]
IPAddressOf_hostA
mailto:jayesh@mcs.anl.gov ]
mailto:timothy.ayer@pw.utc.com ]
referring
mailto:jayesh@mcs.anl.gov ]
?
IPAddressOf_hostA
mailto:timothy.ayer@pw.utc.com ]
should/have
mailto:jayesh@mcs.anl.gov ]
is
mailto:timothy.ayer@pw.utc.com ]
command
hostB
mailto:jayesh@mcs.anl.gov ]
mailto:timothy.ayer@pw.utc.com ]
path
line
mailto:jayesh@mcs.anl.gov ]
need
"-machinefile"
mpich2-bugs@mcs.anl.gov] -----------------------------------------------------------+------------ -----------------------------------------------------------+------------
it
It
me
https://trac.mcs.anl.gov/projects/mpich2/ticket/36 >
https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment: -- -- -- Ticket URL: https://trac.mcs.anl.gov/projects/mpich2/ticket/36#comment: |
Originally by Ayer, Timothy C. on 2008-09-11 12:08:42 -0500 Attachment added: |
Originally by Jayesh Krishna on 2008-10-23 16:24:38 -0500 Attachment added: |
Originally by Jayesh Krishna on 2008-10-23 16:24:38 -0500
|
Originally by Ayer, Timothy C. on 2008-10-27 11:49:11 -0500
|
Originally by Ayer, Timothy C. on 2008-10-27 11:49:11 -0500 Attachment added: |
Originally by jayesh on 2008-10-27 12:08:43 -0500 Closing the ticket for now - reopen when user provides more information -Jayesh |
Originally by "Ayer, Timothy C." timothy.ayer@pw.utc.com on 2008-08-04 09:32:36 -0500
I am testing MPICH2 MPICH2-1.0.7 Windows XP (sp2). I have installed it on 2
hosts (hostA, hostB) and trying to run the fpi.exe built with fmpich2.lib.
The code is hanging in a MPI_Bcast call. The fpi.exe source is attached.
The following tests work fine from hostA, both prompt for a number of
intervals, accept input, and produce and estimate of PI
mpiexec.exe -hosts 2 hostA hostA \hostA\temp\fpi.exe <\hostA\temp\fpi.exe>
mpiexec.exe -hosts 2 hostB hostB \hostA\temp\fpi.exe <\hostA\temp\fpi.exe>
The following test hangs when submitted from hostA (in MPI_Bcast). It does
prompt for input (number of intervals) but once entered it hangs. I have
launched the smpd process using smpd -d but see no output from the smpd
after I enter an interval value
mpiexec.exe -hosts 2 hostA hostB \hostA\temp\fpi.exe <\hostA\temp\fpi.exe>
Any suggestions would be appreciated. Also let me know if you want me to
send debug output.
Thanks,
Tim
Timothy C. Ayer
High Performance Technical Computing
United Technologies - Pratt & Whitney
timothy.ayer@pw.utc.com
(860) 565 - 5268 v
(860) 565 - 2668 f
<<fpi.f>>
The text was updated successfully, but these errors were encountered: