Background information
What version of Open MPI are you using? (e.g., v1.10.3, v2.1.0, git branch name and hash, etc.)
4.1.2
Describe how Open MPI was installed (e.g., from a source/distribution tarball, from a git clone, from an operating system distribution package, etc.)
Built from source, following these steps:
Downloaded Open MPI 4.1.2 from https://www.open-mpi.org/software/ompi/v4.1/
cd openmpi-4.1.2
./configure --prefix=/home/software/huyxii/openmpi-4.1.2
make
make install
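The steps above do not show how the shell environment was set up after `make install`. A minimal sketch of that step follows, assuming the prefix from the `./configure` line; `use_ompi_prefix` is a hypothetical helper, not part of Open MPI. Note that the backtrace later in this report loads `libopen-pal.so.40` from `/home/huyxii/software/openmpi-4.1.2/lib`, so the prefix may need adjusting to match the real install location.

```shell
# Hypothetical helper (not part of Open MPI): put one install prefix first
# on PATH and LD_LIBRARY_PATH so mpirun/mpiexec and the libraries match.
use_ompi_prefix() {
    prefix="$1"
    PATH="$prefix/bin${PATH:+:$PATH}"
    LD_LIBRARY_PATH="$prefix/lib${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}"
    export PATH LD_LIBRARY_PATH
}

# Assumption: this is the prefix passed to ./configure above.
use_ompi_prefix /home/software/huyxii/openmpi-4.1.2

# Show which launcher the shell now resolves (prints a note if none is found).
command -v mpirun || echo "mpirun not found on PATH"
```

If `command -v mpirun` prints a path outside the chosen prefix, the launcher and the libraries may come from different installations.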
Please describe the system on which you are running
Operating system/version: CentOS Linux release 7.4.1708 (Core)
Details of the problem
mpirun runs successfully with hello_c from the examples directory:
mpirun -np 5 hello_c
Hello, world, I am 3 of 5, (Open MPI v4.1.2, package: Open MPI huyxii@mu01 Distribution, ident: 4.1.2, repo rev: v4.1.2, Nov 24, 2021, 106)
Hello, world, I am 0 of 5, (Open MPI v4.1.2, package: Open MPI huyxii@mu01 Distribution, ident: 4.1.2, repo rev: v4.1.2, Nov 24, 2021, 106)
Hello, world, I am 1 of 5, (Open MPI v4.1.2, package: Open MPI huyxii@mu01 Distribution, ident: 4.1.2, repo rev: v4.1.2, Nov 24, 2021, 106)
Hello, world, I am 2 of 5, (Open MPI v4.1.2, package: Open MPI huyxii@mu01 Distribution, ident: 4.1.2, repo rev: v4.1.2, Nov 24, 2021, 106)
Hello, world, I am 4 of 5, (Open MPI v4.1.2, package: Open MPI huyxii@mu01 Distribution, ident: 4.1.2, repo rev: v4.1.2, Nov 24, 2021, 106)
When I run some other programs with mpiexec, I get errors like this:
mpiexec -np 5 maker -base Fn_Male maker_bopts.ctl maker_exe.ctl maker_opts.ctl --ignore_nfs_tmp
STATUS: Parsing control files...
STATUS: Processing and indexing input FASTA files...
[mu01:30587] *** Process received signal ***
[mu01:30587] Signal: Segmentation fault (11)
[mu01:30587] Signal code: Address not mapped (1)
[mu01:30587] Failing at address: 0x4b0
[mu01:30587] [ 0] /lib64/libpthread.so.0(+0xf6d0)[0x2b89f23636d0]
[mu01:30587] [ 1] /usr/lib64/perl5/CORE/libperl.so(Perl_csighandler+0x22)[0x2b89f12f3042]
[mu01:30587] [ 2] /lib64/libpthread.so.0(+0xf6d0)[0x2b89f23636d0]
[mu01:30587] [ 3] /lib64/libc.so.6(__poll+0x2d)[0x2b89f2663f0d]
[mu01:30587] [ 4] /home/huyxii/software/openmpi-4.1.2/lib/libopen-pal.so.40(+0x8a3a8)[0x2b89fce5d3a8]
[mu01:30587] [ 5] /home/huyxii/software/openmpi-4.1.2/lib/libopen-pal.so.40(opal_libevent2022_event_base_loop+0x196)[0x2b89fce53e76]
[mu01:30587] [ 6] /home/huyxii/software/openmpi-4.1.2/lib/libopen-pal.so.40(+0x3cfbe)[0x2b89fce0ffbe]
[mu01:30587] [ 7] /lib64/libpthread.so.0(+0x7e25)[0x2b89f235be25]
[mu01:30587] [ 8] /lib64/libc.so.6(clone+0x6d)[0x2b89f266ebad]
[mu01:30587] *** End of error message ***
Primary job terminated normally, but 1 process returned
a non-zero exit code. Per user-direction, the job has been aborted.
[mu01:30592] *** Process received signal ***
[mu01:30592] Signal: Segmentation fault (11)
[mu01:30592] Signal code: Address not mapped (1)
[mu01:30592] Failing at address: 0x4b0
[mu01:30592] [ 0] /lib64/libpthread.so.0(+0xf6d0)[0x2b9c0e3996d0]
[mu01:30592] [ 1] /usr/lib64/perl5/CORE/libperl.so(Perl_csighandler+0x22)[0x2b9c0d329042]
[mu01:30592] [ 2] /lib64/libpthread.so.0(+0xf6d0)[0x2b9c0e3996d0]
[mu01:30592] [ 3] /usr/lib64/perl5/CORE/libperl.so(Perl_csighandler+0x0)[0x2b9c0d329020]
[mu01:30592] [ 4] /lib64/libpthread.so.0(+0xf6d0)[0x2b9c0e3996d0]
[mu01:30592] [ 5] /lib64/libc.so.6(__poll+0x2d)[0x2b9c0e699f0d]
[mu01:30592] [ 6] /home/huyxii/software/openmpi-4.1.2/lib/libopen-pal.so.40(+0x8a3a8)[0x2b9c18e933a8]
[mu01:30592] [ 7] /home/huyxii/software/openmpi-4.1.2/lib/libopen-pal.so.40(opal_libevent2022_event_base_loop+0x196)[0x2b9c18e89e76]
[mu01:30592] [ 8] /home/huyxii/software/openmpi-4.1.2/lib/libopen-pal.so.40(+0x3cfbe)[0x2b9c18e45fbe]
[mu01:30592] [ 9] /lib64/libpthread.so.0(+0x7e25)[0x2b9c0e391e25]
[mu01:30592] [10] /lib64/libc.so.6(clone+0x6d)[0x2b9c0e6a4bad]
[mu01:30592] *** End of error message ***
SIGTERM received
SIGTERM received
SIGTERM received
mpiexec noticed that process rank 1 with PID 0 on node mu01 exited on signal 11 (Segmentation fault).
Thanks in advance for any help.