make check fails on Power8 with gcc 5 #59

Open
pszi1ard opened this issue Mar 23, 2016 · 4 comments

Comments

@pszi1ard

$ ./configure --enable-float --enable-fma --enable-vsx && make -j96 && make check

Executing "/tmp/fftw-3.3.5/tests/bench --verbose=1 --verify 'ok5o11x4o10x6o10x11o11_4' --verify 'ik5o11x4o10x6o10x11o11_4' --verify '//obr9x5x24' --verify '//ofr9x5x24' --verify 'obr9x5x24' --verify 'ibr9x5x24' --verify 'ofr9x5x24' --verify 'ifr9x5x24' --verify '//obc9x5x24' --verify '//ibc9x5x24' --verify '//ofc9x5x24' --verify '//ifc9x5x24' --verify 'obc9x5x24' --verify 'ibc9x5x24' --verify 'ofc9x5x24' --verify 'ifc9x5x24' --verify 'ok12hx4hx13o01x13e10' --verify 'ik12hx4hx13o01x13e10' --verify 'obrd4x2x4x10v8' --verify 'ibrd4x2x4x10v8' --verify 'ofrd4x2x4x10v8' --verify 'ifrd4x2x4x10v8' --verify '//obcd4x2x4x10v8' --verify '//ibcd4x2x4x10v8' --verify '//ofcd4x2x4x10v8' --verify '//ifcd4x2x4x10v8' --verify 'obcd4x2x4x10v8' --verify 'ibcd4x2x4x10v8' --verify 'ofcd4x2x4x10v8' --verify 'ifcd4x2x4x10v8' --verify 'okd11088o11' --verify 'ikd11088o11' --verify 'obr8x4x8x4_6' --verify 'ibr8x4x8x4_6' --verify 'ofr8x4x8x4_6' --verify 'ifr8x4x8x4_6' --verify '//obc8x4x8x4_6' --verify '//ibc8x4x8x4_6' --verify '//ofc8x4x8x4_6' --verify '//ifc8x4x8x4_6' --verify 'obc8x4x8x4_6' --verify 'ibc8x4x8x4_6' --verify 'ofc8x4x8x4_6' --verify 'ifc8x4x8x4_6'"
ok5o11x4o10x6o10x11o11_4 2.13067e-07 3.24252e-06 2.67794e-07
ik5o11x4o10x6o10x11o11_4 2.16295e-07 3.14798e-06 2.42186e-07
//obr9x5x24 1.76873e-07 5.80388e-07 1.90459e-07
//ofr9x5x24 2.16065e-07 5.80388e-07 1.864e-07
Found relative error 3.978176e+13 (impulse 1)
[...]

Reproduced on Power8 with gcc 5.2 and 5.3; with gcc 4.8.4 and 4.9.1 the tests pass.
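To narrow down which problem string triggers the error, one option is to feed the strings from the log to bench one at a time instead of all in a single invocation. The sketch below is only an illustration: `first_failing_problem` is a hypothetical helper, not part of FFTW, and it assumes that bench exits with a non-zero status when a `--verify` run fails.

```shell
# Hypothetical helper (not part of FFTW): run each problem string through
# bench separately and print the first one that fails.
# Assumption: bench returns a non-zero exit status on a failed --verify.
first_failing_problem() {
    bench=$1
    shift
    for problem in "$@"; do
        # Same options as in the log above, but one problem per invocation.
        if ! "$bench" --verbose=1 --verify "$problem" >/dev/null 2>&1; then
            printf '%s\n' "$problem"
            return 0
        fi
    done
    return 1   # every problem passed
}
```

For the log above, a call could look like `first_failing_problem /tmp/fftw-3.3.5/tests/bench 'obr9x5x24' 'ibr9x5x24' 'ofr9x5x24' 'ifr9x5x24'`.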

@dolfim

dolfim commented Jun 7, 2017

I have the same problem with version 3.3.6-pl2.
I managed to avoid it by removing the --enable-vsx option from the double-precision build. In single precision I can leave the option in and everything works fine.

Has there been any insight into this issue?

@arwild01

I am also seeing this on 3.3.6-pl2, and I'm also using --enable-vsx with the double-precision build.

For my single-precision build I had used both --enable-altivec and --enable-vsx, and I had to remove --enable-altivec to get it to pass. As @dolfim said, I could leave --enable-vsx in and the single-precision build still passed.

@edmondac

For FFTW 3.3.8 on POWER9, I had to build without MPI (we normally use --enable-mpi) in order for make check to pass reliably. GCC 7.3.0 via EasyBuild.

Some info:

perl -w ../tests/check.pl --verbose --random --maxsize=10000 -c=10  --mpi "mpirun -np 3 `pwd`/mpi-bench"
Executing "mpirun -np 3 /dev/shm/build/edmondac-admin-3/FFTW/3.3.8/gompi-2018b/fftw-3.3.8/mpi/mpi-bench --verbose=1   --verify 'obr24x3' --verify 'ibr24x3' --verify 'ofr24x3' --verify 'ifr24x3' --verify 'obc24x3' --verify 'ibc24x3' --verify 'ofc24x3' --verify 'ifc24x3' --verify 'obc715' --verify 'ibc715' --verify 'ofc715' --verify 'ifc715' --verify 'ok6bx3o11x13h' --verify 'ik6bx3o11x13h' --verify 'ofr]12x8x11x12' --verify 'ifr]12x8x11x12' --verify 'obc]12x8x11x12' --verify 'ibc]12x8x11x12' --verify 'ofc]12x8x11x12' --verify 'ifc]12x8x11x12' --verify 'obr[13x12x7v4' --verify 'ibr[13x12x7v4' --verify 'obc[13x12x7v4' --verify 'ibc[13x12x7v4' --verify 'ofc[13x12x7v4' --verify 'ifc[13x12x7v4' --verify 'ok[10bx9o11x5o01x6h' --verify 'ik[10bx9o11x5o01x6h' --verify 'obrd9x2v10' --verify 'ibrd9x2v10' --verify 'ofrd9x2v10' --verify 'ifrd9x2v10' --verify 'obcd9x2v10' --verify 'ibcd9x2v10' --verify 'ofcd9x2v10' --verify 'ifcd9x2v10' --verify 'okd4o00x10bx5hx10hv1' --verify 'ikd4o00x10bx5hx10hv1' --verify 'ok]10o11x7o10x2o10v1' --verify 'ik]10o11x7o10x2o10v1'"
obr24x3 1.6452e-07 2.24783e-07 1.44539e-07
ibr24x3 1.66983e-07 2.24783e-07 1.53977e-07
ofr24x3 1.72034e-07 2.24783e-07 1.42717e-07
<SNIP>
ibrd9x2v10 1.61473e-07 1.85446e-07 1.8394e-07
ofrd9x2v10 1.94954e-07 1.85446e-07 9.63506e-08
Found relative error 2.452118e-01 (time shift)
       0   0.627020716667   0.000000000000     0.627020716667   0.000000000000
       1   0.963276386261   0.000000000000     0.963276386261   0.000000000000
<SNIP>
--------------------------------------------------------------------------
MPI_ABORT was invoked on rank 2 in communicator MPI_COMM_WORLD
with errorcode 1.

NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes.
You may or may not see output from other processes, depending on
exactly when Open MPI kills them.
--------------------------------------------------------------------------
[bear-pg0305u03a.bear.cluster:96003] 2 more processes have sent help message help-mpi-api.txt / mpi-abort
[bear-pg0305u03a.bear.cluster:96003] Set MCA parameter "orte_base_help_aggregate" to 0 to see all help / error messages
FAILED mpirun -np 3 /dev/shm/build/edmondac-admin-3/FFTW/3.3.8/gompi-2018b/fftw-3.3.8/mpi/mpi-bench:  --verify 'obr24x3' --verify 'ibr24x3' --verify 'ofr24x3' --verify 'ifr24x3' --verify 'obc24x3' --verify 'ibc24x3' --verify 'ofc24x3' --verify 'ifc24x3' --verify 'obc715' --verify 'ibc715' --verify 'ofc715' --verify 'ifc715' --verify 'ok6bx3o11x13h' --verify 'ik6bx3o11x13h' --verify 'ofr]12x8x11x12' --verify 'ifr]12x8x11x12' --verify 'obc]12x8x11x12' --verify 'ibc]12x8x11x12' --verify 'ofc]12x8x11x12' --verify 'ifc]12x8x11x12' --verify 'obr[13x12x7v4' --verify 'ibr[13x12x7v4' --verify 'obc[13x12x7v4' --verify 'ibc[13x12x7v4' --verify 'ofc[13x12x7v4' --verify 'ifc[13x12x7v4' --verify 'ok[10bx9o11x5o01x6h' --verify 'ik[10bx9o11x5o01x6h' --verify 'obrd9x2v10' --verify 'ibrd9x2v10' --verify 'ofrd9x2v10' --verify 'ifrd9x2v10' --verify 'obcd9x2v10' --verify 'ibcd9x2v10' --verify 'ofcd9x2v10' --verify 'ifcd9x2v10' --verify 'okd4o00x10bx5hx10hv1' --verify 'ikd4o00x10bx5hx10hv1' --verify 'ok]10o11x7o10x2o10v1' --verify 'ik]10o11x7o10x2o10v1'
make[3]: *** [check-local] Error 1
make[3]: Leaving directory `/dev/shm/build/edmondac-admin-3/FFTW/3.3.8/gompi-2018b/fftw-3.3.8/mpi'
make[2]: *** [check-am] Error 2
make[2]: Leaving directory `/dev/shm/build/edmondac-admin-3/FFTW/3.3.8/gompi-2018b/fftw-3.3.8/mpi'
make[1]: *** [check] Error 2
make[1]: Leaving directory `/dev/shm/build/edmondac-admin-3/FFTW/3.3.8/gompi-2018b/fftw-3.3.8/mpi'
make: *** [check-recursive] Error 1

@Flamefire

This is still an issue with FFTW 3.3.10:

./configure --prefix=/tmp/software/FFTW/3.3.10-GCC-11.3.0  --build=powerpc64le-unknown-linux-gnu  --host=powerpc64le-unknown-linux-gnu --enable-single --with-pic --enable-openmp --enable-shared --enable-threads --enable-vsx --enable-altivec
...
perl -w ./check.pl  -r -c=30 -v `pwd`/bench
...
Executing "/tmp/balex/FFTW/3.3.10/GCC-11.3.0/fftw-3.3.10/tests/bench --verbose=1   --verify 'ofc28' --verify 'ifc28' --verify 'ok10296e00' --verify 'ik10296e00' --verify 'obr8x6v7' --verify 'ibr8x6v7' --verify 'ofr8x6v7' --verify 'ifr8x6v7' --verify '//obc8x6v7' --verify '//ibc8x6v7' --verify '//ofc8x6v7' --verify '//ifc8x6v7' --verify 'obc8x6v7' --verify 'ibc8x6v7' --verify 'ofc8x6v7' --verify 'ifc8x6v7' --verify 'ok7e01x5e00x10e00v9' --verify 'ik7e01x5e00x10e00v9' --verify 'obrd6x6x9x4*6' --verify 'ibrd6x6x9x4*6' --verify 'ofrd6x6x9x4*6' --verify 'ifrd6x6x9x4*6' --verify '//obcd6x6x9x4*6' --verify '//ibcd6x6x9x4*6' --verify '//ofcd6x6x9x4*6' --verify '//ifcd6x6x9x4*6' --verify 'obcd6x6x9x4*6' --verify 'ibcd6x6x9x4*6' --verify 'ofcd6x6x9x4*6' --verify 'ifcd6x6x9x4*6' --verify 'okd24bx2o00x12e11' --verify 'ikd24bx2o00x12e11' --verify '//obr12x8x11' --verify '//ofr12x8x11' --verify 'obr12x8x11' --verify 'ibr12x8x11' --verify 'ofr12x8x11' --verify 'ifr12x8x11' --verify '//obc12x8x11' --verify '//ibc12x8x11' --verify '//ofc12x8x11' --verify '//ifc12x8x11' --verify 'obc12x8x11' --verify 'ibc12x8x11'"
Found relative error 7.304246e+32 (impulse)

Workaround is (still) to pass --disable-altivec to the FFTW single-precision configure.
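The workarounds reported in the later comments can be collected in a small shell sketch. `workaround_flags` is a hypothetical helper written for illustration only. It encodes what those commenters saw pass, not a verified fix: the original report failed in single precision with --enable-float --enable-fma --enable-vsx under gcc 5, so results may differ by compiler and FFTW version. The double case prints nothing because FFTW builds double precision by default and the reported workaround there was to drop --enable-vsx.

```shell
# Hypothetical helper: print the precision/SIMD configure flags that
# commenters in this thread reported as passing "make check" on POWER.
# These are reports from the thread, not a guaranteed fix.
workaround_flags() {
    case "$1" in
        single)
            # Keep VSX, turn AltiVec off (reported for 3.3.6-pl2 and 3.3.10).
            echo "--enable-single --enable-vsx --disable-altivec"
            ;;
        double)
            # Double precision is the default; VSX is left out (3.3.6-pl2).
            echo ""
            ;;
        *)
            echo "usage: workaround_flags single|double" >&2
            return 1
            ;;
    esac
}
```

A build could then be configured with something like `./configure $(workaround_flags single) && make && make check`.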
