Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problem with compiling on NERSC #623

Open
anhph opened this issue Sep 21, 2020 · 6 comments
Open

Problem with compiling on NERSC #623

anhph opened this issue Sep 21, 2020 · 6 comments

Comments

@anhph
Copy link

anhph commented Sep 21, 2020

Hi,

I receive this error when compiling on NERSC Cori:

compilation aborted for ALPSCore/gf/test/fourier_test.cpp (code 2)
make[2]: *** [gf/test/CMakeFiles/fourier_test.dir/build.make:63: gf/test/CMakeFiles/fourier_test.dir/fourier_test.cpp.o] Error 2
make[1]: *** [CMakeFiles/Makefile2:5722: gf/test/CMakeFiles/fourier_test.dir/all] Error 2
make: *** [Makefile:141: all] Error 2

Any help would be appreciated

@egull
Copy link
Collaborator

egull commented Sep 21, 2020 via email

@anhph
Copy link
Author

anhph commented Sep 21, 2020

Here is my make VERBOSE=1. This is where the error occurs

[ 81%] Building CXX object gf/test/CMakeFiles/fourier_test.dir/fourier_test.cpp.o
cd /global/homes/a/axp561/ALPSCore/build/gf/test && /opt/cray/pe/craype/2.6.2/bin/CC -I/global/homes/a/axp561/ALPSCore/build/hdf5/include -I/global/homes/a/axp561/ALPSCore/hdf5/include -I/global/homes/a/axp561/ALPSCore/build/utilities/include -I/global/homes/a/axp561/ALPSCore/utilities/include -isystem /global/homes/a/axp561/ALPSCore/common/deps/gtest-1.7.0/include -I/global/homes/a/axp561/ALPSCore/gf/include -I/global/homes/a/axp561/ALPSCore/build/gf/include -I/global/homes/a/axp561/eigen -O2 -g -DNDEBUG -DBOOST_DISABLE_ASSERTS -std=c++11 -o CMakeFiles/fourier_test.dir/fourier_test.cpp.o -c /global/homes/a/axp561/ALPSCore/gf/test/fourier_test.cpp
/global/homes/a/axp561/eigen/Eigen/src/Core/CwiseUnaryView.h(113): error: expression must be an lvalue or a function designator

@iskakoff
Copy link
Collaborator

Hi,

I receive this error when compiling on NERSC Cori:

compilation aborted for ALPSCore/gf/test/fourier_test.cpp (code 2)
make[2]: *** [gf/test/CMakeFiles/fourier_test.dir/build.make:63: gf/test/CMakeFiles/fourier_test.dir/fourier_test.cpp.o] Error 2
make[1]: *** [CMakeFiles/Makefile2:5722: gf/test/CMakeFiles/fourier_test.dir/all] Error 2
make: *** [Makefile:141: all] Error 2

Any help would be appreciated

what is your cmake flags and so on?
I've just rebuilt ALPSCore on Cori and everything went well. Here are the modules I have loaded:

Currently Loaded Modulefiles:
  1) modules/3.2.11.4                                  8) udreg/2.3.2-7.0.1.1_3.36__g8175d3d.ari           15) dvs/2.12_2.2.157-7.0.1.1_9.4__g083131db          22) craype-hugepages2M
  2) altd/2.0                                          9) ugni/6.0.14.0-7.0.1.1_7.38__ge78e5b0.ari         16) alps/6.6.58-7.0.1.1_6.8__g437d88db.ari           23) cray-hdf5/1.10.5.2
  3) darshan/3.1.7                                    10) pmi/5.0.14                                       17) rca/2.2.20-7.0.1.1_4.51__g8e3fb5b.ari            24) cmake/3.14.4
  4) intel/19.0.3.199                                 11) dmapp/7.1.1-7.0.1.1_4.52__g38cf134.ari           18) atp/2.1.3                                        25) cray-fftw/3.3.8.4
  5) craype-network-aries                             12) gni-headers/5.0.12.0-7.0.1.1_6.31__g3b1768f.ari  19) PrgEnv-intel/6.0.5                               26) python3/3.7-anaconda-2019.10
  6) craype/2.6.2                                     13) xpmem/2.2.20-7.0.1.1_4.13__g0475745.ari          20) craype-haswell
  7) cray-libsci/19.06.1                              14) job/2.2.4-7.0.1.1_3.40__g36b56f4.ari             21) cray-mpich/7.7.10

There is one failed test, but this is known problem on intel compilers:

The following tests FAILED:
	 59 - binop_with_scalar (Failed)

@anhph
Copy link
Author

anhph commented Sep 23, 2020

Here are my modules

  1. modules/3.2.11.4
  2. altd/2.0
  3. darshan/3.1.7
  4. intel/19.0.3.199
  5. craype-network-aries
  6. craype/2.6.2
  7. cray-libsci/19.06.1
  8. udreg/2.3.2-7.0.1.1_3.36__g8175d3d.ari
  9. ugni/6.0.14.0-7.0.1.1_7.38__ge78e5b0.ari
  10. pmi/5.0.14
  11. dmapp/7.1.1-7.0.1.1_4.52__g38cf134.ari
  12. gni-headers/5.0.12.0-7.0.1.1_6.31__g3b1768f.ari
  13. xpmem/2.2.20-7.0.1.1_4.13__g0475745.ari
  14. job/2.2.4-7.0.1.1_3.40__g36b56f4.ari
  15. dvs/2.12_2.2.157-7.0.1.1_9.4__g083131db
  16. alps/6.6.58-7.0.1.1_6.8__g437d88db.ari
  17. rca/2.2.20-7.0.1.1_4.51__g8e3fb5b.ari
  18. atp/2.1.3
  19. PrgEnv-intel/6.0.5
  20. craype-haswell
  21. cray-mpich/7.7.10
  22. craype-hugepages2M
  23. python/3.7-anaconda-2019.10
  24. cmake/3.14.4
  25. boost/1.70.0
  26. cray-hdf5-parallel/1.10.5.2
  27. cray-fftw/3.3.8.4

Here is my Cmake:

cmake .. -DEIGEN3_INCLUDE_DIR=/global/homes/a/axp561/.conda/envs/eigen/include/eigen3 -DCMAKE_INSTALL_PREFIX=/global/homes/a/axp561/ALPSCore_install -DBOOST_ROOT=/usr/common/software/boost/1.70.0/intel/haswell -DCMAKE_CXX_STANDARD=14

But I get a lot of errors, all related to mpi

 12 - mpi_utils_env (Child aborted)
 13 - mpi_utils_comm (Child aborted)
 14 - mpi_utils_bcast (Child aborted)
 15 - mpi_utils_bcast_optional (Child aborted)
 16 - mpi_utils_reduce (Child aborted)
 17 - mpi_utils_env_exceptions1 (Child aborted)
 18 - mpi_utils_env_exceptions2 (Child aborted)
 59 - binop_with_scalar (Failed)
 68 - mpi_merge (Child aborted)
 69 - mpi_merge_uneven (Child aborted)
 70 - repeated_merge (Child aborted)
 71 - zero_vector_mpi (Child aborted)
 91 - serialize_variant_mpi (Child aborted)
 92 - dictionary_bcast_mpi (Child aborted)
 93 - params_bcast_mpi (Child aborted)
 98 - sum_mpi (Child aborted)
 99 - reduce (Child aborted)
100 - signed_obs (Child aborted)
101 - custom_scheduler (Child aborted)
102 - reduce_unavailable_results (Child aborted)
119 - one_index_gf_test_mpi (Child aborted)
120 - four_index_gf_test_mpi (Child aborted)
121 - four_index_gf_test_mismatched_mpi (Child aborted)
122 - four_index_gf_test_mismatched-tail_mpi (Child aborted)
123 - multiarray_bcast_mpi (Child aborted)
124 - mesh_test_mpi (Child aborted)
125 - gf_new_test_mpi (Child aborted)
126 - gf_new_tail_test_mpi (Child aborted)
135 - mpi_twogauss (Child aborted)

Do I need to explicitly set the CC compiler directory for this ?

@galexv
Copy link
Collaborator

galexv commented Sep 24, 2020

Judging by the fact that compilation itself succeeds and only the MPI tests fail, it seems to be the same issue as #211 (feature request #296), addressed by (not yet merged) PR #613 .

I believe PR #613 does not affect the business logic (it changes only CMake scripts), so it could be safely merged.

@anhph You could try to apply that PR to your local repository and see if it works for you; the PR description shows how to use the new features on Cori.

@iskakoff
Copy link
Collaborator

@anhph to run MPI tests on Cori you need to run interactive job, any attempt to call MPI_Init on login node will fail immediately
@galexv I think @anhph run whatever tests compiled but anhph still has at least fourier_test that fails to compile.
I will take more close look today or tomorrow on fourier_test whether I can reproduce it or not.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants