make -j$(nproc) ARCH=minimal VERSION=sdbg test failed about cp2k@master on openEuler20.03_aarch64 #2455

Closed
Tom-python0121 opened this issue Dec 15, 2022 · 7 comments

Comments

@Tom-python0121

Tom-python0121 commented Dec 15, 2022

Hello, I have run into a problem: make -j$(nproc) ARCH=minimal VERSION=sdbg test fails for cp2k@master on openEuler20.03_aarch64.

Steps to reproduce the issue

[root@localhost cp2k]# make -j126 ARCH=minimal VERSION=sdbg test
……
>>> /home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/regtesting/TEST-minimal-sdbg-2022-12-15_20-22-40/QS/regtest-loc_powf
    run.inp                                                                                          -    TIMED OUT ( 401.92 sec)
<<< /home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/regtesting/TEST-minimal-sdbg-2022-12-15_20-22-40/QS/regtest-loc_powf (1 of 196) done in 401.92 sec
>>> /home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/regtesting/TEST-minimal-sdbg-2022-12-15_20-22-40/QS/regtest-negf
    li_chain.inp                                                                                     -    TIMED OUT ( 402.10 sec)
<<< /home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/regtesting/TEST-minimal-sdbg-2022-12-15_20-22-40/QS/regtest-negf (2 of 196) done in 402.10 sec
>>> /home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/regtesting/TEST-minimal-sdbg-2022-12-15_20-22-40/QS/regtest-plus_u
    H2O-rks-diag.inp                                                                                 - RUNTIME FAIL (  24.27 sec)
    H2O-rks-otcg.inp                                                                                 - RUNTIME FAIL (  23.40 sec)
    H2O-uks-diag.inp                                                                                 - RUNTIME FAIL (  24.12 sec)
    H2O-uks-otcg.inp                                                                                 - RUNTIME FAIL (  24.97 sec)
    H2O-rks-diag-mulliken.inp                                                                        - RUNTIME FAIL (  28.36 sec)
    H2O-rks-otcg-mulliken.inp                                                                        - RUNTIME FAIL (  28.38 sec)
    H2O-uks-diag-mulliken.inp                                                                        - RUNTIME FAIL (  23.93 sec)
    H2O-uks-otcg-mulliken.inp                                                                        - RUNTIME FAIL (  28.73 sec)
    H2O-rks-diag-lowdin.inp                                                                          - RUNTIME FAIL (  28.53 sec)
    H2O-rks-otcg-lowdin.inp                                                                          - RUNTIME FAIL (  23.80 sec)
    H2O-uks-diag-lowdin.inp                                                                          - RUNTIME FAIL (  28.63 sec)
    H2O-uks-otcg-lowdin.inp                                                                          - RUNTIME FAIL (  28.65 sec)
    H2O-rks-u_ramping.inp                                                                            - RUNTIME FAIL (  28.79 sec)
    H2O-uks-u_ramping.inp                                                                            - RUNTIME FAIL (  24.97 sec)
    H2O-rks-u_ramping_reset.inp                                                                      - RUNTIME FAIL (  28.63 sec)
    H2O-uks-u_ramping_reset.inp                                                                      - RUNTIME FAIL (  23.60 sec)
<<< /home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/regtesting/TEST-minimal-sdbg-2022-12-15_20-22-40/QS/regtest-plus_u (3 of 196) done in 421.76 sec
>>> /home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/regtesting/TEST-minimal-sdbg-2022-12-15_20-22-40/UNIT
    dbt_unittest                                                                                     -           OK ( 261.02 sec)
    dbt_tas_unittest                                                                                 -           OK (  50.17 sec)
    grid_unittest                                                                                    -           OK (  36.79 sec)
    memory_utilities_unittest                                                                        -           OK (   0.03 sec)
    nequip_unittest                                                                                  -           OK (   0.03 sec)
    parallel_rng_types_unittest                                                                      -           OK (   4.62 sec)
    libcp2k_unittest                                                                                 -           OK ( 109.91 sec)

log detail:
test-suilte.log

Can you tell me how to solve this problem?

@mkrack
Member

mkrack commented Dec 15, 2022 via email

@Tom-python0121
Author

@mkrack This command does not work; it reports an error.

Steps to reproduce the issue

========== Make (sdbg) ==========
make --version
GNU Make 4.3
Built for aarch64-openEuler-linux-gnu
Copyright (C) 1988-2020 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.

========= Python (sdbg) =========
/usr/bin/env python3 --version
Python 3.8.8
----------------------- External Modules ---------------------------------
DBCSR Version: 2.4.1 (2022-08-29)
---------------------------- Modules -------------------------------------
Module system not installed.
usage: do_regtest.py [-h] [--mpiranks MPIRANKS] [--ompthreads OMPTHREADS] [--maxtasks MAXTASKS] [--num_gpus NUM_GPUS] [--timeout TIMEOUT] [--maxerrors MAXERRORS] [--mpiexec MPIEXEC] [--keepalive]
                     [--debug] [--restrictdir RESTRICTDIR] [--skipdir SKIPDIR] [--workbasedir WORKBASEDIR]
                     arch version
do_regtest.py: error: unrecognized arguments: ”—timeout=900”
make[3]: *** [/home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/Makefile:256: test] Error 2
make[2]: *** [/home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/Makefile:151: test] Error 2
make[1]: *** [/home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/Makefile:128: sdbg] Error 2
make: *** [Makefile:123: test] Error 2
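
Note on the error above: the argument that do_regtest.py echoes back appears to contain typographic quotation marks and an em dash where ASCII double quotes and a double hyphen (--) would be expected, which would explain why it is not recognized as the --timeout option. A minimal sketch for spotting such characters before running a command, assuming GNU grep; the helper name has_non_ascii is made up for this example:

```shell
# Hypothetical helper: succeeds if the given string contains any byte
# outside printable ASCII (0x20-0x7E).
has_non_ascii() {
    # LC_ALL=C makes the bracket range byte-based.
    printf '%s' "$1" | LC_ALL=C grep -q '[^ -~]'
}

# The argument as echoed in the error message above.
if has_non_ascii '”—timeout=900”'; then
    echo "non-ASCII characters found: retype the quotes and dashes by hand"
fi

# The plain ASCII spelling passes the check.
if ! has_non_ascii '--timeout=900'; then
    echo "plain ASCII: ok"
fi
```

This only flags the problem; the fix is to retype the option with ASCII characters rather than pasting it from an email or a rendered web page.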

@mkrack
Member

mkrack commented Dec 16, 2022

As hinted by the usage description of do_regtest.py, try
make -j ARCH=minimal VERSION=sdbg TESTOPTS+=”--timeout 900” test
instead.

@Tom-python0121
Author

@mkrack That does not work either:
[root@localhost spack-src]# make -j ARCH=minimal VERSION=sdbg TESTOPTS+=”--timeout 900” test
Discovering programs ...
make: *** No rule to make target '900”'. Stop.
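
Note on the message above: the target name that make complains about ends in a typographic closing quote. The shell does not treat that character as a quotation mark, so the space inside the option is not protected and make receives 900” as a separate word, which it then takes for a target. A small sketch of the word splitting, using a made-up helper (count_args) that only reports how many arguments it was given:

```shell
# Hypothetical helper: print the number of arguments received.
count_args() { echo "$#"; }

# Typographic quotes are ordinary characters, so the space splits the option.
count_args TESTOPTS+=”--timeout 900” test   # prints 3

# ASCII double quotes keep the option together as one word.
count_args TESTOPTS+="--timeout 900" test   # prints 2
```

With ASCII quotes, make sees the whole assignment as one word and test as the only target.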

@alazzaro
Member

alazzaro commented Dec 16, 2022

@mkrack It's no use [root@localhost spack-src]# make -j ARCH=minimal VERSION=sdbg TESTOPTS+=”--timeout 900” test Discovering programs ... make: *** No rule to make target '900”'. Stop.

I don't use "+=". Could you try this instead:

make -j ARCH=minimal VERSION=sdbg TESTOPTS="--timeout 900" test

@Tom-python0121
Author

@alazzaro The TIMED OUT test case problem still occurs.

Steps to reproduce the issue

[root@localhost cp2k]# make -j ARCH=minimal VERSION=sdbg TESTOPTS="--timeout 900" test
……
>>> /home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/regtesting/TEST-minimal-sdbg-2022-12-16_15-11-52/UNIT
    dbt_unittest                                                                                     -           OK ( 250.21 sec)
    dbt_tas_unittest                                                                                 -           OK (  50.80 sec)
    grid_unittest                                                                                    -           OK (  35.45 sec)
    memory_utilities_unittest                                                                        -           OK (   0.03 sec)
    nequip_unittest                                                                                  -           OK (   0.03 sec)
    parallel_rng_types_unittest                                                                      -           OK (   4.61 sec)
    libcp2k_unittest                                                                                 -           OK ( 112.19 sec)
<<< /home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/regtesting/TEST-minimal-sdbg-2022-12-16_15-11-52/UNIT (1 of 196) done in 453.32 sec
>>> /home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/regtesting/TEST-minimal-sdbg-2022-12-16_15-11-52/QS/regtest-negf
    li_chain.inp                                                                                     -    TIMED OUT ( 900.65 sec)
<<< /home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/regtesting/TEST-minimal-sdbg-2022-12-16_15-11-52/QS/regtest-negf (2 of 196) done in 900.65 sec
>>> /home/stage/root/spack-stage-cp2k-2022.1-eao7ioaielv3adrtxqpvqgxmu6pgfhz6/cp2k/regtesting/TEST-minimal-sdbg-2022-12-16_15-11-52/QS/regtest-loc_powf
    run.inp                                                                                          -    TIMED OUT ( 901.06 sec)

Same as the log above.

@alazzaro
Member

We can compare your timings with the CP2K dashboard report at https://storage.googleapis.com/cp2k-ci/dashboard_minimal_report.txt
For instance, your libcp2k_unittest takes 112.19 s, while the CP2K dashboard reports 2.29 s.
For the test li_chain.inp, the dashboard reports 271.07 s. So my conclusion is that something is making your execution very slow. Could it be that you are oversubscribing the cores? There is an option --maxtasks=2 that you can pass in TESTOPTS=, i.e. TESTOPTS="--maxtasks=2 --timeout 900"
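
To put the comparison in numbers and assemble the suggested command, here is a minimal sketch. The timings are the two values quoted above, and --maxtasks and --timeout are the flags listed in the do_regtest.py usage text earlier in the thread. The script prints the command rather than running it, and it assumes nproc (GNU coreutils) and awk are available:

```shell
# Timings quoted in the comment above, in seconds.
yours=112.19
dashboard=2.29

# Ratio of the reported run time to the dashboard run time.
awk -v a="$yours" -v b="$dashboard" \
    'BEGIN { printf "libcp2k_unittest slowdown: %.0fx\n", a / b }'

# Number of cores on this machine, for comparison with the job count.
echo "cores available: $(nproc)"

# Options suggested above, written with ASCII quotes.
TESTOPTS="--maxtasks=2 --timeout 900"

# Print the command instead of running it, so it can be checked first.
echo "make -j ARCH=minimal VERSION=sdbg TESTOPTS=\"${TESTOPTS}\" test"
```

If the timings move much closer to the dashboard values with a small --maxtasks, that would support the oversubscription explanation; if they do not, the slowdown likely has another cause.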