improve QuantumESPRESSO easyblock by cleaning up and extending configure step + running test suite #3241

Crivella · 2024-03-01T16:18:11Z

This PR, opened in relation to issue #3234, changes the behavior of the QuantumESPRESSO easyblock in order to make sure the correct compilation flags are used for the majority of versions from 5.x to 7.x.

Also the inclusion of flags has been modularized in order to be easier to mange

The new easyblock has been tested by compiling several QE versions, starting from the latest available intel and foss recipes and using the --try-software-version flag to install different software.
When a valid version of the HDF5 LibXC or ELPA libraries was not readily available it has been manually disabled from the original config file.
In details:

ELPA and LibXC have not been used for QE<7.x (They are available but would need to recompile a different version in order to test)
HDF5 have not been used for QE < 5.2.1 (Support was experimental from 5.0 but in my case i was obtaining segfaults when HDF5 functions were being invoked).

NOTE: Due to a problem with compiling and running QE with the intel toolchain + openmp, openmp was disabled when using the intel2022b

Below the results of a small reframe test (see PR) used to check that all codes are able to reach the end without any errors or segfaults.
(The version was scaled down only to check that all codes reach the JOB DONE line, considering how small this calculation was, the timings themself are not very significative)

… set of flags

easybuild/easyblocks/q/quantumespresso.py

…strings

Co-authored-by: ocaisa <alan.ocais@cecam.org>

…uild-easyblocks into feature-improve_qe_eblock

easybuild/easyblocks/q/quantumespresso.py

Co-authored-by: ocaisa <alan.ocais@cecam.org>

easybuild/easyblocks/q/quantumespresso.py

Co-authored-by: ocaisa <alan.ocais@cecam.org>

Crivella · 2024-03-09T12:54:02Z

@ocaisa Just checked those failures in the bot run. Apparently QE implemented the NPROCS for the test_suite in 7.2 and not 7.0. Before that only run with -parallel/-serial are present.
Added a fix for that

boegelbot · 2024-03-09T17:23:17Z

Test report by @boegelbot

Overview of tested easyconfigs (in order)

SUCCESS QuantumESPRESSO-6.8-foss-2021a.eb
SUCCESS QuantumESPRESSO-6.8-foss-2021b.eb
SUCCESS QuantumESPRESSO-6.8-intel-2021a.eb
FAIL (build issue) QuantumESPRESSO-7.0-foss-2021b.eb (partial log available at https://gist.github.com/boegelbot/5d71d64b40ca7463e21386d2ec3f7c2c)
FAIL (build issue) QuantumESPRESSO-7.0-intel-2021b.eb (partial log available at https://gist.github.com/boegelbot/04bd9c0792734265b1c970681fcb2a2c)
FAIL (build issue) QuantumESPRESSO-7.1-foss-2022a.eb (partial log available at https://gist.github.com/boegelbot/06f6b673da286b20c76d59637cb39050)
FAIL (build issue) QuantumESPRESSO-7.1-intel-2022a.eb (partial log available at https://gist.github.com/boegelbot/23ad7769394e7b1b304cf34cce44fc03)

Build succeeded for 3 out of 7 (7 easyconfigs in total)
cns1 - Linux Rocky Linux 8.9, x86_64, Intel(R) Xeon(R) CPU E5-2667 v3 @ 3.20GHz (haswell), Python 3.6.8
See https://gist.github.com/boegelbot/b9c6f520ca24b3ccca205f450fbb5a18 for a full test report.

ocaisa · 2024-03-09T19:20:02Z

@boegelbot please test @ generoso
EB_ARGS=" QuantumESPRESSO-6.8-foss-2021a.eb QuantumESPRESSO-6.8-foss-2021b.eb QuantumESPRESSO-6.8-intel-2021a.eb QuantumESPRESSO-7.0-foss-2021b.eb QuantumESPRESSO-7.0-intel-2021b.eb QuantumESPRESSO-7.1-foss-2022a.eb QuantumESPRESSO-7.1-intel-2022a.eb "

boegelbot · 2024-03-09T19:20:06Z

@ocaisa: Request for testing this PR well received on login1

PR test command 'EB_PR=3241 EB_ARGS=" QuantumESPRESSO-6.8-foss-2021a.eb QuantumESPRESSO-6.8-foss-2021b.eb QuantumESPRESSO-6.8-intel-2021a.eb QuantumESPRESSO-7.0-foss-2021b.eb QuantumESPRESSO-7.0-intel-2021b.eb QuantumESPRESSO-7.1-foss-2022a.eb QuantumESPRESSO-7.1-intel-2022a.eb " EB_CONTAINER= EB_REPO=easybuild-easyblocks /opt/software/slurm/bin/sbatch --job-name test_PR_3241 --ntasks=4 ~/boegelbot/eb_from_pr_upload_generoso.sh' executed!

exit code: 0
output:

Submitted batch job 13062

Test results coming soon (I hope)...

- notification for comment with ID 1986953955 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

ocaisa · 2024-03-11T11:06:46Z

@boegelbot please test @ jsc-zen3
EB_ARGS=" QuantumESPRESSO-6.8-foss-2021a.eb QuantumESPRESSO-6.8-foss-2021b.eb QuantumESPRESSO-6.8-intel-2021a.eb QuantumESPRESSO-7.0-foss-2021b.eb QuantumESPRESSO-7.0-intel-2021b.eb QuantumESPRESSO-7.1-foss-2022a.eb QuantumESPRESSO-7.1-intel-2022a.eb "

boegelbot · 2024-03-11T11:13:08Z

@ocaisa: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=3241 EB_ARGS=" QuantumESPRESSO-6.8-foss-2021a.eb QuantumESPRESSO-6.8-foss-2021b.eb QuantumESPRESSO-6.8-intel-2021a.eb QuantumESPRESSO-7.0-foss-2021b.eb QuantumESPRESSO-7.0-intel-2021b.eb QuantumESPRESSO-7.1-foss-2022a.eb QuantumESPRESSO-7.1-intel-2022a.eb " EB_REPO=easybuild-easyblocks EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_3241 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

exit code: 0
output:

Submitted batch job 3742

Test results coming soon (I hope)...

- notification for comment with ID 1988182263 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

boegelbot · 2024-03-11T22:42:49Z

Test report by @boegelbot

Overview of tested easyconfigs (in order)

SUCCESS QuantumESPRESSO-6.8-foss-2021b.eb
SUCCESS QuantumESPRESSO-6.8-intel-2021a.eb
SUCCESS QuantumESPRESSO-7.0-foss-2021b.eb
SUCCESS QuantumESPRESSO-7.1-foss-2022a.eb
SUCCESS QuantumESPRESSO-7.1-intel-2022a.eb
SUCCESS ELPA-2021.05.001-foss-2021a.eb
SUCCESS QuantumESPRESSO-6.8-foss-2021a.eb
FAIL (build issue) libxc-5.1.6-intel-compilers-2021.4.0.eb (partial log available at https://gist.github.com/boegelbot/ce50894b2d91f19735629a56e6c0287d)
FAIL (build issue) impi-2021.4.0-intel-compilers-2021.4.0.eb (partial log available at https://gist.github.com/boegelbot/13e6ce6e94c077220f055a5436fbb460)
FAIL (build issue) iimpi-2021b.eb (partial log available at https://gist.github.com/boegelbot/5030355127ab3fc2c2daef92be8fac35)
FAIL (build issue) HDF5-1.12.1-iimpi-2021b.eb (partial log available at https://gist.github.com/boegelbot/9ca4b062cd0e5074ae8642dad60a41c6)
FAIL (build issue) imkl-FFTW-2021.4.0-iimpi-2021b.eb (partial log available at https://gist.github.com/boegelbot/e2d7a90c497a8a0a2fb7559f7d1d1858)
FAIL (build issue) intel-2021b.eb (partial log available at https://gist.github.com/boegelbot/cb73f04ceec7ed67adf6a4b19125f499)
FAIL (build issue) ELPA-2021.05.001-intel-2021b.eb (partial log available at https://gist.github.com/boegelbot/e94be8f0a575bb17d458604bbe923465)
FAIL (build issue) QuantumESPRESSO-7.0-intel-2021b.eb (partial log available at https://gist.github.com/boegelbot/c6b1f537a536ef1f8985c5f759e574aa)

Build succeeded for 7 out of 15 (7 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.3, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.18
See https://gist.github.com/boegelbot/1e2ada209d0c8f375f067c8ba8f07475 for a full test report.

ocaisa · 2024-03-12T08:39:19Z

@Crivella I've got passing tests almost everywhere now (here and in easybuilders/easybuild-easyconfigs#20070), apart from an issue with the software stack for version 7.0 (which has nothing to do with this PR). I'll do a final review now and then merge

ocaisa · 2024-03-12T08:42:59Z

easybuild/easyblocks/q/quantumespresso.py

+                'ph_ahc_diam',  # Test detects a ! as an energy in baseline
+                'tddfpt_magnons_fe',  # Too strict thresholds
+            ], "List of test suite targets that are allowed to fail (name can partially match)", CUSTOM],
+            'test_suite_threshold': [0.97, "Threshold for test suite success rate", CUSTOM],


I'm still not that comfortable with giving a % threshold here, I'd prefer to give a specific number (with the default being zero). The we explicitly number the failures in the easyconfig, expecting it to be version (and perhaps toolchain) specific. Pytorch uses

'max_failed_tests': [0, "Maximum number of failing tests", CUSTOM],

With all the testing I've done, I have the exact numbers for almost everything already, I just need to dig them out. We can then add them to the easyconfigs and do a final rerun of the builds. If we do a rerun of builds, do you think we should add the fast-math as well to see how we do?

With the falkyness of some tests failing just because the absolute/relative errors are slightly higher than the thresholds sets in the baselines, i think this could be tricky.
What I am mostly worried about is that if that number is not carefully curated we might be missing some segfaults that could arise.
This is why before i added what should be the flaky test to test_suite_allow_failures and raised (removed in commit f182aea) if a test not in that list failed (most likely it would fail because the calculation does not actually finish)

With all the testing I've done, I have the exact numbers for almost everything already, I just need to dig them out. We can then add them to the easyconfigs and do a final rerun of the builds. If we do a rerun of builds, do you think we should add the fast-math as well to see how we do?

Sure we could try. This night i just finished testing an easyconfig for 7.3 (will open a PR for it soon) with the option. I think it should work for all versions just by adding

'extra_cflags': '-ffast-math', 'extra_fflags': '-ffast-math', 'extra_fcflags': '-ffast-math', 'extra_f90flags': '-ffast-math',

to the toolchain options

With the falkyness of some tests failing just because the absolute/relative errors are slightly higher than the thresholds sets in the baselines, i think this could be tricky. What I am mostly worried about is that if that number is not carefully curated we might be missing some segfaults that could arise. This is why before i added what should be the flaky test to test_suite_allow_failures and raised (removed in commit f182aea) if a test not in that list failed (most likely it would fail because the calculation does not actually finish)

But i guess we can use the failures array to check the number of failures without the ignored ones.
I would still leave the threshold on the total number (without the ignored ones) though as the ignored tests including relax actually could be excluding a non trivial number of tests failures

boegel · 2024-03-12T11:50:46Z

@Crivella I've got passing tests almost everywhere now (here and in easybuilders/easybuild-easyconfigs#20070), apart from an issue with the software stack for version 7.0 (which has nothing to do with this PR). I'll do a final review now and then merge

Keep in mind that jsc-zen3 is running Rocky 9.x, so the problems you're seeing there probably just means that the older Intel compilers are not compatible with the glibc in Rocky 9.x

easybuild/easyblocks/q/quantumespresso.py

ocaisa · 2024-03-15T13:09:31Z

@boegelbot please test @ jsc-zen3
EB_ARGS=" QuantumESPRESSO-6.8-foss-2021a.eb QuantumESPRESSO-6.8-foss-2021b.eb QuantumESPRESSO-6.8-intel-2021a.eb QuantumESPRESSO-7.0-foss-2021b.eb QuantumESPRESSO-7.1-foss-2022a.eb QuantumESPRESSO-7.1-intel-2022a.eb "

boegelbot · 2024-03-15T13:13:08Z

@ocaisa: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=3241 EB_ARGS=" QuantumESPRESSO-6.8-foss-2021a.eb QuantumESPRESSO-6.8-foss-2021b.eb QuantumESPRESSO-6.8-intel-2021a.eb QuantumESPRESSO-7.0-foss-2021b.eb QuantumESPRESSO-7.1-foss-2022a.eb QuantumESPRESSO-7.1-intel-2022a.eb " EB_REPO=easybuild-easyblocks EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_3241 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

exit code: 0
output:

Submitted batch job 3783

Test results coming soon (I hope)...

- notification for comment with ID 1999632000 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

boegelbot · 2024-03-16T00:27:39Z

Test report by @boegelbot

Overview of tested easyconfigs (in order)

SUCCESS QuantumESPRESSO-6.8-foss-2021a.eb
SUCCESS QuantumESPRESSO-6.8-foss-2021b.eb
FAIL (build issue) QuantumESPRESSO-6.8-intel-2021a.eb (partial log available at https://gist.github.com/boegelbot/dcf06f366a2b8ebde4b0ec5718436f24)
FAIL (build issue) QuantumESPRESSO-7.0-foss-2021b.eb (partial log available at https://gist.github.com/boegelbot/edf1a6a3731459d2b3a9eaa7dce30af3)
SUCCESS QuantumESPRESSO-7.1-foss-2022a.eb
FAIL (build issue) QuantumESPRESSO-7.1-intel-2022a.eb (partial log available at https://gist.github.com/boegelbot/097cde076925ee6f26f70c6acd0804a0)

Build succeeded for 3 out of 6 (6 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.3, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.18
See https://gist.github.com/boegelbot/d294a007eb6c6d4c15dafd780818c7bd for a full test report.

ocaisa · 2024-03-18T21:26:39Z

@boegelbot please test @ jsc-zen3
EB_ARGS=" QuantumESPRESSO-6.8-foss-2021a.eb QuantumESPRESSO-6.8-foss-2021b.eb QuantumESPRESSO-7.1-foss-2022a.eb "

boegelbot · 2024-03-18T21:33:08Z

@ocaisa: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=3241 EB_ARGS=" QuantumESPRESSO-6.8-foss-2021a.eb QuantumESPRESSO-6.8-foss-2021b.eb QuantumESPRESSO-7.1-foss-2022a.eb " EB_REPO=easybuild-easyblocks EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_3241 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

exit code: 0
output:

Submitted batch job 3794

Test results coming soon (I hope)...

- notification for comment with ID 2005027818 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

boegelbot · 2024-03-19T02:49:59Z

Test report by @boegelbot

Overview of tested easyconfigs (in order)

SUCCESS QuantumESPRESSO-6.8-foss-2021a.eb
SUCCESS QuantumESPRESSO-6.8-foss-2021b.eb
SUCCESS QuantumESPRESSO-7.1-foss-2022a.eb

Build succeeded for 3 out of 3 (3 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.3, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.18
See https://gist.github.com/boegelbot/1ee57e8e66d8074a40f5e4b6d054e9e5 for a full test report.

ocaisa · 2024-03-19T15:01:14Z

This has been extensively tested here and via easybuilders/easybuild-easyconfigs#20070 thanks for all the effort @Crivella

Crivella added 3 commits February 29, 2024 17:04

Modularizing QE easyblock, and making sure every version uses correct…

6d255dc

… set of flags

Adjusted and tested eb with most versions of QE from 5.x to 7.x

ea27b0c

linting

0805ae4

ocaisa requested changes Mar 4, 2024

View reviewed changes

Crivella added 3 commits March 4, 2024 18:34

Using only one variable for dflags/repls/external_libs and removed f-…

888f966

…strings

Removed inline returns

2115fef

Removed superfluos comments

4727bef

Crivella force-pushed the feature-improve_qe_eblock branch from 30dd9fc to 4727bef Compare March 4, 2024 17:50

Crivella and others added 6 commits March 4, 2024 19:00

Removed redundant extra_libs

d64c520

Update easybuild/easyblocks/q/quantumespresso.py

25eea48

Co-authored-by: ocaisa <alan.ocais@cecam.org>

More fixes

96ededc

Merge branch 'feature-improve_qe_eblock' of github.com:Crivella/easyb…

48e6406

…uild-easyblocks into feature-improve_qe_eblock

Fixed line length

8c4bd0d

Missed f-strings

03693db

ocaisa self-assigned this Mar 5, 2024

ocaisa added the bug fix label Mar 5, 2024

ocaisa added this to the release after 4.9.0 milestone Mar 5, 2024

Crivella added 2 commits March 6, 2024 14:35

Added running QE test-suite in test_step

a4c79f3

linting

b67a3c7

ocaisa requested changes Mar 6, 2024

View reviewed changes

easybuild/easyblocks/q/quantumespresso.py Outdated Show resolved Hide resolved

easybuild/easyblocks/q/quantumespresso.py Outdated Show resolved Hide resolved

easybuild/easyblocks/q/quantumespresso.py Outdated Show resolved Hide resolved

Crivella and others added 3 commits March 6, 2024 15:45

Update easybuild/easyblocks/q/quantumespresso.py

730592b

Co-authored-by: ocaisa <alan.ocais@cecam.org>

Update easybuild/easyblocks/q/quantumespresso.py

d09af58

Co-authored-by: ocaisa <alan.ocais@cecam.org>

Update easybuild/easyblocks/q/quantumespresso.py

2344c12

Co-authored-by: ocaisa <alan.ocais@cecam.org>

ocaisa reviewed Mar 6, 2024

View reviewed changes

easybuild/easyblocks/q/quantumespresso.py Outdated Show resolved Hide resolved

Crivella and others added 6 commits March 6, 2024 16:47

Update easybuild/easyblocks/q/quantumespresso.py

59ce6d2

Co-authored-by: ocaisa <alan.ocais@cecam.org>

Fixed compatibility with libxc

10fab21

Linting + fix for intel compilation with wannier 90

a3a7a4a

Fixed needed for running test-suite on qe<7.0

2edc824

Made installation of FoX gipaw and wannier90 customizable

3b4bc09

Made installation of EPW customizable + check on QE version >= 6.0

4ebbf52

ocaisa requested changes Mar 12, 2024

View reviewed changes

Crivella added 2 commits March 12, 2024 10:14

Fixed wrong if logic

d349900

Added maximum number of failures on non ignored tests

42b8310

Crivella mentioned this pull request Mar 13, 2024

{chem}[foss/2023a,intel/2023a] QuantumESPRESSO v7.3 easybuilders/easybuild-easyconfigs#20105

Merged

Fixed wrong name

abc63b6

ocaisa requested changes Mar 15, 2024

View reviewed changes

easybuild/easyblocks/q/quantumespresso.py Outdated Show resolved Hide resolved

Update easybuild/easyblocks/q/quantumespresso.py

7ad0303

Crivella mentioned this pull request Mar 15, 2024

{2023.06}[foss/2023a] QuantumESPRESSO 7.3 EESSI/software-layer#504

Closed

ocaisa approved these changes Mar 19, 2024

View reviewed changes

ocaisa merged commit 2bc9e04 into easybuilders:develop Mar 19, 2024
47 checks passed

Crivella deleted the feature-improve_qe_eblock branch March 19, 2024 15:05

Crivella mentioned this pull request Mar 21, 2024

Old flags used in the compilation of Quantum ESPRESSO #3234

Closed

migueldiascosta added the enhancement label Apr 4, 2024

boegel changed the title ~~Feature improve QuantumESPRESSO easyblock~~ improve QuantumESPRESSO easyblock by cleaning up and extending configure step + running test suite Apr 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve QuantumESPRESSO easyblock by cleaning up and extending configure step + running test suite #3241

improve QuantumESPRESSO easyblock by cleaning up and extending configure step + running test suite #3241

Crivella commented Mar 1, 2024 •

edited

Crivella commented Mar 9, 2024

boegelbot commented Mar 9, 2024

ocaisa commented Mar 9, 2024

boegelbot commented Mar 9, 2024

ocaisa commented Mar 11, 2024

boegelbot commented Mar 11, 2024

boegelbot commented Mar 11, 2024

ocaisa commented Mar 12, 2024

ocaisa Mar 12, 2024

ocaisa Mar 12, 2024

Crivella Mar 12, 2024

Crivella Mar 12, 2024 •

edited

Crivella Mar 12, 2024

boegel commented Mar 12, 2024

ocaisa commented Mar 15, 2024

boegelbot commented Mar 15, 2024

boegelbot commented Mar 16, 2024

ocaisa commented Mar 18, 2024

boegelbot commented Mar 18, 2024

boegelbot commented Mar 19, 2024

ocaisa commented Mar 19, 2024 •

edited

improve QuantumESPRESSO easyblock by cleaning up and extending configure step + running test suite #3241

improve QuantumESPRESSO easyblock by cleaning up and extending configure step + running test suite #3241

Conversation

Crivella commented Mar 1, 2024 • edited

Crivella commented Mar 9, 2024

boegelbot commented Mar 9, 2024

Overview of tested easyconfigs (in order)

ocaisa commented Mar 9, 2024

boegelbot commented Mar 9, 2024

ocaisa commented Mar 11, 2024

boegelbot commented Mar 11, 2024

boegelbot commented Mar 11, 2024

Overview of tested easyconfigs (in order)

ocaisa commented Mar 12, 2024

ocaisa Mar 12, 2024

Choose a reason for hiding this comment

ocaisa Mar 12, 2024

Choose a reason for hiding this comment

Crivella Mar 12, 2024

Choose a reason for hiding this comment

Crivella Mar 12, 2024 • edited

Choose a reason for hiding this comment

Crivella Mar 12, 2024

Choose a reason for hiding this comment

boegel commented Mar 12, 2024

ocaisa commented Mar 15, 2024

boegelbot commented Mar 15, 2024

boegelbot commented Mar 16, 2024

Overview of tested easyconfigs (in order)

ocaisa commented Mar 18, 2024

boegelbot commented Mar 18, 2024

boegelbot commented Mar 19, 2024

Overview of tested easyconfigs (in order)

ocaisa commented Mar 19, 2024 • edited

Crivella commented Mar 1, 2024 •

edited

Crivella Mar 12, 2024 •

edited

ocaisa commented Mar 19, 2024 •

edited