Updated test script and GitHub actions #96

Madu86 · 2021-02-28T05:32:00Z

I have implemented a dacdif based runtest script. This should fix issues #94 and #66. I have also updated the GitHub action files with necessary environment variables and creating artifacts.

vwcruzeiro · 2021-02-28T06:04:43Z

Hello @Madu86 . How are users supposed to run the tests now?

agoetz · 2021-02-28T09:39:21Z

tools/runtest

+    fi
+
+    # Check the accuracy
+    accuracy='4.0e-3'


This looks like a dangerously large threshold. Energies should probably agree up to 1.0e-6 Hartree and gradients to 1.0e-4 Hartree/Angstrom.

Different thresholds should also be used for dipole moments, Mulliken charges, and other molecular properties.

This looks like a dangerously large threshold. Energies should probably agree up to 1.0e-6 Hartree and gradients to 1.0e-4 Hartree/Angstrom.

@agoetz @vwcruzeiro How come this threshold is ok with Run.tests.amber then?

It really depends on the test. Most tests are supposed to test one thing - e.g. the printed energies (in kcal/mol) in the Amber output file along a trajectory. In other cases this is not possible. For instance, to test my DFTB3 implementation in AmberTools/test/sqm/dftb3 I test differently for single point calculations and geometry optimizations, and grep out stuff that is too sensitive from the geometry optimizations. That is far from ideal but the best I was able to do within the Amber testing framework and my limited time.

set SQM = $AMBERHOME/bin/sqm set SPDIFF = ../../dacdif set GODIFF = "../../dacdif -a 0.02" set SPTESTS = (h2o.sp cysdip.sp lignin.sp) set GOTESTS = (h2o.go cysdip.go) # single point tests foreach i ($SPTESTS) set input = $i.in set output = $i.out $SQM -O -i $input -o $output || goto error $SPDIFF $output.save $output end # geometry optimization tests # do not check the electronic energy and # core repulsion energy, they are too sensitive! # Also, there may be a different number of optimization steps # on different machines, hence remove xmin output # Also, do not check the final geometry since small numerical # differences on different platforms can result in different # Cartesian coordinates although the internal coordinates are OK foreach i ($GOTESTS) set input = $i.in set output = $i.out $SQM -O -i $input -o $output || goto error grep -v 'Electronic energy' $output > tmp grep -v 'Core-core repulsion' tmp > tmp2 grep -v 'DIPOLE' tmp2 > tmp grep -v 'xmin' tmp > tmp2 sed -e "/Final Structure/,/Calculation Completed/d" tmp2 > $output rm tmp tmp2 $GODIFF $output.save $output end exit(0) error: echo " ${0}: Program error"

Here is an example of HF based QM/MM MD with Orca (amber/test/qmmm_EXTERN/QMMM_MD_Orca/Run.aladip.hf_sto-3g)

../../dacdif -t 1 $output.save $output ../../dacdif -t 3 $restrt.save $restrt

This was after testing numerical differences on different platforms. I truncate the last digit in the Amber mdout file, leaving energies up to 0.001 kcal/mol ~= 1.e-6 Hartree. And truncate Cartesian coordinates to leave an accuracy of 1.e-4 Angstrom.

agoetz

Nice changes. But we need to address the major drawback of dacdif that I have always been complaining about: It employs a single numerical threshold for the entire output file while acceptable thresholds depend on the property and its units. I.e. we need different thresholds for (total) energies, gradients, (Mulliken) charges, dipole moments, etc.

The only way I can see how this can be done is to have different saved / output files for each property type. E.g. test-foo.energies, test-foo.gradients, test-foo.dipole, test-foo.popan etc. Alternatively a modified differ that applies different thresholds in different regions of the output file, with regions identified by regular expressions (ADF does something like that).

agoetz · 2021-02-28T09:51:15Z

Hello @Madu86 . How are users supposed to run the tests now?

After installing run make test or in the installation directory call runtest.

vwcruzeiro · 2021-02-28T23:14:09Z

@Madu86 , the goal in Run.tests.amber is to evaluate the output file as a whole. This is why a large threshold needs to be used in dacdif. This check should help us pick up major bugs (which is probably enough for Amber), but the main test suite in Quick should check energies and forces with a smaller threshold. My suggestion for you is the following: do exactly the same as I do in Run.tests.amber, but also test the total energy and gradients using different files and with a smaller threshold in dacdif. Perhaps we should modify Run.tests.amber to do that so we have a single test suite for Quick and Amber. I can help you with that. Just let me know.

Madu86 · 2021-03-01T23:07:23Z

@agoetz @vwcruzeiro I have improved the new test script based on your suggestions. We are now able to use different cutoffs for different tasks: https://github.com/Madu86/QUICK/blob/issue66/tools/runtest#L170-L217. Please feel free to play with them. I have also incorporated capability to run different types of tests: https://github.com/Madu86/QUICK/blob/issue66/tools/runtest#L79-L81. For eg. if you want to run only serial geometry optimization tests, you can do ./runtest --serial --opt.

Madu86 · 2021-03-01T23:08:43Z

the goal in Run.tests.amber is to evaluate the output file as a whole. This is why a large threshold needs to be used in dacdif. This check should help us pick up major bugs (which is probably enough for Amber), but the main test suite in Quick should check energies and forces with a smaller threshold. My suggestion for you is the following: do exactly the same as I do in Run.tests.amber, but also test the total energy and gradients using different files and with a smaller threshold in dacdif. Perhaps we should modify Run.tests.amber to do that so we have a single test suite for Quick and Amber. I can help you with that. Just let me know.

@vwcruzeiro I see. Thanks. Please check the latest version and let me know your suggestions.

Madu86 added 6 commits February 27, 2021 21:08

updated saved outputs

eaca964

implemented dacdiff based testing

683befb

silenced script execution

15c8951

added dependencies

f2e0c66

implemented setting environment variables and saving test outputs

f33ec65

cleaned up

c0cd590

Madu86 requested review from agoetz and vwcruzeiro February 28, 2021 05:32

Madu86 self-assigned this Feb 28, 2021

This was linked to issues Feb 28, 2021

QUICK tests fail for some GNU and Intel compiler versions #94

Closed

Use Run.tests.amber in GitHub checks #66

Closed

agoetz reviewed Feb 28, 2021

View reviewed changes

agoetz mentioned this pull request Feb 28, 2021

Test opt_wat_rhf_ccpvdz executable quick.cuda.MPI fails with Intel, OpenMPI #98

Closed

Madu86 added 13 commits March 1, 2021 10:07

added printing gradient section name

f3c16fd

updated saved tests

f990303

added different test options

1ea1c3e

added capability to run different tests

b80e410

implemented counting different test types and combined total

cf4e00f

implemented separate function to set files

6e372cb

implemented separate function to clean files

8e86e3e

implemented separate function for energy testing

7d83e39

added separate function for gradient checking

5c57d3f

added separate function for checking optimized geometry

4186549

implemented separate function for checking charges and dipoles

eb80f98

fixed a bug

889de1f

Merge branch 'master' into issue66

232a091

Madu86 added 3 commits March 1, 2021 11:48

placed an error trap for DO_PARALLEL

82ffbb8

disabled mpi runs for mp2

8813ff5

fixed a conditional statement

ce99f6c

Madu86 mentioned this pull request Mar 2, 2021

AWG: fix runscript so MPI codes are always tested with $DO_PARALLEL #101

Closed

Madu86 closed this Mar 2, 2021

Madu86 deleted the issue66 branch March 2, 2021 22:38

Madu86 mentioned this pull request Mar 2, 2021

Updated test script and GitHub actions - 2 #108

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updated test script and GitHub actions #96

Updated test script and GitHub actions #96

Madu86 commented Feb 28, 2021

vwcruzeiro commented Feb 28, 2021

agoetz Feb 28, 2021

agoetz Feb 28, 2021

Madu86 Feb 28, 2021

agoetz Feb 28, 2021

agoetz Feb 28, 2021

agoetz left a comment

agoetz commented Feb 28, 2021

vwcruzeiro commented Feb 28, 2021

Madu86 commented Mar 1, 2021

Madu86 commented Mar 1, 2021

Updated test script and GitHub actions #96

Updated test script and GitHub actions #96

Conversation

Madu86 commented Feb 28, 2021

vwcruzeiro commented Feb 28, 2021

agoetz Feb 28, 2021

Choose a reason for hiding this comment

agoetz Feb 28, 2021

Choose a reason for hiding this comment

Madu86 Feb 28, 2021

Choose a reason for hiding this comment

agoetz Feb 28, 2021

Choose a reason for hiding this comment

agoetz Feb 28, 2021

Choose a reason for hiding this comment

agoetz left a comment

Choose a reason for hiding this comment

agoetz commented Feb 28, 2021

vwcruzeiro commented Feb 28, 2021

Madu86 commented Mar 1, 2021

Madu86 commented Mar 1, 2021