Test: make CI test fails with warnings #4493

WHUweiqingzhou · 2024-06-26T07:26:20Z

The List of Changes

Update to commit 494fca4, there are 58 warnings in github CI tests:

Warning: NG]    58 test cases out of 1227 failed.
1: 101_PW_15_lowz
1: 101_PW_15_paw
1: 101_PW_blps_pseudopots
1: 101_PW_upf201_uspp_NaCl
1: 101_PW_Coulomb
1: 102_PW_BPCG
1: 103_PW_CF_CS_S1_smallg
1: 103_PW_CF_CS_S2_smallg
1: 104_PW_NC_magnetic
1: 105_PW_FD_smearing
1: 105_PW_GA_smearing
1: 107_PW_outWfcR
1: 108_PW_RE
1: 108_PW_RE_MB
1: 109_PW_CR_fix_a
1: 109_PW_CR_fix_ab
1: 109_PW_CR_fix_abc
1: 109_PW_CR_fix_ac
1: 109_PW_CR_fix_b
1: 109_PW_CR_fix_bc
1: 109_PW_CR_fix_c
1: 109_PW_CR_moveatoms
1: 110_PW_SY_symmetry_LiRh
1: 111_PW_elec_add
1: 111_PW_elec_minus
1: 111_PW_S2_elec_add
1: 111_PW_S2_elec_minus
1: 115_PW_sol_H2
1: 116_PW_scan_Si2_nspin2
1: 120_PW_KP_MD_NPT
1: 120_PW_KP_MD_NVT
1: 133_PW_DJ_PK
1: 150_PW_15_CR_VDW3
1: 170_PW_MD_1O
1: 170_PW_MD_2O
1: 180_PW_SDFT_10S_M
1: 180_PW_SDFT_10S_P
1: 181_PW_SDFT_5D10S
1: 182_PW_SDFT_ALL
1: 183_PW_MD_SDFT_10S
1: 183_PW_MD_SDFT_5D10S
1: 183_PW_MD_SDFT_ALL
1: 184_PW_BNDPAR_SDFT_10S
1: 184_PW_BNDPAR_SDFT_5D10S
1: 184_PW_KPAR_SDFT_ALL
1: 185_PW_SDFT_10D10S_METHD2
1: 185_PW_SDFT_10S_METHD2
1: 186_PW_SDOS_10D10S
1: 186_PW_SNLKG_10D10S
1: 250_NO_KP_CR_VDW3ABC
1: 281_NO_KP_HSE
1: 286_NO_KP_CR_HSE
1: 384_NO_GO_S1_HSE_loop0_PU
1: 601_NO_TDDFT_H2_restart
1: 601_NO_TDDFT_CO
1: 601_NO_TDDFT_CO_occ
1: 803_PW_LT_bcc
1: 806_PW_LT_st

and 3 warnings for GPU test:

102_PW_CG_GPU
102_PW_DA_davidson_GPU
102_PW_BPCG_GPU

this PR add threshold file in these 58+3 files.
Modify Autotest.sh and make CI test fail once fatal error or warning happen. (Before this PR, CI test fails only when fatal error occurs.)

Please Notice that

After this PR, warnings are also unaccepted as same as the fatal errors. Due to the potential numerical instability inherent to ABACUS, modifications might lead to some unrelated numerical fluctuations, and lead to warning. If a PR fails due to warning. I will suggest that PR should be reviewed by at least 3 developers. If all reviewers think PR is OK. You can update corresponding references and make your PR pass.
PR test: support reading threshold from file 'threshold' in integrate test #4450 from @pxlxingliang enable one can prepare a threshold in each cases directory. For numerically unstable test cases—those that yield different results on different runs, one can establish a tolerance level to relax the error conditions by editing threshold file.

…gs occur

WHUweiqingzhou · 2024-06-26T10:17:25Z

While updating the thresholds for 58 warnings, I encountered a serious bug in the current Autotest workflow. The current mechanism for determining warnings and fatal errors is flawed. For instance, let's take an example that has three indices: energy, force, and stress.

The correct logic should be: if none of the three indices is a fatal error, and there is at least one warning, then it should be classified as a warning; however, if any one of the indices is a fatal error, then the whole test should be classified as a fatal error.

But the current logic is: as soon as any index is deemed as a warning, the workflow exits immediately without checking the subsequent indices, even though they may be completely wrong.

In reality, we actually have 21 force indices with fatal errors hidden behind a warning for the total energy index:

Warning: NG]    21 test cases out of 1405 failed.
1: 101_PW_15_paw
1: 108_PW_RE
1: 109_PW_CR_fix_a
1: 109_PW_CR_fix_ab
1: 109_PW_CR_fix_abc
1: 109_PW_CR_fix_ac
1: 109_PW_CR_fix_b
1: 109_PW_CR_fix_bc
1: 109_PW_CR_fix_c
1: 109_PW_CR_moveatoms
1: 111_PW_S2_elec_add
1: 116_PW_scan_Si2_nspin2
1: 120_PW_KP_MD_NPT
1: 120_PW_KP_MD_NVT
1: 170_PW_MD_1O
1: 170_PW_MD_2O
1: 180_PW_SDFT_10S_M
1: 181_PW_SDFT_5D10S
1: 183_PW_MD_SDFT_ALL
1: 184_PW_BNDPAR_SDFT_10S
1: 184_PW_BNDPAR_SDFT_5D10S

WHUweiqingzhou · 2024-07-03T02:33:45Z

I find #4503 may induce 3 new warnings:

802_PW_LT_fcc
807_PW_LT_bct
810_PW_LT_fco

@jinzx10 could you have a look?
Anyway, I will add threshold for these 3 cases to pass CI tests.

So current changed file is 66: 2(Autotest.sh) + 58 + 3 + 3

jinzx10 · 2024-07-03T07:46:29Z

I find #4503 may induce 3 new warnings:
802_PW_LT_fcc
807_PW_LT_bct
810_PW_LT_fco
@jinzx10 could you have a look? Anyway, I will add threshold for these 3 cases to pass CI tests.

So current changed file is 66: 2(Autotest.sh) + 58 + 3 + 3

It turns out that the failure of stress comparison has been a long-standing problem muted in the past. I traced all the way back to #3675 (version update to v3.5.4) and the comparison already failed back then. To nail down the specific PR that introduces the problem, I think we need to resort to developers that are familiar with the recent changes in the stress calculation in PW.

I think the PR that first exposed this problem was #4496 :

#4496 was merged on July 1. Four other PRs were merged that day afterwards, but their checks were done prior to the merge of #4496 , therefore no warnings from the stress test were displayed in their respective workflows.

add threshold in 58 warning tests and make autotest fails once warnin…

3642faf

…gs occur

pxlxingliang mentioned this pull request Jun 26, 2024

test: modify to check all properties even one property is reach warning threshold in Autotest.sh #4496

Merged

4 tasks

WHUweiqingzhou added 2 commits July 2, 2024 14:20

resolve the Conflicting files

dcdb9f7

change threshold for 28 cases

d178264

WHUweiqingzhou and others added 2 commits July 3, 2024 11:17

add threshold in 802_PW_LT_fcc/807_PW_LT_bct/810_PW_LT_fco

fbb8431

Merge branch 'develop' into ci_test

c6a9542

WHUweiqingzhou requested a review from pxlxingliang July 3, 2024 04:27

pxlxingliang approved these changes Jul 3, 2024

View reviewed changes

WHUweiqingzhou self-assigned this Jul 3, 2024

mohanchen merged commit 0503eb7 into deepmodeling:develop Jul 3, 2024
13 checks passed

WHUweiqingzhou deleted the ci_test branch July 4, 2024 01:54

This was referenced Jul 4, 2024

In the CI test some warning get different value between github test #3414

Closed

Question: why there are always plenty of warnings in integrated test? #4187

Closed

Tests: We need update some reference data of integrated tests #4615

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test: make CI test fails with warnings #4493

Test: make CI test fails with warnings #4493

WHUweiqingzhou commented Jun 26, 2024

WHUweiqingzhou commented Jun 26, 2024

WHUweiqingzhou commented Jul 3, 2024 •

edited

Loading

jinzx10 commented Jul 3, 2024 •

edited

Loading

Test: make CI test fails with warnings #4493

Test: make CI test fails with warnings #4493

Conversation

WHUweiqingzhou commented Jun 26, 2024

The List of Changes

Please Notice that

WHUweiqingzhou commented Jun 26, 2024

WHUweiqingzhou commented Jul 3, 2024 • edited Loading

jinzx10 commented Jul 3, 2024 • edited Loading

WHUweiqingzhou commented Jul 3, 2024 •

edited

Loading

jinzx10 commented Jul 3, 2024 •

edited

Loading