Refactor: use less memory and optimize performance to calculate force and stress in pw base #4047

dyzheng · 2024-04-23T09:52:07Z

I have refactored stress code structure in this PR.
In case Mg16Al16, the memory cost of stress calculation from 16752 MB to 194 MB.

Linked Issue

Close #3714
Close #4158
Close #3710
Close #4026
Close #3931
Close #4031

Unit Tests and/or Case Tests for my changes

A unit test is added for each new feature or bug fix.

What's changed?

Example: My changes might affect the performance of the application under certain conditions, and I have tested the impact on various scenarios...

Any changes of core modules? (ignore if not applicable)

Example: I have added a new virtual function in the esolver base class in order to ...

… pw_stress

Qianruipku · 2024-04-24T02:12:18Z

Has the efficiency of the new algorithm been tested? Are there any test data available?

dyzheng · 2024-04-24T02:40:45Z

Has the efficiency of the new algorithm been tested? Are there any test data available?

I have not tested many cases, in Mg16Al16 case , time of stress_nl change from 94 s to 124 s, I think the performance of new method still can be improved.

Qianruipku · 2024-04-24T03:24:56Z

Perhaps the QE code can be used as a reference.

source/module_hamilt_pw/hamilt_pwdft/stress_func_nl.cpp

source/module_hamilt_pw/hamilt_pwdft/stress_func_us.cpp

source/module_hamilt_pw/hamilt_pwdft/stress_func_nl.cpp

… pw_stress

…pw_stress

dyzheng · 2024-05-06T05:02:35Z

I will work with @grysgreat to accelerate performance in GPU/DCU, change this PR to draft.

source/module_hamilt_pw/hamilt_pwdft/kernels/stress_op.h

…_stress

… pw_stress

…_stress

source/module_hamilt_pw/hamilt_pwdft/kernels/force_op.cpp

source/module_hamilt_pw/hamilt_pwdft/kernels/rocm/force_op.hip.cu

source/module_hamilt_pw/hamilt_pwdft/kernels/rocm/stress_op.hip.cu

mohanchen · 2024-06-26T04:10:24Z

two general comments. 1) lack of doxygen-style notes or explanations. 2) both CPU and GPU codes, especially for the GPU codes, are not well-written in terms of high-performance computing (but this can be improved in future)

… pw_stress

dyzheng added 2 commits April 23, 2024 17:48

Refactor: use less memory to calculate stress in pw base

64ee98c

Merge branch 'develop' of github.com:deepmodeling/abacus-develop into…

faac19f

… pw_stress

dyzheng requested review from WHUweiqingzhou and Qianruipku April 23, 2024 10:07

dyzheng added 2 commits April 23, 2024 22:18

Fix: nondiagonal matrix element in PW-Stress calculation

dfc9b0f

Fix: uspp stress calculation

12b407f

mohanchen reviewed Apr 27, 2024

View reviewed changes

Merge branch 'develop' into pw_stress

260bc30

hongriTianqi reviewed Apr 28, 2024

View reviewed changes

source/module_hamilt_pw/hamilt_pwdft/stress_func_nl.cpp Outdated Show resolved Hide resolved

hongriTianqi suggested changes Apr 28, 2024

View reviewed changes

mohanchen and others added 9 commits May 4, 2024 17:23

Merge branch 'develop' into pw_stress

75f01cc

Merge branch 'develop' into pw_stress

ccfd2f1

Merge branch 'develop' into pw_stress

4114e85

add gemm_op in gpu and add vkb_op

2eecc2b

delete comment

89a9ce9

Merge branch 'develop' of github.com:deepmodeling/abacus-develop into…

bb7ea83

… pw_stress

delete comment2

05d59b4

Merge branch 'pw_stress' of github.com:grysgreat/abacus-develop into …

d7d2782

…pw_stress

Fix: error in merging

cb63316

dyzheng marked this pull request as draft May 6, 2024 05:02

update ops.

95eef5f

dyzheng mentioned this pull request May 8, 2024

Out of memory bug in DCU calculation (Stress Memory) #3710

Closed

16 tasks

grysgreat added 2 commits May 9, 2024 15:33

add correct vq.

158cfa2

finish stress ops!

2072611

WHUweiqingzhou requested a review from denghuilu May 11, 2024 02:46

denghuilu reviewed May 11, 2024

View reviewed changes

source/module_hamilt_pw/hamilt_pwdft/kernels/stress_op.h Outdated Show resolved Hide resolved

dyzheng and others added 14 commits June 24, 2024 18:40

update from PR comments

0286f97

Merge branch 'pw_stress' of github.com:dyzheng/abacus-develop into pw…

c0b4af3

…_stress

[pre-commit.ci lite] apply automatic fixes

39fc340

add timer and optimize stress

3bd49a0

Merge branch 'pw_stress' of github.com:dyzheng/abacus-develop into pw…

e517bc5

…_stress

fix: g_plus_k

3a462cd

[pre-commit.ci lite] apply automatic fixes

23a51c2

Fix: delete pointers

40970c4

Merge branch 'pw_stress' of github.com:dyzheng/abacus-develop into pw…

6d0ba33

…_stress

Merge branch 'develop' of github.com:deepmodeling/abacus-develop into…

6c5227d

… pw_stress

[pre-commit.ci lite] apply automatic fixes

416d777

Fix: stress time

8ac8581

Merge branch 'pw_stress' of github.com:dyzheng/abacus-develop into pw…

a4f9d7e

…_stress

[pre-commit.ci lite] apply automatic fixes

0c12b34

mohanchen reviewed Jun 26, 2024

View reviewed changes

dyzheng changed the title ~~Refactor: use less memory to calculate stress in pw base~~ Refactor: use less memory and optimize performance to calculate force and stress in pw base Jun 26, 2024

dyzheng and others added 3 commits June 26, 2024 14:18

Merge branch 'develop' of github.com:deepmodeling/abacus-develop into…

190d606

… pw_stress

fix: add annotations

57626b8

[pre-commit.ci lite] apply automatic fixes

23323dc

mohanchen approved these changes Jun 27, 2024

View reviewed changes

Merge branch 'develop' into pw_stress

aee85ba

mohanchen approved these changes Jun 27, 2024

View reviewed changes

mohanchen merged commit dce250b into deepmodeling:develop Jun 27, 2024
13 checks passed

dyzheng mentioned this pull request Jul 4, 2024

The stress and force programs of SDFT and KSDFT can be unified. #4314

Open

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor: use less memory and optimize performance to calculate force and stress in pw base #4047

Refactor: use less memory and optimize performance to calculate force and stress in pw base #4047

dyzheng commented Apr 23, 2024 •

edited by WHUweiqingzhou

Loading

Qianruipku commented Apr 24, 2024

dyzheng commented Apr 24, 2024

Qianruipku commented Apr 24, 2024

dyzheng commented May 6, 2024

mohanchen commented Jun 26, 2024 •

edited

Loading

Refactor: use less memory and optimize performance to calculate force and stress in pw base #4047

Refactor: use less memory and optimize performance to calculate force and stress in pw base #4047

Conversation

dyzheng commented Apr 23, 2024 • edited by WHUweiqingzhou Loading

Linked Issue

Unit Tests and/or Case Tests for my changes

What's changed?

Any changes of core modules? (ignore if not applicable)

Qianruipku commented Apr 24, 2024

dyzheng commented Apr 24, 2024

Qianruipku commented Apr 24, 2024

dyzheng commented May 6, 2024

mohanchen commented Jun 26, 2024 • edited Loading

dyzheng commented Apr 23, 2024 •

edited by WHUweiqingzhou

Loading

mohanchen commented Jun 26, 2024 •

edited

Loading