Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Perf(Force&Stress): using zgemm in nonlocal part for high performance #1021

Merged
merged 3 commits into from
Jun 17, 2022

Conversation

dyzheng
Copy link
Collaborator

@dyzheng dyzheng commented Jun 15, 2022

This is the ultimate optimization for Force and Stress calculation in PW code for KSDFT.

@dyzheng
Copy link
Collaborator Author

dyzheng commented Jun 15, 2022

before optimized:
Forces cal_force_nl 12.493 1 12 2.3 % Stress_PW cal_stress 19.116 1 19 3.5 %
after #1015 :
Forces cal_force_nl 5.8774 1 5.9 1.1 % Stress_PW cal_stress 11.012 1 11 2.1 %
after this PR
Forces cal_force_nl 0.90567 1 0.91 0.17 % Stress_PW cal_stress 5.4573 1 5.5 1 %

@caic99 caic99 linked an issue Jun 16, 2022 that may be closed by this pull request
@dyzheng dyzheng merged commit e5427de into deepmodeling:develop Jun 17, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Stres_nl needs efficiency optimization
2 participants