Extend auto-SIMD support #202

Open
JamesKingdon opened this issue Oct 2, 2017 · 0 comments

JamesKingdon commented Oct 2, 2017

The hot loop of the LU matrix factorization kernel in the SciMark benchmark is suitable for auto-SIMD, but it is not currently converted because it uses a multiply-subtract. The current auto-SIMD optimisation can transform multiply-adds, and support for multiply-subtract is being implemented.
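For readers unfamiliar with the pattern, here is a minimal sketch of the two loop shapes being contrasted. The class and method names are hypothetical and are not taken from SciMark or OpenJ9; the point is only that the row update in an LU elimination step has the form `row[k] -= factor * pivotRow[k]`, which differs from the multiply-add form by a single sign.

```java
// Hypothetical illustration only; names are not from SciMark or OpenJ9.
final class LuRowUpdate {
    // Multiply-subtract: row[k] = row[k] - factor * pivotRow[k] for k >= start.
    // This is the shape of the row update in an LU elimination step.
    static void multiplySubtract(double[] row, double[] pivotRow, double factor, int start) {
        for (int k = start; k < row.length; k++) {
            row[k] -= factor * pivotRow[k];
        }
    }

    // Multiply-add: row[k] = row[k] + factor * pivotRow[k] for k >= start.
    // Same memory access pattern; only the sign of the update differs.
    static void multiplyAdd(double[] row, double[] pivotRow, double factor, int start) {
        for (int k = start; k < row.length; k++) {
            row[k] += factor * pivotRow[k];
        }
    }
}
```

Each iteration is independent of the others, which is why both loops are candidates for vectorization in principle.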

The loop would also benefit from four-wide AVX instructions for the double-precision vector operations once auto-SIMD has vectorized it.
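To make "four-wide" concrete: a 256-bit AVX register holds four 64-bit doubles, so a vectorized loop handles four elements per step plus a scalar tail. The sketch below emulates that strip shape in plain Java. It is an illustration under that assumption, not a description of the code the JIT actually generates, and the names are hypothetical.

```java
// Hypothetical scalar emulation of a four-wide strip; not OpenJ9's generated code.
final class FourWideSketch {
    // Four 64-bit doubles fit in one 256-bit AVX register.
    static final int LANES = 4;

    static void multiplySubtractStrips(double[] row, double[] pivotRow, double factor, int start) {
        int k = start;
        // Last index up to which whole strips of LANES elements fit.
        int limit = row.length - (row.length - start) % LANES;
        // Main loop: each iteration stands in for one four-wide vector operation.
        for (; k < limit; k += LANES) {
            row[k]     -= factor * pivotRow[k];
            row[k + 1] -= factor * pivotRow[k + 1];
            row[k + 2] -= factor * pivotRow[k + 2];
            row[k + 3] -= factor * pivotRow[k + 3];
        }
        // Scalar tail for the remaining 0 to 3 elements.
        for (; k < row.length; k++) {
            row[k] -= factor * pivotRow[k];
        }
    }
}
```

Because every element is updated independently, this produces the same values as the plain one-element-at-a-time loop.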

Tested on a 32-core Xeon(R) CPU E7-8867:

OpenJ9 2820 Mflops vs HotSpot 4445 Mflops (HotSpot is roughly 1.6x faster)

By studying the OpenJ9 profile we estimate that implementing the two improvements above should help substantially.
