Extend auto-SIMD support #202

JamesKingdon · 2017-10-02T20:08:18Z

The hot loop of the LU matrix factorization kernel of the SciMark benchmark is suitable for auto-SIMD, but currently isn't converted because it uses multiply-subtract. The current auto-SIMD optimisation is capable of transforming multiply-adds, and support for multiply-subtract is being implemented.

The loop would also benefit from the use of four-wide avx instructions for the vector double operations when auto-SIMD reduces it.

Testing on 32 core Xeon(R) CPU E7-8867

OpenJ9 2820 Mflops vs HotSpot 4445 Mflops

By studying the OpenJ9 profile we estimate that implementing the two improvements above should help substantially.

pshipton added the comp:jit label Dec 28, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend auto-SIMD support #202

Extend auto-SIMD support #202

JamesKingdon commented Oct 2, 2017 •

edited

Extend auto-SIMD support #202

Extend auto-SIMD support #202

Comments

JamesKingdon commented Oct 2, 2017 • edited

JamesKingdon commented Oct 2, 2017 •

edited