Make use of additional ARMv8 intrinsics #40

walbourn · 2016-11-11T06:50:02Z

The Windows ARM 32-bit platform requires ARMv7, while the Windows ARM 64-bit platform requires ARMv8. Therefore, DirectXMath can make use of additional ARMv8 ARM-NEON intrinsics for dividing, rounding, and half-precision conversion on the ARM64 platform:

vrndnq_f32
vrndq_f32
vrndmq_f32
vrndpq_f32
vcvt_f32_f16
vcvt_f16_f32
vdivq_f32
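
To make the rounding intrinsics above concrete, here is a minimal sketch of how the four rounding modes line up with them. `Float4`, `RoundMode`, `RoundLanes`, and `SKETCH_HAS_ARMV8_NEON` are hypothetical names used only for illustration and are not DirectXMath's API. The scalar branch is an assumed fallback so the sketch also builds on non-ARM64 targets.

```cpp
#include <array>
#include <cmath>
#include <cstddef>

// Pick the NEON header only when targeting ARM64 (assumed guard for this sketch).
#if defined(_M_ARM64) || defined(__aarch64__)
#  if defined(_MSC_VER) && !defined(__clang__)
#    include <arm64_neon.h>
#  else
#    include <arm_neon.h>
#  endif
#  define SKETCH_HAS_ARMV8_NEON 1
#else
#  define SKETCH_HAS_ARMV8_NEON 0
#endif

using Float4 = std::array<float, 4>;  // hypothetical stand-in for a 4-lane vector

enum class RoundMode { NearestEven, TowardZero, Down, Up };

inline Float4 RoundLanes(const Float4& v, RoundMode mode)
{
    Float4 r{};
#if SKETCH_HAS_ARMV8_NEON
    float32x4_t x = vld1q_f32(v.data());
    switch (mode) {
    case RoundMode::NearestEven: x = vrndnq_f32(x); break;  // to nearest, ties to even
    case RoundMode::TowardZero:  x = vrndq_f32(x);  break;  // truncate
    case RoundMode::Down:        x = vrndmq_f32(x); break;  // toward minus infinity
    case RoundMode::Up:          x = vrndpq_f32(x); break;  // toward plus infinity
    }
    vst1q_f32(r.data(), x);
#else
    // Scalar fallback; nearbyint assumes the default FE_TONEAREST rounding mode.
    for (std::size_t i = 0; i < r.size(); ++i) {
        switch (mode) {
        case RoundMode::NearestEven: r[i] = std::nearbyint(v[i]); break;
        case RoundMode::TowardZero:  r[i] = std::trunc(v[i]);     break;
        case RoundMode::Down:        r[i] = std::floor(v[i]);     break;
        case RoundMode::Up:          r[i] = std::ceil(v[i]);      break;
        }
    }
#endif
    return r;
}
```

On ARMv8 each mode is one instruction per vector, which is presumably why these intrinsics are attractive for the rounding functions.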

walbourn · 2016-11-11T06:50:15Z

The additional ARMv8 optimizations are now checked in for the following functions when building for _M_ARM64.

XMMATRIX::operator/=
XMMATRIX::operator/

XMVectorRound
XMVectorTruncate
XMVectorFloor
XMVectorCeiling
XMVectorSum
XMVectorDivide
XMVectorReciprocal
XMVector2TransformCoordStream
XMVector3TransformCoordStream
XMVector3ProjectStream
XMVector3UnprojectStream

XMConvertHalfToFloat
XMConvertHalfToFloatStream
XMConvertFloatToHalf
XMConvertFloatToHalfStream

walbourn · 2016-11-11T06:50:32Z

Fixed for DirectXMath 3.10

walbourn added the optimization label Nov 11, 2016

walbourn closed this as completed Nov 11, 2016

walbourn self-assigned this Nov 11, 2016
