
[TODO] SIMD, BF16/FP16, INT8 optimization #79

Open
syoyo opened this issue Feb 6, 2023 · 1 comment

syoyo commented Feb 6, 2023

Currently NanoRT does not utilize SIMD/AVX.

There is also no quantized BVH support.

It would be good to start considering SIMD optimization and BVH quantization.

Fortunately, recent CPU architectures (Alder Lake, Zen 4) have native BF16/FP16 and INT8 instruction support, which will speed up quantized BVH construction/traversal.
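For illustration, here is a minimal sketch of one way a quantized BVH node could be laid out. The struct name, field names, and the 8-bit parent-relative quantization grid are assumptions for this sketch, not NanoRT's existing `BVHNode` layout:

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>

// Hypothetical compressed node: child AABBs are stored as 8-bit coordinates
// relative to the parent bounds, so the per-child bounds shrink from 24 bytes
// of float to 6 bytes, at the cost of a dequantization step during traversal.
struct QuantizedBVHNode {
  float parent_bmin[3];      // parent AABB, full precision
  float parent_bmax[3];
  uint8_t child_bmin[2][3];  // quantized child AABBs (binary BVH, 2 children)
  uint8_t child_bmax[2][3];
  uint32_t child_index[2];   // leaf/inner child indices
};

// Map a child bound into the parent's [pmin, pmax] range on a 256-step grid.
// Min coordinates round down and max coordinates round up so the quantized
// box always encloses the original (conservative for traversal).
inline uint8_t QuantizeFloor(float v, float pmin, float pmax) {
  float t = (pmax > pmin) ? (v - pmin) / (pmax - pmin) : 0.0f;
  return static_cast<uint8_t>(
      std::clamp(static_cast<int>(std::floor(t * 255.0f)), 0, 255));
}

inline uint8_t QuantizeCeil(float v, float pmin, float pmax) {
  float t = (pmax > pmin) ? (v - pmin) / (pmax - pmin) : 0.0f;
  return static_cast<uint8_t>(
      std::clamp(static_cast<int>(std::ceil(t * 255.0f)), 0, 255));
}
```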


syoyo commented Feb 22, 2023

We can use https://github.com/DLTcollab/sse2neon to write the SIMD code once with SSE intrinsics and target both SSE and NEON (Arm). See the sketch below.
(TODO: RISC-V SIMD)
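As a sketch of how that could look (the function and the SoA bounds layout are assumptions for illustration, not existing NanoRT code): a slab test of one ray against four AABBs written with SSE intrinsics, which compiles on Arm simply by including sse2neon.h instead of the x86 header.

```cpp
#if defined(__aarch64__) || defined(__arm__)
#include "sse2neon.h"   // https://github.com/DLTcollab/sse2neon
#else
#include <xmmintrin.h>  // SSE
#endif

// Slab test of one ray against 4 AABBs at once (SoA layout: bmin[axis][box]),
// returning a 4-bit hit mask. inv_dir is assumed to hold 1/dir per axis.
static inline int IntersectRayAABB4(
    const float org[3], const float inv_dir[3], float tmin, float tmax,
    const float bmin[3][4], const float bmax[3][4]) {
  __m128 t0 = _mm_set1_ps(tmin);
  __m128 t1 = _mm_set1_ps(tmax);
  for (int axis = 0; axis < 3; ++axis) {
    __m128 o  = _mm_set1_ps(org[axis]);
    __m128 d  = _mm_set1_ps(inv_dir[axis]);
    __m128 lo = _mm_mul_ps(_mm_sub_ps(_mm_loadu_ps(bmin[axis]), o), d);
    __m128 hi = _mm_mul_ps(_mm_sub_ps(_mm_loadu_ps(bmax[axis]), o), d);
    t0 = _mm_max_ps(t0, _mm_min_ps(lo, hi));
    t1 = _mm_min_ps(t1, _mm_max_ps(lo, hi));
  }
  // Lane i hits if its entry distance is <= its exit distance.
  return _mm_movemask_ps(_mm_cmple_ps(t0, t1));
}
```

The same source builds unchanged on x86 (SSE) and Arm (NEON via sse2neon), so only the include needs a platform switch.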
