Performance-portable, length-agnostic SIMD with runtime dispatch
-
Updated
Jun 7, 2024 - C++
Performance-portable, length-agnostic SIMD with runtime dispatch
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
DR3 enables users to write vectorised code using generic lambdas and filters. Switch instruction set just by changing enclosing namespace
C++ template for generating small sorting networks compatible with SIMD intrinsics
C++ interface for SIMD instruction sets
Vectroized String Helper Functions
high-speed math functions based on AVX-512 intrinsics
SIMD (AVX) and multi threaded simple raytracer
IQ-TREE ported to work for systems with ARM NEON ISA
Generic ImageProcessing library
Header-only C++ library to load SIMD-vectors column-wise from a matrix but then use diagonally
A SIMD library that provides an intuitive and readable interface to 256-bit AVX and AVX2 SIMD instructions using low-cost abstractions.
K-Means clustering algorithm from scratch in C++ with SSE SIMD instructions
miner for block № 0 and others
Add a description, image, and links to the simd-intrinsics topic page so that developers can more easily learn about it.
To associate your repository with the simd-intrinsics topic, visit your repo's landing page and select "manage topics."