A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
-
Updated
Oct 9, 2024 - C++
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
Header files for allowing Intel intrinsics in Cython
Using SIMD instructions in image processing using OpenCV
Project that aims to optimize the implementation of an algorithm that generates the Mandelbrot set using parallelization, vectorization and cuda
Parallelism standards for accelerating performance on calculations
8x speedup of 1D Haar-Transform using intel SIMD intrinsics
Parallization protocols for accelerating algorithm performance
CS Engineering Project - Code vectorization for AVX/AVX2 platforms with Intel Intrinsics - developed at Politecnico di Milano as BSc student
Summer internship 2020, LLNL HPCCEA. Used Intel MSRs to evaluate and optimize different matrix multiplication algorithms.
Unikraft port of psimd, portable SIMD intrinsics
Add a description, image, and links to the intel-intrinsics topic page so that developers can more easily learn about it.
To associate your repository with the intel-intrinsics topic, visit your repo's landing page and select "manage topics."