Optimized implementation of fractal compression algorithm
Updated Aug 14, 2017 - C++
UME::SIMD: a library for explicit SIMD vectorization.
Calculate the Sum of Absolute Differences (SAD) with AVX-512
Compile-time blend masks that unify _mm256_blend_epi8, _mm256_blend_epi16, _mm256_blend_epi32
DSL for SIMD Sorting on AVX2 & AVX512
Fast generation of long sequences of Bernoulli-distributed random variables
Vectorized String Helper Functions
2-norm guided FP32 truncation for heterogeneous deep learning training
What features do your CPU and OS support?
C++17 N-body Barnes-Hut on heterogeneous hardware architectures
Tiny, optimized, game-oriented math framework
Implements the PatchMatch stereo algorithm. AVX2 intrinsics for Intel.