HPC
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Implementations of SIMD instruction sets for systems which don't natively support them.
Agenium Scale vectorization library for CPUs and GPUs
std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
UME::SIMD A library for explicit simd vectorization.
Performance-portable, length-agnostic SIMD with runtime dispatch
A hardware implementation of CNN, written by Verilog and synthesized on FPGA
EASTL stands for Electronic Arts Standard Template Library. It is an extensive and robust implementation that has an emphasis on high performance.
Portable header-only C++ low level SIMD library
Portable wrapper for SIMD and vector instructions written in C++11. Compatible with NEON, SSE, AVX, AVX-512 and SVE (length specific).
portDNN is a library implementing neural network algorithms written using SYCL
The platform independent header allowing to compile any C/C++ code containing ARM NEON intrinsic functions for x86 target systems using SIMD up to AVX2 intrinsic functions
A convolutional neural network implemented in hardware (verilog)
CNN acceleration on virtex-7 FPGA with verilog HDL
HIP: C++ Heterogeneous-Compute Interface for Portability
Open Source Parallel STL implementation
3D Tensors for Blaze (https://bitbucket.org/blaze-lib/blaze)
Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.
VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP
Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation



