s-trinh

Stars

HPC

133 repositories

Expressive Vector Engine - SIMD in C++ Goes Brrrr

C++ 1,283 66 Updated Jan 21, 2026

mold: A Modern Linker 🦠

C++ 16,072 527 Updated Dec 12, 2025

Embree ray tracing kernels repository.

C++ 2,639 420 Updated Jan 16, 2026

SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT

C 798 148 Updated Dec 25, 2025

Image processing library for learning purposes

C 54 8 Updated Dec 6, 2024

Implementations of SIMD instruction sets for systems which don't natively support them.

C 2,925 301 Updated Jan 21, 2026

Agenium Scale vectorization library for CPUs and GPUs

C 337 31 Updated Oct 21, 2021

std::experimental::simd for GCC [ISO/IEC TS 19570:2018]

C++ 637 41 Updated Mar 10, 2023

SIMD Vector Classes for C++

C++ 1,516 152 Updated Jun 6, 2024

UME::SIMD: a library for explicit SIMD vectorization.

C++ 91 16 Updated Jan 19, 2018

Performance-portable, length-agnostic SIMD with runtime dispatch

C++ 5,280 400 Updated Jan 20, 2026

A hardware implementation of a CNN, written in Verilog and synthesized on an FPGA

Coq 249 77 Updated Dec 29, 2018

EASTL stands for Electronic Arts Standard Template Library. It is an extensive and robust implementation that has an emphasis on high performance.

C++ 9,107 1,018 Updated Nov 15, 2025

Portable header-only C++ low level SIMD library

C++ 1,298 130 Updated Aug 26, 2024

Portable wrapper for SIMD and vector instructions written in C++11. Compatible with NEON, SSE, AVX, AVX-512 and SVE (length specific).

C++ 517 94 Updated Dec 4, 2025

portDNN is a library implementing neural network algorithms written using SYCL

C++ 113 22 Updated May 21, 2024

A platform-independent header that allows any C/C++ code containing ARM NEON intrinsic functions to be compiled for x86 target systems using SIMD intrinsic functions up to AVX2

C 484 158 Updated Oct 23, 2025

A CNN module implemented in Verilog that can be easily used in FPGA projects

Verilog 580 116 Updated Jun 18, 2018

A convolutional neural network implemented in hardware (Verilog)

Verilog 166 82 Updated Sep 7, 2017

CNN acceleration on a Virtex-7 FPGA with Verilog HDL

Verilog 471 138 Updated Feb 27, 2018

HIP: C++ Heterogeneous-Compute Interface for Portability

C++ 4,333 577 Updated Jan 21, 2026

Fast log and exp functions for AVX2/AVX-512

Python 239 38 Updated Mar 12, 2025

Open Source Parallel STL implementation

C++ 529 83 Updated Jan 26, 2024

Sparse Parallel Robust Algorithms Library

Fortran 134 30 Updated Jan 21, 2026

C++ 144 87 Updated Jan 20, 2026

3D Tensors for Blaze (https://bitbucket.org/blaze-lib/blaze)

C++ 38 8 Updated Oct 1, 2020

Scalable High-performance Algorithms and Data-structures

C++ 135 38 Updated Dec 5, 2025

Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.

C++ 260 50 Updated Jan 13, 2025

VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP

C++ 719 84 Updated Jul 19, 2025

Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation

C 325 89 Updated Aug 11, 2023