gemm

Wrapper around intgemm (x86_64) and ruy (ARM) that selects between the two based on architecture, providing a fast matrix multiplication backend for Mozilla Firefox's translation feature.

wrapper arm x86-64 gemm

Updated Apr 20, 2022
C++

blackccpie / fastconv

fast 2D convolution implementation benchmark

cpp avx simd convolution gemm toeplitz im2col

Updated Nov 21, 2017
C++

CambriconECO / BANGC_Gemm_Tutorial

algorithm gemm cambricon bangc

Updated Apr 7, 2021
C++

CoffeeBeforeArch / mmul

Serial and parallel implementations of matrix multiplication

serial parallel matrix-multiplication benchmarks gemm mmul

Updated Feb 19, 2021
C++

eth-cscs / spla

Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceleration.

linear-algebra mpi cuda gemm rocm

Updated Jun 7, 2024
C++

CNugteren / CLBlast

Tuned OpenCL BLAS

gpu opencl matrix-multiplication blas gemm blas-libraries clblas

Updated Jun 13, 2024
C++

OpenNMT / CTranslate2

Fast inference engine for Transformer models

Updated Jun 24, 2024
C++

gemm

Here are 19 public repositories matching this topic...

PhuNH / hpc-aa

yester31 / OpenCL_EX

a-sidorova / gpu_opencl_cource

KaiserKlayton / lpa_cnn

zixuanweeei / gemm-opt

xylcbd / gemm_base

riskybacon / mnist_arma_blas

BenQuickDeNN / CUDA-GEMM

XiaoSong9905 / dgemm-knl

LRZ-BADW / OMMOP

scocoyash / Convolution-To-Gemm

yester31 / GEMM_Conv2d_CUDA

jerinphilip / MozIntGemm

blackccpie / fastconv

CambriconECO / BANGC_Gemm_Tutorial

CoffeeBeforeArch / mmul

eth-cscs / spla

CNugteren / CLBlast

OpenNMT / CTranslate2
