gemm

A fast matrix multiplication implementation in C. The algorithm is similar to the one NumPy uses to compute dot products.

c matrix-multiplication gemm gemm-optimization

Updated Jun 6, 2021
C

This repository targets performance optimization of the OpenCL gemm function. It compares the sgemm performance of several BLAS libraries (clBLAS, clBLAST, MIOpenGemm, Intel MKL (CPU), and cuBLAS (CUDA)) across different matrix sizes, hardware vendors, and operating systems. Prebuilt x86_64 binaries for MSVC, MinGW, and Linux (CentOS) are provided, so it works out of the box.

opencl cublas matrix-multiplication blas gemm mkl clblas sgemm clblast gemm-optimization clnet

Updated Mar 28, 2019
C

koallen / gemm-optimization

My experiments on optimizing GEMM

optimization math-library gemm

Updated Oct 17, 2018
C

Here are 16 public repositories matching this topic...

salykova / matmul.c

PkuCuipy / how-to-optimize-gemm-zh-notes

jiegec / sgemm-optimize

DongqiShen / iLLM

yui0 / slibs

yui0 / ugemm

flame / how-to-optimize-gemm

pminhtam / xnor_conv_pytorch_extension

ZhangGe6 / how-to-optimize-playground

rollingbug / LinMatrix

yzhaiustc / Optimizing-DGEMM-on-Intel-CPUs-with-AVX512F

junyoung1992 / OpenCL-GEMM

flame / blislab

iVishalr / GEMM

mz24cn / gemm_optimization

koallen / gemm-optimization
