BLAS-like Library Instantiation Software Framework
Updated Jun 4, 2024 - C
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Acceleration package for neural networks on multi-core CPUs
BLISlab: A Sandbox for Optimizing GEMM
💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
Parallel Matrix Multiplication Using OpenMP, Pthreads, and MPI
M4RI is a library for fast arithmetic with dense matrices over GF(2)
Several small parallel programming experiments built with OpenMP and MPI: matrix multiplication, LU decomposition of matrices, and the document vectorization step of document classification
Fast matrix multiplication implementation in the C programming language. The algorithm is similar to what NumPy uses to compute dot products.
A collection of matrix manipulation algorithms
Matrix multiplication on multiple Nios II cores
This repository targets performance optimization of the OpenCL gemm function. It compares the sgemm performance of several BLAS libraries (clBLAS, CLBlast, MIOpenGEMM, Intel MKL on CPU, and cuBLAS on CUDA) across different matrix sizes, vendors' hardware, and operating systems. Prebuilt x86_64 binaries for MSVC, MinGW, and Linux (CentOS) are provided, so it works out of the box.
Matrix Multiplication in Hardware
☕ Implementation of parallel matrix multiplication methods using the Fox algorithm on Peking University's high-performance computing system
Minimalistic implementations of various algorithms for projects in machine learning and computer vision
Linear Algebra library for C/C++
phiGEMM: CPU-GPU hybrid matrix-matrix multiplication library
Implementation of a matrix multiplication program with message passing