Skip to content

GPU-accelerated matrix-matrix multiplication in Python using Pybind11

Notifications You must be signed in to change notification settings

henrikhestnes/CUDA-Pybind11-matrix-multiplication

Repository files navigation

CUDA-Pybind11-matrix-multiplication

Code for GPU-accelerating arbitrary-sized matrix-matrix multiplication in Python by exposing C++ and CUDA code to Python using Pybind11.

Prerequisites

  • Cuda installed in /usr/local/cuda
  • CMake 3.3 or later
  • Python 3.8.10 or later
  • PythonInterp 3.6 or later
  • PythonLibs 3.6 or later

Usage

Should compile out of the box by doing the following:

Bind C++ module to Python

sudo chmod +x bind_code.sh
./bind_code.sh

Test code in Python

python3 matmul.py

About

GPU-accelerated matrix-matrix multiplication in Python using Pybind11

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published