Stretching GPU performance for GEMMs and tensor contractions.
Updated Apr 18, 2024 - Python
AMD OpenVX Core: a sub-module of amdovx-modules.
AMD OpenVX modules, such as neural network inference and 360 video stitching.
hipBLASLt is a library that provides general matrix-matrix operations through a flexible API and extends functionality beyond that of a traditional BLAS library.