radeon-open-compute

Here are 4 public repositories matching this topic...

ROCm / Tensile

Stretching GPU performance for GEMMs and tensor contractions.

python machine-learning amd gpu assembly opencl dnn matrix-multiplication neural-networks gpu-acceleration blas hip gpu-computing tensors tensor-contraction gemm radeon auto-tuning radeon-open-compute

Updated Apr 18, 2024
Python

GPUOpen-ProfessionalCompute-Libraries / amdovx-core

Star

AMD OpenVX Core -- a sub-module of amdovx-modules:

linux cpu opencl amdgpu rocm radeon-open-compute openvx radeon-instinct-mi-series radeon-vega-series amd-openvx khronos-openvx vx-loomsl

Updated Feb 5, 2019
C++

GPUOpen-ProfessionalCompute-Libraries / amdovx-modules

Star

AMD OpenVX modules: such as, neural network inference, 360 video stitching, etc.

video-stitching rocm radeon-open-compute openvx onnx neural-network-inference radeon-instinct-mi-series radeon-vega-series

Updated Feb 5, 2019
C++

ROCm / hipBLASLt

Star

hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library

machine-learning amd assembly matrix-multiplication blas hip gpu-computing gemm rocm radeon-open-compute

Updated Apr 18, 2024
Assembly

Improve this page

Add a description, image, and links to the radeon-open-compute topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the radeon-open-compute topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

radeon-open-compute

Here are 4 public repositories matching this topic...

ROCm / Tensile

GPUOpen-ProfessionalCompute-Libraries / amdovx-core

GPUOpen-ProfessionalCompute-Libraries / amdovx-modules

ROCm / hipBLASLt

Improve this page

Add this topic to your repo