tensor-core

Here are 4 public repositories matching this topic...

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Several optimization methods of half-precision general matrix vector multiplication (HGEMV) using CUDA core.

Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.

The lab assignments from CS4302 Parallel and Distributed Programming (2022 Fall) with my solutions

Add a description, image, and links to the tensor-core topic page so that developers can more easily learn about it.

To associate your repository with the tensor-core topic, visit your repo's landing page and select "manage topics."