You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Few months ago I tried to build the develop branch of rocBLAS with gfx1100 support and one users ran some benchmarks. The result shows that 7900XTX has good performance on FP16 and mixed precision GEMM, but a poor performance on FP32 GEMM. Checking the git log, it seems to indicate that there's only basic GEMM support + WMMA support, the further optimization does not come.
Library context
Software
version
rocblas
e44855972ff75053ba08922ca89ab288e6a9462e
The text was updated successfully, but these errors were encountered:
https://bugs.gentoo.org/891499#c46
Few months ago I tried to build the develop branch of rocBLAS with gfx1100 support and one users ran some benchmarks. The result shows that 7900XTX has good performance on FP16 and mixed precision GEMM, but a poor performance on FP32 GEMM. Checking the git log, it seems to indicate that there's only basic GEMM support + WMMA support, the further optimization does not come.
Library context
The text was updated successfully, but these errors were encountered: