-
Notifications
You must be signed in to change notification settings - Fork 78
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
hipblasGemmEx does not match the CPU or ROCBlas results for int8 x int8 to int32 matrix multiplication #498
Comments
@xinyazhang Thank you for reporting this error. Fixes are in the following hipBLAS and rocBLAS commits:
The line below is required before calling hipblasGemmEx:
The code below shows the above call and verifies the result. Running the code Result from rocblas_gemm_ex with i8 input Without the call to hipblasSetInt8Datatype the incorrect result is: Result from rocblas_gemm_ex with i8 input
|
warning: 'hipblasSetInt8Datatype' is deprecated: "The hipblasSetInt8Datatype function will be removed in a future release and only int8_t datatype will be supported. packed_int8x4 datatype support will be removed." [-Wdeprecated-declarations] hipblasSetInt8Datatype is still required.. |
Is rocblas preferred over hipblas ? |
The minimal testing case has been attached as
igemm_all_in_one.cc.gz.
Can be compiled with
g++ -I/usr/include/eigen3 -I/opt/rocm/include igemm_all_in_one.cc -Wl,-rpath,/opt/rocm/lib/ -L/opt/rocm/lib/ -lrocblas -lhipblas -lamdhip64 -o igemm_aiw
What is the expected behavior
What actually happens
How to reproduce
Environment
modinfo amdgpu|grep version
showsversion: 5.13.20.22.10
The text was updated successfully, but these errors were encountered: