row_bcast operand from src/kernels/reduction_functions.h leads to an error on gfx10 #1528
Comments
This may be the reason: ROCm only handles gfx1030 and gfx1011. You can refer to my patch to try it on gfx1031.
@xuhuisheng thank you for the answer.
@kvirikroma This is an issue with the inline assembly code used in the BN OpenCL kernels. Note that gfx1031 is not officially supported. We can update MIOpen to resolve the issue. To help us prioritize this work, please indicate how important this issue is for you. /cc @junliume
@atamazov it's not urgent for me, because I'm doing all this just for fun/learning/experience and won't lose any money because of the issue (and even if I did, it would be my fault tbh, considering gfx1031 is not supported). But it would be great to get it working soon :)
@kvirikroma Okay. Luckily, the fix seems easy to implement. There is PR #1531, please review it first, then give it a try and let us know if it works for you.
The issue was solved by @atamazov in https://github.com/atamazov/MIOpen/tree/generalize-gfx10 so I'm closing it.
@kvirikroma Technically, the issue is not closed until the PR is merged in. I recommend re-opening.
The PR is merged :)
Note: I'm new to ROCm and this error may be my own fault, but I still would be thankful for any kind of help.
I've compiled MIOpen from source and tried to use it with TensorFlow. It works fine with a simple dense network, but when I tried it on a convolutional one, I got an error during the training stage:
I've built MIOpen with the following parameters:
Environment:
GPU: Radeon RX 6700 XT (Navi 22), gfx1031 architecture
OS: Arch Linux
ROCm version: 5.1.1
MIOpen version: 2.16.0
TensorFlow version: tf-nightly-rocm 2.10.0 (custom build for gfx1031)
Yes, I know that gfx1031 is not officially supported, but AFAICT the error happens during compilation of the GPU assembly (not at runtime), so I don't think the two are related.
Also, I've tried changing those instructions to v_add_f32_dpp, but it did not fix the problem (they are on lines 95-99 of src/kernels/reduction_functions.h).
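For anyone hitting the same assembler error, here is a rough sketch of why the target name matters. It assumes (from my reading of the LLVM AMDGPU assembler documentation, not from anything confirmed in this thread) that the row_bcast DPP control is accepted only on gfx8 and gfx9 targets, which would also explain why changing the mnemonic to v_add_f32_dpp alone does not help: the operand itself is rejected. The helper names gfx_major and supports_row_bcast are hypothetical and are not part of MIOpen or ROCm.

```python
def gfx_major(arch: str) -> int:
    """Return the major generation of an LLVM-style target name such as 'gfx1031'."""
    # Drop optional feature suffixes such as ':xnack-' before parsing.
    name = arch.strip().lower().split(":")[0]
    if not name.startswith("gfx") or len(name) < 6:
        raise ValueError(f"unrecognized target name: {arch!r}")
    # The last two characters encode minor version and stepping (stepping may be
    # a hex digit, as in 'gfx90a'); everything before them is the major number.
    return int(name[3:-2])


def supports_row_bcast(arch: str) -> bool:
    """Guess whether the assembler accepts row_bcast DPP controls for this target."""
    # ASSUMPTION: row_bcast is available on gfx8 and gfx9 only, not on gfx10+.
    return gfx_major(arch) in (8, 9)


if __name__ == "__main__":
    for arch in ("gfx906", "gfx90a", "gfx1030", "gfx1031"):
        print(arch, supports_row_bcast(arch))
```

A kernel build script could use a check like this to choose between a row_bcast-based reduction and a fallback path, instead of hard-coding individual target names such as gfx1030.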