-
Notifications
You must be signed in to change notification settings - Fork 529
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Intermittent memory access error #2097
Comments
I cannot reproduce this under gdb so I would guess there's a race condition somewhere. Please advise. |
Are you using rocm-3.5 branch? |
Yes indeed, this was occuring both on 3.3 and now on 3.5 |
I absolutely understand that his may be due to my own buggy code. |
You could try by SERIALIZING kernels and memcpys. With ROCm3.5, you can use AMD_SERIALIZE_KERNEL and AMD_SERIALIZE_COPY env vars as described @ https://github.com/ROCm-Developer-Tools/ROCclr/blob/roc-3.5.x/utils/flags.hpp#L220-#L225 |
will give it a shot thanks |
I'm not really sure what these env vars are doing, but it doesn't make much of a difference
results in both of these I think there's binary characters being logged somewhere in there too which isn't ideal! |
Got it to happen with dbg
|
After building https://github.com/RadeonOpenCompute/ROCR-Runtime with debug symbols.
|
and
|
I have my code running on Nvidia hardware with no issues. |
Any chance I can get a comment on this? If I can get access to a machine with an "official setup", and if I can't reproduce this, I'll close the ticket (and reconsider my life choices) |
I don't know if it's relevant, but the error appears to happen less often when executed just after compiling/linking |
fault.Failure.ErrorType is 0 |
ok, I'm an idiot. |
Every few executions I get the following error:
What I find surprising is that my code works at all yet sometimes fails.
My memory structure is perhaps some what complex, but I don't see why this should be causing problems.
I have no async memory calls.
The text was updated successfully, but these errors were encountered: