three CUDA.@atomic in a row result in out-of-bounds error #1254

bjarthur · 2021-11-30T00:13:59Z

but only on a 1080Ti. works fine on an A100. if i delete one of them, it also works. a minimum broken example would take some time to write. wanted to inquire whether this is a known issue before i investigate further. kernel as it is now looks like this:

function kernel(bspike,
                w0Index, w0Weights, forwardInputsE, forwardInputsI,
                wpIndexOut, wpWeightOut, forwardInputsP)
    i = threadIdx().x + (blockIdx().x - 1) * blockDim().x
    j = threadIdx().y + (blockIdx().y - 1) * blockDim().y

    if bspike[j]
        if i<=size(w0Index,1)
            CUDA.@atomic forwardInputsE[0x1 + w0Index[i,j]] += max(w0Weights[i,j], 0)
            CUDA.@atomic forwardInputsI[0x1 + w0Index[i,j]] += min(w0Weights[i,j], 0)
        end
        if i<=size(wpIndexOut,1)
            CUDA.@atomic forwardInputsP[0x1 + wpIndexOut[i,j]] += wpWeightOut[i,j]
        end
    end
    return nothing
end

this is with julia 1.7-rc2, CUDA.jl 3.5.0, and CUDA 11.2 for the 1080Ti and 11.4 for the A100. thanks.

The text was updated successfully, but these errors were encountered:

bjarthur · 2021-11-30T00:18:46Z

oh, and it works on a 2080Ti too with 11.4. so maybe it's just broken for 11.2??

maleadt · 2021-11-30T06:46:43Z

Define 'broken'? Please include at least a stack trace or compiler error. Generally this would require a MWE, I don't see what shouldn't work about multiple @atomic invocations.

bjarthur · 2021-12-01T04:10:31Z

i can't reproduce now. sorry. will re-open if/when i can.

bjarthur added the bug Something isn't working label Nov 30, 2021

bjarthur closed this as completed Dec 1, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

three CUDA.@atomic in a row result in out-of-bounds error #1254

three CUDA.@atomic in a row result in out-of-bounds error #1254

bjarthur commented Nov 30, 2021 •

edited

bjarthur commented Nov 30, 2021

maleadt commented Nov 30, 2021

bjarthur commented Dec 1, 2021 •

edited

three CUDA.@atomic in a row result in out-of-bounds error #1254

three CUDA.@atomic in a row result in out-of-bounds error #1254

Comments

bjarthur commented Nov 30, 2021 • edited

bjarthur commented Nov 30, 2021

maleadt commented Nov 30, 2021

bjarthur commented Dec 1, 2021 • edited

bjarthur commented Nov 30, 2021 •

edited

bjarthur commented Dec 1, 2021 •

edited