CUDA.jl initialisation fails after suspending Ubuntu 20.04 with CUDA 11.2
Additional context
You will see an unrelated error:
Error: Exception while generating log record in module CUDA at /home/qyu/.julia/dev/CUDA/src/initialization.jl:34
│ exception =
│ UndefVarError: ex not defined
│ Stacktrace:...
using CUDA

# do some random stuff
W = cu(rand(2, 5)) # a 2×5 CuArray
b = cu(rand(2))
predict(x) = W*x .+ b
loss(x, y) = sum((predict(x) .- y).^2)
x, y = cu(rand(5)), cu(rand(2)) # Dummy data
loss(x, y) # ~ 3
# Suspend the machine
To suspend the machine:
click the top-right of the screen
click Power Off / Log Out
click Suspend
Now wake up the machine. The existing Julia session stops working with CUDA.jl. After restarting Atom/Juno, or just Julia in a terminal, Julia reports "ERROR: CUDA.jl did not successfully initialize, and is not usable." when running, for example, cu(rand(2)).
Error 999 is your driver being messed up. Nothing we can do about that.
Ah, I forgot to close this issue. I started managing multiple CUDA environments when I tried to play with CUDA.jl, so I deleted the lines that add the paths automatically. Somehow, before I suspended the system Julia could find the path, but not after.
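For anyone hitting the same symptom, a quick way to check whether the paths are still visible to a fresh Julia session is sketched below. It assumes the deleted lines were the usual exports of PATH and LD_LIBRARY_PATH (the comment above does not say which variables were involved). The helper name cuda_path_report is hypothetical and not part of CUDA.jl; it only uses Julia's standard Libdl library.

```julia
using Libdl

# Hypothetical helper (not part of CUDA.jl): report the environment details
# that commonly decide whether a CUDA installation can be located.
# Assumption: the toolkit directories contain "cuda" in their path.
function cuda_path_report(env = ENV)
    path_entries = split(get(env, "PATH", ""), ':'; keepempty = false)
    ld_entries = split(get(env, "LD_LIBRARY_PATH", ""), ':'; keepempty = false)
    return (
        cuda_on_path = filter(p -> occursin("cuda", lowercase(p)), path_entries),
        cuda_on_ld_path = filter(p -> occursin("cuda", lowercase(p)), ld_entries),
        # Empty string when the driver library cannot be opened by the loader.
        libcuda = Libdl.find_library(["libcuda", "libcuda.so.1"]),
    )
end

report = cuda_path_report()
println("CUDA entries on PATH:            ", report.cuda_on_path)
println("CUDA entries on LD_LIBRARY_PATH: ", report.cuda_on_ld_path)
println("Driver library found by loader:  ", repr(report.libcuda))
```

Running this once before suspending and once after waking the machine (in a freshly started Julia) should show whether the environment itself changed. If the output is identical in both cases, the path explanation is unlikely and the driver state is the more probable cause.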
Describe the bug
CUDA.jl initialisation fails after suspending Ubuntu 20.04 with CUDA 11.2
Additional context
You will see an unrelated error, which is described in #603 and fixed by #604.
To reproduce
The Minimal Working Example (MWE) for this bug:
Launch Juno in Atom
To suspend the machine:
Power Off / Log Out
Suspend
Now wake up the machine. The existing Julia session stops working with CUDA.jl. After restarting Atom/Juno, or just Julia in a terminal, Julia gives
ERROR: CUDA.jl did not successfully initialize, and is not usable.
when running, for example, cu(rand(2)).
Manifest.toml
Version info
Details on Julia:
Also tried with the current stable 1.5 version.
Details on CUDA:
Driver Version: 460.27.04 CUDA Version: 11.2