-
Notifications
You must be signed in to change notification settings - Fork 90
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CUDA error: CUBLAS_STATUS_EXECUTION_FAILED #14
Comments
The same problem occurs in notebooks/latent_ops.ipynb in the cell "Addition" operation. z_list = []
for i in range(500):
tensors, fillings = dataset._load_tensor(dataset.random_id())
t_sep = tensors[0]
t_sep_rm, fillings_rm = t_sep[:-1], fillings[:-1]
if len(t_sep) >= 2:
z1 = encode(dataset.get_data(t_sep, fillings))
z2 = encode(dataset.get_data(t_sep_rm, fillings_rm))
z_list.append(z2 - z1)
z_rmv = torch.cat(z_list).mean(dim=0, keepdims=True) |
Hi @pwichmann , I'm doing a fresh install right now to check if I get the same issue. Otherwise, will try to investigate the error message you are receiving. |
Thanks, @alexandre01 One difference is the CUDA version. But I could not determine if this can be the cause. |
Okay, installed everything with the following configuration:
and worked without error... Trying to check your error message now |
Hmm.. Yeah, googling didn't help me much neither. I suspect this might be a problem with PyTorch 1.4 not supporting your exact CUDA build. It's tricky to give any advice here cause I always just install CUDA using Ubuntu's Software & Updates (additional drivers) tool. I don't have any RTX card neither.. |
I was able to solve the problem. It was NOT caused by your code but - indeed - by the PyTorch version or CUDA version. pip install torch==1.7.0+cu110 torchvision==0.8.1+cu110 torchaudio===0.7.0 -f https://download.pytorch.org/whl/torch_stable.html |
Thank you, @alexandre01! |
Wow great, I'm glad you managed to have it working! |
Hi Alexandre,
Many thanks for the paper and the code. Excellent work!
I am getting a CUDA error for which I have no immediate explanation. Maybe you have an idea (also in case of other users experiencing the same issue). Googling the error message resulted in some hits but none that I could work with.
Where does error occur?
Environment
Error message
Immediate code context:
Error message:
The text was updated successfully, but these errors were encountered: