[Bug]: Compile with TORCH_USE_CUDA_DSA to enable device-side assertions. #8965
Comments
It may be related to "Mixed Precision"; with bf16 this does not happen to me. [+] xformers version 0.0.18 installed. (xformers is not necessary with torch 2.0.) python: 3.10.6 • torch: 2.0.0+cu118 • xformers: N/A • gradio: 3.23.0 • commit: • checkpoint: e6415c4892 |
Changing the torch version from 2.0.0+cu118 to 2.1.0.dev20230501+cu117 works for me, but I have no idea why. |
Thank you, your method solved my problem! |
Why does executing this command install torch repeatedly, multiple times? |
Forgive me, I'm a newb. Where do I put that command? (I'm on Windows) |
Looks like it's installing every version of cu117 contained in the nightly folder. There are subtle differences in the filenames. |
When I run this I get this error: Looking in indexes: https://download.pytorch.org/whl/nightly/cu117 |
Try running pip install torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2; that worked for me! |
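To confirm that the versions pinned in the command above actually ended up in the active environment, the installed package metadata can be queried. This is only a sketch: the PINS table and the helper names installed_release and check_pins are made up for illustration, and the local build tag (the part after "+", such as cu118) is ignored when comparing.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions taken from the pip command in the comment above.
PINS = {"torch": "2.0.1", "torchvision": "0.15.2", "torchaudio": "2.0.2"}


def installed_release(package):
    """Return the installed version without its local build tag, or None."""
    try:
        # "2.0.1+cu118" -> "2.0.1"; the part after "+" is the build tag.
        return version(package).split("+", 1)[0]
    except PackageNotFoundError:
        return None


def check_pins(pins):
    """Map each package name to a (wanted, installed, matches) tuple."""
    report = {}
    for package, wanted in pins.items():
        found = installed_release(package)
        report[package] = (wanted, found, found == wanted)
    return report


if __name__ == "__main__":
    for package, (wanted, found, ok) in check_pins(PINS).items():
        status = "OK" if ok else "MISMATCH"
        print(f"{package}: wanted {wanted}, found {found}, {status}")
```

Run it with the same Python interpreter that launches the web UI (for example the one inside its virtual environment); otherwise it reports on a different environment.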
Late comment, but I think my GPU is failing. I underclocked the memory and GPU clock and don't get the error anymore. |
Is there an existing issue for this?
What happened?
return torch.cuda.cudart().cudaMemGetInfo(device)
RuntimeError: CUDA error: the launch timed out and was terminated
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
Why does it stop at this screen and not keep running?
How can I resolve this so that it continues with the next task? It behaves just like a crash.
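On letting the next task continue: the CUDA runtime documentation describes a launch timeout as leaving the process in an inconsistent state, so catching the RuntimeError inside the same Python process is unlikely to help. One possible workaround, sketched here under the assumption that each task can be started as its own command, is to run every task in a separate process and move on when one fails. The function name run_tasks and the stand-in commands are hypothetical and not part of the web UI.

```python
import subprocess
import sys


def run_tasks(commands, timeout=None):
    """Run each command in its own process; return (command, returncode) pairs.

    A non-zero exit code or a timeout in one task does not stop the others.
    """
    results = []
    for command in commands:
        try:
            completed = subprocess.run(command, timeout=timeout)
            code = completed.returncode
        except subprocess.TimeoutExpired:
            code = None  # the child was killed after exceeding `timeout`
        results.append((command, code))
    return results


if __name__ == "__main__":
    # Stand-in tasks: the second one fails the way a crashed job would.
    tasks = [
        [sys.executable, "-c", "print('task 1 ok')"],
        [sys.executable, "-c", "raise RuntimeError('CUDA error (simulated)')"],
        [sys.executable, "-c", "print('task 3 ok')"],
    ]
    for command, code in run_tasks(tasks):
        print(code, command[-1])
```

Because every task gets a fresh process, a failed task cannot leave a broken CUDA context behind for the next one. The cost is that the model has to be loaded again for each task.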
Steps to reproduce the problem
python: 3.10.10 • torch: 2.0.0+cu118 • xformers: 0.0.17rc482 • gradio: 3.16.2
return torch.cuda.cudart().cudaMemGetInfo(device)
RuntimeError: CUDA error: the launch timed out and was terminated
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
Why does it stop at this screen and not keep running?
How can I resolve this so that it continues with the next task? It behaves just like a crash.
What should have happened?
return torch.cuda.cudart().cudaMemGetInfo(device)
RuntimeError: CUDA error: the launch timed out and was terminated
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
Why does it stop at this screen and not keep running?
How can I resolve this so that it continues with the next task? It behaves just like a crash.
Commit where the problem happens
general timing
What platforms do you use to access the UI ?
No response
What browsers do you use to access the UI ?
No response
Command Line Arguments
List of extensions
no
Console logs
Additional information
No response