-
Notifications
You must be signed in to change notification settings - Fork 147
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
_run_ninja_build failed on pytorch 1.7 but success on pytorch1.4 #29
Comments
Hi, Installing with pip rather than conda should not be an issue. There might be a problem of mismatched versions or ninja failing to find your CUDA library path. Could you please provide me with the full error stack? |
Thanks for your help, the traceback is as follows (process.poll() in subprocess.py equals to 1 on torch1.7 but 0 on torch 1.4 )😀: Traceback (most recent call last): The above exception was the direct cause of the following exception: Traceback (most recent call last): |
As far as I can see (e.g. from here and here), this seems to be a problem with specific CUDA 10.1 versions. Things I'd try, depending on how much you're willing to mess with your CUDA installation or whether you'd rather solve it with code:
|
Thanks for your suggestions! I will try it one by one 😀. Once resolved, I will push the report here. |
Problem solved after updating the cuda 10.1 to 10.1.168. It works😀, thanks for your help! |
Happy to help! Closing as resolved. Feel free to open a new issue if you need additional help. |
Hi, thanks for your excellent work! It is really instructive! however, I met a strange compiling problem while training the network. It reminds "subprocess.CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1 while building extension 'fused'. This only happens on pytorch 1.7.1 and nothing happens using torch 1.4 (but clip models seem require torch1.7). I have got stuck with this for many days and still cannot find the solution 😀. Any suggestion for this? BTW, the torch 1.7 is installed using pip rather than coda, does this matter? Many thanks for the help 🤪. The environment I use is as listed:
ubuntu 20.14 pytorch 1.7.1 torchvision 0.8.2 torchaudio 0.7.2 CUDA 10.1 ninja 1.8.2
The text was updated successfully, but these errors were encountered: