Error when setting up Docker image of 3DUnet with GPU support #24
Comments
Edit: Adding some more context. I have an NVIDIA TITAN X Pascal GPU with the necessary NVIDIA/CUDA/cuDNN drivers set up on a Win10 PC. |
@esgomezm |
Does this fail? docker run --rm --gpus all nvidia/cuda:11.8.0-base-ubuntu22.04 nvidia-smi
|
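To narrow down where GPU access breaks, the check above can be split into a host step and a container step. This is only a minimal sketch: the function name `check_gpu` is made up for illustration, and it assumes it runs in a shell where both `nvidia-smi` and `docker` are expected on the PATH (on Windows that would typically be a WSL shell). The image tag is the one from the command above.

```shell
#!/bin/sh
# Image tag taken from the test command suggested in this thread.
IMAGE="nvidia/cuda:11.8.0-base-ubuntu22.04"

# check_gpu (hypothetical helper): report which step fails first.
check_gpu() {
    # Step 1: is the driver tool visible on the host side at all?
    if ! command -v nvidia-smi >/dev/null 2>&1; then
        echo "host: nvidia-smi not found (driver not visible in this shell)"
        return 1
    fi
    # Step 2: is the docker CLI available?
    if ! command -v docker >/dev/null 2>&1; then
        echo "host: docker not found"
        return 1
    fi
    # Step 3: can a plain CUDA base container see the GPU?
    if docker run --rm --gpus all "$IMAGE" nvidia-smi >/dev/null 2>&1; then
        echo "container: GPU visible"
    else
        echo "container: GPU test failed"
        return 1
    fi
}
```

Calling `check_gpu` prints a single line naming the first step that failed. If the last step passes, the base image and the container runtime are fine, and the problem is more likely in what the project's own Dockerfile adds on top.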
Are you running as admin/sudo?
|
I also tried in admin/sudo mode |
NVIDIA/nvidia-container-toolkit#289 Maybe this is it? @IvanHCenalmor |
If this is the fix specifically on Windows WSL, it'd have to be upstreamed as an actual bug fix. |
Otherwise, if you're feeling brave: https://github.com/HenriquesLab/DL4MicEverywhere/blob/main/Dockerfile You'd have to add the code at (I'm guessing) line 3 of this file. |
Thanks. I will have to wait as I am not sure how to implement the fix. |
Hi @Eddymorphling and @jinxsfe , @ctr26 and @Eddymorphling , thanks a lot for providing those links and feedback 🙏 ❤️ In both issues (the one given by @Eddymorphling here and the one given by @ctr26 here or here ) they mention that the problem is related to some 'ghost' files that are automatically injected during containerization (maybe because of using the default nvidia image mentioned here). The solution may be to remove those files inside the container. Actually the last part of the error that you are getting is I will try to add these lines in the Dockerfile from the proposal and check if the issue can be solved with this or a similar command (one that does not need to remove so many files):
Also they mention here that the installation of In any case, I will try it as soon as I can and I will share the results that I get with you. Again thanks a lot for the feedback! 🫶 |
Okay, so still not fixed, but I want to share the updates that I have. I tried to remove the files in I checked a lighter version of the Dockerfile, with just the build from the nvidia image and some minor installations, to check at what point these files appear and whether they could be removed in this lighter Dockerfile. With this new Dockerfile, there was no problem either when building it or when running it. After checking, the files were inside the container, so I assume that those files are not the ones causing the error. Additionally, I have not been able to remove them, so I decided to change strategy. I continued from this lighter Dockerfile, adding back step by step all the lines that I had commented out to check which one created the problem, and apparently the error comes from installing `nvidia-cuda-toolkit` inside the Dockerfile in this line. This error does not happen when building the image (it does get built) but when running it. Therefore, I tried to add (in the actual Dockerfile with all the code)
To check if everything else was correctly installed, I used the Docker image from PyTorch to see if the GPUs were working, and it works without any problem. So the conclusion until now is that the problem is with the Now that I know that the problem is with that library and not with the file itself, I think I have an idea of what to change to fix this problem. I think that the origin of this problem might be that the paths to that library are not correctly configured or linked, so what I will try now is to check whether the problem is with the PATHs in the Docker image or on the local machine. I will keep you informed of the progress, and if you have any ideas or feedback on the progress so far I will be glad to read it 😄 |
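For anyone who wants to repeat the PyTorch check described above, here is a rough sketch. The thread does not say which image or tag was used, so `pytorch/pytorch:latest` is an assumption, and `pytorch_gpu_check`, `PYTORCH_IMAGE` and `DRY_RUN` are names made up for this example.

```shell
#!/bin/sh
# Assumption: the official pytorch/pytorch image; the tag used in the thread is unknown.
PYTORCH_IMAGE="${PYTORCH_IMAGE:-pytorch/pytorch:latest}"

# pytorch_gpu_check (hypothetical helper): ask PyTorch inside the container
# whether it can see a CUDA device. Prints True or False when actually run.
pytorch_gpu_check() {
    # With DRY_RUN=1, only print the command instead of running it.
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "docker run --rm --gpus all $PYTORCH_IMAGE python -c 'import torch; print(torch.cuda.is_available())'"
        return 0
    fi
    docker run --rm --gpus all "$PYTORCH_IMAGE" \
        python -c 'import torch; print(torch.cuda.is_available())'
}
```

If this prints True while the project's image still fails at run time, that would be consistent with the observation above that the host and Docker GPU setup are fine and the problem comes from something installed inside the project's Dockerfile.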
If we use conda to install the cudatoolkit then you won't need to use apt-get to install it. |
Is this only testable on windows? |
Yeaaah, on the Linux machine it works perfectly. It is on Windows that it gives problems with |
Here I attach the output that I got in a terminal run inside the Docker container that was built with |
Hey @Eddymorphling We have considerably updated the tool and this issue should be solved. Could you give it a try please? |
Hi @Eddymorphling , We haven't heard back from you in a while, so we’re going to close this issue for now. If you encounter any further issues or have any questions, please feel free to reopen this issue or create a new one. We’re here to help! Thanks for your understanding, |
Hi,
Just tried running the Docker GUI (followed all instructions) to run the 3DUNET notebook with GPU support and ended up with the following error when building the Docker image. Any clue what could be wrong? Thank you.