Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pytorch-CUDA version compatibility problem in spyglass-position environment #710

Open
sytseng opened this issue Dec 15, 2023 · 2 comments
Open
Assignees
Labels
bug Something isn't working position

Comments

@sytseng
Copy link

sytseng commented Dec 15, 2023

The current Frank lab GPU servers use CUDA 11.6, but the current environment_position.yml specifies the following dependencies:

  • pytorch<1.12.0
  • torchvision
  • torchaudio
  • cudatoolkit=11.3

which leads to a pytorch version 1.7.1.post2 which does not recognize any GPUs on the lab server due to cuda incompatibility (probably because the cudatoolkit is specified to version 11.3)

Bug behavior:
torch.cuda.is_available() returns False

torch.cuda.current_device() returns the following error AssertionError: Torch not compiled with CUDA enabled

I have not started using the DLC pipeline so I don't know the impact of this issue. Other people seem to be using the GPUs on lab server without any issues currently, but in the future there may be a need to update the environment_position.yml or make notes about installing the correct pytorch version.

@edeno edeno added the position label Dec 21, 2023
@edeno edeno added the bug Something isn't working label Jan 20, 2024
@samuelbray32
Copy link
Collaborator

Tried building without pinning cudatoolkit and this persists

@edeno
Copy link
Collaborator

edeno commented Feb 20, 2024

What version of cudatoolkit did it end up using?

And just so I understand, this is on zephyr and breeze, but you've tested on your local machine and everything has been fine?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working position
Projects
None yet
Development

No branches or pull requests

3 participants