Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

GCP Base Image Wrong CUDA Version #19263

Open
PetrochukM opened this issue Apr 15, 2019 · 6 comments
Open

GCP Base Image Wrong CUDA Version #19263

PetrochukM opened this issue Apr 15, 2019 · 6 comments
Assignees
Labels
module: doc infra Related to pytorch.org/docs, deployment of, and serving triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module

Comments

@PetrochukM
Copy link

馃悰 Bug

Working through the suggested PyTorch GCP base image, the wrong version of CUDA was installed.

To Reproduce

Steps to reproduce the behavior:

  1. On "New Deep Learning VM deployment" select "Intel Optimized Base 9.2"
  2. Deploy and SSH into the machine
  3. nvidia-smi shows CUDA Version is at 10.0

Expected behavior

The CUDA Version is correct after setting up a GCP machine.

@PetrochukM
Copy link
Author

Let me know if this is the wrong form to report the bug...

@colesbury
Copy link
Member

@PetrochukM is this an issue with Google Cloud Platform? Or an issue with PyTorch documentation?

Where is the "suggested PyTorch GCP base image"?

@PetrochukM
Copy link
Author

The issue is with GCP; however, it's a solution suggested on https://pytorch.org/ under "CLOUD PARTNERS".

@colesbury colesbury added module: doc infra Related to pytorch.org/docs, deployment of, and serving and removed module: doc infra Related to pytorch.org/docs, deployment of, and serving labels Apr 15, 2019
@colesbury
Copy link
Member

@soumith can you route this to a contact at GCP?

@colesbury colesbury added module: doc infra Related to pytorch.org/docs, deployment of, and serving triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module labels Apr 15, 2019
@soumith
Copy link
Member

soumith commented Apr 17, 2019

@b0noI this seems like the wrong binary got installed. can you take a look?

@mikehcheng
Copy link

I work with @b0noI, the image is working as intended, but the user is likely noticing the behavior where nvidia-smi shows the driver version instead of the CUDA version. See https://stackoverflow.com/questions/53422407/different-cuda-versions-shown-by-nvcc-and-nvidia-smi for a more detailed explanation.

That being said, for PyTorch we recommend selecting the framework "PyTorch 1.0 + fast.ai 1.0 (CUDA 10.0)". The other images do not have PyTorch installed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
module: doc infra Related to pytorch.org/docs, deployment of, and serving triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module
Projects
None yet
Development

No branches or pull requests

4 participants