Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

resolved cuda and pytorch versions in rugpt3xl_generation notebook #65

Closed
wants to merge 1 commit into from
Closed

resolved cuda and pytorch versions in rugpt3xl_generation notebook #65

wants to merge 1 commit into from

Conversation

amrzv
Copy link

@amrzv amrzv commented Jun 27, 2021

Hi!
Tried to run rugpt3xl_generation notebook and got this error #49
Solved by specifying pytorch and cuda versions to be installed that was mentioned #60:

pip install torch==1.7.0+cu110 -f https://download.pytorch.org/whl/torch_stable.html
export CUDA_HOME=/usr/local/cuda-11.0

Also, this would be resolved #62

Tested in colab, notebook works as expected.

Copy link

@Marcus-Arcadius Marcus-Arcadius left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank You! This ended up working for me 😍

@AlexanderKozhevin
Copy link

@amrzv
Sorry can you give some reference on how to run rugpt3xl_generation

I'm getting and error

    RuntimeError: Cuda extensions are being compiled with a version of Cuda that does not match the version used to compile Pytorch binaries.  Pytorch binaries were compiled with Cuda 11.0.

@amrzv
Copy link
Author

amrzv commented Jul 12, 2021

@amrzv
Sorry can you give some reference on how to run rugpt3xl_generation

I'm getting and error

    RuntimeError: Cuda extensions are being compiled with a version of Cuda that does not match the version used to compile Pytorch binaries.  Pytorch binaries were compiled with Cuda 11.0.

Hi.
Please, see this comment.

@Artyrm
Copy link

Artyrm commented Jul 28, 2021

Unfortunately, this step: !pip install deepspeed==0.3.7 overwrites torch version:

Requirement already satisfied: torch>=1.2 in /usr/local/lib/python3.7/dist-packages (from deepspeed==0.3.7) (1.7.0+cu110)
...
Collecting torch>=1.2
  Downloading torch-1.9.0-cp37-cp37m-manylinux1_x86_64.whl (831.4 MB)
     |████████████████████████████████| 831.4 MB 2.6 kB/s 
Installing collected packages: torch
  Attempting uninstall: torch
    Found existing installation: torch 1.7.0+cu110
    Uninstalling torch-1.7.0+cu110:
      Successfully uninstalled torch-1.7.0+cu110
Successfully installed torch-1.9.0

UPD guess we need to run it like this:

!pip install --no-dependencies --no-cache-dir deepspeed==0.3.7

@amrzv
Copy link
Author

amrzv commented Aug 15, 2021

Seems that notebook is fixed in 3b0b84b

@amrzv amrzv closed this Aug 15, 2021
@amrzv amrzv deleted the fix-generation-notebook branch August 9, 2022 18:21
@ai-forever ai-forever deleted a comment Sep 2, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

RuntimeError: CUDA: Error- invalid ptx
4 participants