Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Docker Image Does Not Work #9

Closed
avaziri opened this issue Jul 16, 2018 · 2 comments
Closed

Docker Image Does Not Work #9

avaziri opened this issue Jul 16, 2018 · 2 comments

Comments

@avaziri
Copy link

avaziri commented Jul 16, 2018

I have attempted to run the TD3 example script from the rlkit-gpu Docker image with no success. I had to modify the TD3 example script slightly because I dont have a Mujoco license, so it instead runs MountainCarContinuous-v0. It runs just fine on my local machine from RLKit source, but when I try to run from within the docker container I get the following error:

THCudaCheck FAIL file=/pytorch/torch/lib/THC/THCGeneral.c line=70 error=30 : unknown error
Traceback (most recent call last):
  File "examples/td3.py", line 111, in <module>
    experiment(variant)
  File "examples/td3.py", line 84, in experiment
    algorithm.cuda()
  File "/rlkit/rlkit/torch/torch_rl_algorithm.py", line 37, in cuda
    net.cuda()
  File "/env/lib/python3.5/site-packages/torch/nn/modules/module.py", line 216, in cuda
    return self._apply(lambda t: t.cuda(device))
  File "/env/lib/python3.5/site-packages/torch/nn/modules/module.py", line 146, in _apply
    module._apply(fn)
  File "/env/lib/python3.5/site-packages/torch/nn/modules/module.py", line 152, in _apply
    param.data = fn(param.data)
  File "/env/lib/python3.5/site-packages/torch/nn/modules/module.py", line 216, in <lambda>
    return self._apply(lambda t: t.cuda(device))
  File "/env/lib/python3.5/site-packages/torch/_utils.py", line 69, in _cuda
    return new_type(self.size()).copy_(self, async)
  File "/env/lib/python3.5/site-packages/torch/cuda/__init__.py", line 358, in _lazy_new
    _lazy_init()
  File "/env/lib/python3.5/site-packages/torch/cuda/__init__.py", line 121, in _lazy_init
    torch._C._cuda_init()
RuntimeError: cuda runtime error (30) : unknown error at /pytorch/torch/lib/THC/THCGeneral.c:70

Can you confirm that the td3 example runs on the docker container without issue for you?

@avaziri avaziri changed the title Docker Image Does not Work Docker Image Does Not Work Jul 17, 2018
@avaziri
Copy link
Author

avaziri commented Jul 19, 2018

I was able to work through a few errors and get it to work. One thing worth noting in the directions would be that the user should install nvidia-docker.

@vitchyr
Copy link
Collaborator

vitchyr commented Jul 29, 2018

Great, I'm glad you managed to fix this. I'll add a note about nvidia-docker it to the README--thanks!

@vitchyr vitchyr closed this as completed Jul 29, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants