-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question regarding SubprocVectorEnv failure #3
Comments
Hi, @liuzuxin, thanks for your asking! I guess this problem may have something to do with the Cuda version or something. Could you please provide more information about your machine? (The platform, NVIDIA driver version, cuda version). I didn't encounter this problem with Ubuntu 20.04 on A100 and Nvidia driver=515.105.01 and cuda=11.7. |
Thanks for your reply. Sure, mine is Ubuntu 20.04 with Nvidia driver |
Hi zuxin, Thanks for asking. We have also noticed this issue and are investigating it. In the meantime, a quick walkaround will be saving the model offline, then you can start multiple evaluation scripts with a single environment for evaluation. This will definitely increase the GPU memory requirement but can make the evaluation faster. |
Hi, |
I resolve the problem by adding these two lines to venv.py.
|
I encountered the same issue, and this solution works for me. Thank you very much!!! |
May I ask which line of env.py should I add it to? @lihenglin @JamesSand |
Hi, when I try to use the evaluation script on a headless machine (cloud server) with A10G GPU, I occasionally come across the following error:
Sometimes I came across this issue due to insufficient CUDA memory; however, now even with enough memory, I still encounter this problem and have no idea how to solve it.
I can use the evaluation script with
DummyVectorEnv
, but it seems to be too slow.So I am wondering whether you have encountered similar issues? Any hints would be appreciated. Thanks in advance.
The text was updated successfully, but these errors were encountered: