Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

When I ran the Room Rearrangement task experiment, EOFerror appeared #29

Closed
twb1235 opened this issue Dec 24, 2021 · 4 comments
Closed

Comments

@twb1235
Copy link

twb1235 commented Dec 24, 2021

When I run the following command, the error in the screenshot appears。
‘allenact -o rearrange_out -b . baseline_configs/one_phase/one_phase_rgb_resnet_dagger.py ’
The error occurred after the program had been running for some time
There is a GPU on my computer
image
Please add the following information:
OS: Ubuntu 9.3.0-17ubuntu1~20.04
Allenact: 0.4.o
Allenact-plugins: 0.4.0
GPU: NVIDIA Corporation GP102 [GeForce GTX 1080 Ti]

@twb1235
Copy link
Author

twb1235 commented Dec 24, 2021

Hello, I'm sorry to bother you. When I run the code above, I find that all the Thor instances are not running on the GPU. Is this normal? After looking at other people's questions you answered, the above questions may be because all the Thor instances are running on the CPU, resulting in insufficient CPU capacity. Is there any way to solve it。
捕获1

@Lucaweihs
Copy link
Contributor

Hi @twb1235,

Sorry, I seem to have missed this issue as it was posted during the Christmas holidays. The THOR instances should definitely be on the GPU. Is it possible that you haven't started an x-server on your GPU? Doing so is required to be able to run AI2-THOR on the GPU. Assuming you're using a recent version of ai2thor and you have X11 installed (sudo apt install xserver-xorg-core), you can do so by running:

sudo ai2thor-xorg start

This might complain that it can't find the ai2thor-xorg command in which case you can run

which ai2thor-xorg

which should print something like /your/path/to/ai2thor-xorg, you should then copy this path and run

sudo /your/path/to/ai2thor-xorg start

@Lucaweihs
Copy link
Contributor

Closing this for now but please feel free to reopen if you're still having issues.

@twb1235
Copy link
Author

twb1235 commented Mar 21, 2022

Hi @twb1235,

Sorry, I seem to have missed this issue as it was posted during the Christmas holidays. The THOR instances should definitely be on the GPU. Is it possible that you haven't started an x-server on your GPU? Doing so is required to be able to run AI2-THOR on the GPU. Assuming you're using a recent version of ai2thor and you have X11 installed (sudo apt install xserver-xorg-core), you can do so by running:

sudo ai2thor-xorg start

This might complain that it can't find the ai2thor-xorg command in which case you can run

which ai2thor-xorg

which should print something like /your/path/to/ai2thor-xorg, you should then copy this path and run

sudo /your/path/to/ai2thor-xorg start

Hello, thank you for your reply

Recently, when I was doing one-phase training experiment, I still encountered unitycrashexception, I only started two training processes on one GPU.

At present, only Thor instances are running on the GPU, but it always ends after tens of thousands of steps. Can I put the vectorsampledtask: 0, vectorsampledtask: 1 sampling process on the GPU?

The following figure shows the machine parameters in the configuration file
机器参数

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants