Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A problem when running sh cmds/20/motif/predcls/sup/train.sh #2

Closed
tao123322 opened this issue Oct 17, 2021 · 3 comments
Closed

A problem when running sh cmds/20/motif/predcls/sup/train.sh #2

tao123322 opened this issue Oct 17, 2021 · 3 comments

Comments

@tao123322
Copy link

The program gets stuck while running. The following problems occur, and the GPU has been stuck at 100%


Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed.


@waxnkw
Copy link
Collaborator

waxnkw commented Oct 19, 2021

Sorry, I did not meet this problem before. But I guess this might be caused by one of the processes in the distributed training crashes or meets some other problems? So, the program will wait for the predictions of the crashed process and just get stuck here.

@tao123322
Copy link
Author

tao123322 commented Oct 23, 2021 via email

@waxnkw
Copy link
Collaborator

waxnkw commented Oct 25, 2021

The generated data is the same with the original repo. You can take a try to find details from the original repo.
Generally speaking:
visual_info.json is gt and predicated bounding boxes.
result_dict.pytorch is different evaluation metric value of each image.
eval_results.pytorch is gts and predictions saved in BoxList object. You can use boxlist.fields() to view possible attributes like pred_labels for predicted boxlist, and use boxlist.get_field("pred_labels") to get specific attributes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants