Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

about SOTR-RT-736 #5

Closed
Anglechina opened this issue Sep 16, 2021 · 3 comments
Closed

about SOTR-RT-736 #5

Anglechina opened this issue Sep 16, 2021 · 3 comments

Comments

@Anglechina
Copy link

Anglechina commented Sep 16, 2021

hi, i can't find the model about SOTR-RT-736 to test pic at high FPS, when i am reshowing your work, can you give me some help?

  1. can you provide the model about SOTR-RT-736?
  2. can you tell me some details about test the model ?
    thank you very much!
@easton-cau
Copy link
Owner

SOTR mainly focus on boosting the accuracy of instance segmentation. When it comes to real-time models, SOTR-RT has lower accuracy and speed compared with other state-of-the-art instance segmentation methods. So we recommend you choose other CNN methods if your task is a real-time task. If you want to test the SOTR-RT model, you can reduce the number of transformer layers to two and the input shorter side to 736, and then train the model.

@Anglechina
Copy link
Author

SOTR mainly focus on boosting the accuracy of instance segmentation. When it comes to real-time models, SOTR-RT has lower accuracy and speed compared with other state-of-the-art instance segmentation methods. So we recommend you choose other CNN methods if your task is a real-time task. If you want to test the SOTR-RT model, you can reduce the number of transformer layers to two and the input shorter side to 736, and then train the model.

when i test the SOTR_R101 or SOTR_R101DCN model, the cost time is about 0.2~0.3s using GPU3080, is that correct comparing with your work?

@easton-cau
Copy link
Owner

easton-cau commented Sep 16, 2021

We test the SOTR_R101 model on COCO test-dev, and the cost time is about 0.123 s/per using a single Tesla V100 GPU.

Please check the following reasons:

  1. Image resolution.
  2. Function.
    demo.py: demo.run_on_image(img)
    predictor.py: visualizer.draw_instance_predictions(predictions=instances) # need lots of time when drawing predicted pictures. We chose to directly generate json file instead of pictures.
  3. GPU.
    Tesla V100 vs. GTX 3080

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants