very slow #3

Closed
lucasjinreal opened this issue Apr 19, 2019 · 7 comments

Comments

@lucasjinreal

image

Inference takes 0.4 s per frame on video.

@heilaw
Contributor

heilaw commented Apr 19, 2019

Which detector were you using? Can you provide more details about your hardware configuration and the images that you were using?

@lucasjinreal
Author

@heilaw I am using the CornerNet_Saccade model, and my PC has a GTX 1080 Ti.

I tested directly in demo.py, changing it to load frames from a video. (I don't know whether CUDA was enabled or not.)

But even on CPU, 400 ms is still very slow. YOLOv3 can reach 30-40 ms on CUDA and 100 ms on CPU.
Why is CornerNet so slow anyway?
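
Since it is unclear here whether CUDA was in use, one way to get a comparable number is to time inference per frame with the first few warm-up calls excluded. The following is a minimal sketch, not code from this repository: `time_per_frame`, `infer`, and `sync` are hypothetical names, and `infer` stands in for whatever call runs the detector on one frame.

```python
import time


def time_per_frame(infer, frames, warmup=5, sync=None):
    """Return the mean latency in milliseconds of infer(frame).

    infer  -- hypothetical callable that runs the detector on one frame
    frames -- iterable of frames (any objects that infer accepts)
    warmup -- number of leading frames that are run but not timed
    sync   -- optional zero-argument callable that blocks until pending
              device work has finished (relevant for asynchronous GPU work)
    """
    frames = list(frames)
    if len(frames) <= warmup:
        raise ValueError("need more frames than warm-up iterations")

    # Warm-up: the first calls often include one-time setup costs.
    for frame in frames[:warmup]:
        infer(frame)
    if sync is not None:
        sync()

    timed = frames[warmup:]
    start = time.perf_counter()
    for frame in timed:
        infer(frame)
    if sync is not None:
        sync()  # wait for queued device work before stopping the clock
    elapsed = time.perf_counter() - start

    return 1000.0 * elapsed / len(timed)
```

With PyTorch, `torch.cuda.is_available()` reports whether a CUDA device can be used at all, and passing `sync=torch.cuda.synchronize` keeps asynchronous kernel launches from making the measured time look shorter than it is. For scale, a mean of 400 ms is about 2.5 frames per second, 190 ms is about 5.3, and 30 ms is about 33.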

@lucasjinreal
Author

@heilaw The video I am using is from MOT, with a frame size of about 640x480.

@heilaw
Contributor

heilaw commented Apr 20, 2019

Thanks for the information.

On our machine also with a 1080Ti, the average inference time of CornerNet-Saccade is 190ms on COCO. I tested CornerNet-Saccade on the video you were using and obtained inference time similar to that on COCO.

I think I need more information so that I can further look into this issue. Are you using Anaconda Python? What is the CUDA version on your machine? It would also be great if you can share your script. Have a nice weekend.
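
The details requested above (Python distribution, CUDA version) can be gathered in one go. This is a rough sketch under stated assumptions: `environment_report` is a hypothetical helper, the `conda-meta` directory check is only a heuristic for detecting a conda environment, and the PyTorch fields are filled in only if `torch` is installed.

```python
import os
import platform
import sys


def environment_report():
    """Collect basic environment details as a dict of strings."""
    info = {
        "python": platform.python_version(),
        "platform": platform.platform(),
        # Heuristic: conda environments contain a conda-meta directory.
        "conda_env": str(os.path.isdir(os.path.join(sys.prefix, "conda-meta"))),
    }
    try:
        import torch  # optional; only reported when installed
    except ImportError:
        info["torch"] = "not installed"
        return info

    info["torch"] = str(torch.__version__)
    info["cuda_available"] = str(torch.cuda.is_available())
    # torch.version.cuda is None for CPU-only builds.
    info["cuda_build"] = str(torch.version.cuda)
    return info


if __name__ == "__main__":
    for key, value in environment_report().items():
        print(f"{key}: {value}")
```

Pasting the printed lines into the issue alongside the modified script covers most of what is asked for here.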

@lucasjinreal
Author

@heilaw I will post it tomorrow. But 190 ms is still very slow compared with YOLOv3.

@heilaw
Contributor

heilaw commented Apr 21, 2019

If you want a real-time detector, you may want to check out CornerNet-Squeeze instead of CornerNet-Saccade. The average inference time of CornerNet-Squeeze on COCO is 30ms on our machine with a 1080Ti and an Intel Core i7-7700K.

@lucasjinreal
Author

@heilaw After testing, CornerNet-Squeeze achieves a promising speed, but it produces some false detections when running inference on a highway traffic video.
