Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

yolox_ros_cpp inference speed up. #24

Merged
merged 3 commits into from
Jun 14, 2022
Merged

Conversation

fateshelled
Copy link
Contributor

OpenMP was used in the post-processing, but it was slowing down the process.

Changed to not use OpenMP and some fixed.

@Ar-Ray-code
Copy link
Owner

Ar-Ray-code commented Jun 13, 2022

I'm also planning to check, how much speed improvement have you seen?

@Ar-Ray-code Ar-Ray-code added the enhancement New feature or request label Jun 13, 2022
@fateshelled
Copy link
Contributor Author

In my desktop environment ( OpenVINO ( CPU, intel core i5 11400F ) + yolox_nano ), inference total speed ( preprocess + inference + postprocess ) increased by about 1.5~2 times.

TensorRT ( Jetson Nano 4GB ) + yolox_nano did not change speed much.

@Ar-Ray-code
Copy link
Owner

Ar-Ray-code commented Jun 14, 2022

For my environment, the FPS stability of TensorRT inference was better after the improvement; OpenVINO could not tell the difference.

Before: 33-125fps
After: 83-100fps

(CPU: Core i9 12900KF GPU: RTX 2080Ti)

@fateshelled
Copy link
Contributor Author

I did some more tests. All tested yolox_nano model.

  • Core i7 8550U

    • OpenVINO CPU: Before 17-22FPS, After 20FPS
    • OpenVINO GPU: Before 20-40FPS, After 50-60FPS
  • Core i5 11400F + RTX3060

    • OpenVINO CPU: Before 30-120FPS, After 60-160FPS
    • TensorRT: Before 50-140 FPS, After 70-250FPS

Inference speed and its stability also seem to have improved.

@Ar-Ray-code
Copy link
Owner

Thank you for your PR 👍 .

@Ar-Ray-code Ar-Ray-code merged commit 255eb6c into Ar-Ray-code:main Jun 14, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants