Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Low FPS on jetson type devices #24

Open
MsWik opened this issue Jun 9, 2021 · 5 comments
Open

Low FPS on jetson type devices #24

MsWik opened this issue Jun 9, 2021 · 5 comments

Comments

@MsWik
Copy link

MsWik commented Jun 9, 2021

Hello. Thanks for your work.
When testing yolor-ssss-dwt 640 on devices like jetson Xavier NX, an unsatisfactory result was obtained in terms of performance (about 30 frames per second), yolo4-tiny + tkDNN FP16 640 * 640 ~ 100 fps. Are there ways to speed up the output for end devices?
At 2070S ~ 100 FPS

@WongKinYiu
Copy link
Owner

I think yolor-ssss-dwt + tkDNN wont't have only 30 fps, since in my experiments yolov4-s is far faster than 30fps on xavier nx.
Do you has the fps results of yolor-ssss-s2d?

@MsWik
Copy link
Author

MsWik commented Jun 9, 2021

Thanks for the answer. No, the result is the same. About 27FPS when processing a file. Tell me what version of the torch you have? I have torch 1.8.0 CUDA: 0 (Xavier, 7765MB) and torchvision 0.9.0. FP16 is included.
I haven't used tkDNN for yolor-ssss-dwt, can you have an example?

@WongKinYiu
Copy link
Owner

I used pure tensorrt without tkdnn.
https://github.com/linghu8812/tensorrt_inference/tree/master/ScaledYOLOv4

@MsWik
Copy link
Author

MsWik commented Jun 10, 2021

Thanks for the answer. I managed to convert the model to onnx, however, I have not yet managed to collect and draw an output through tensorrt. Can you tell us how you made the output in tensorrt?

@MsWik
Copy link
Author

MsWik commented Jun 10, 2021

I ran the model through onnx_tensorrt, but the speed remains the same.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants