Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is it possible to track specified point in the video from the webcam? #71

Open
newforrestgump001 opened this issue Mar 12, 2024 · 8 comments

Comments

@newforrestgump001
Copy link

Is it possible to track the point without putting all the frames as a batch. Thank you a lot.

@nikitakaraevv
Copy link
Contributor

Hi @newforrestgump001, sorry for the late response. Yes, you can check out the online demo.

Is this what you are looking for?

@newforrestgump001
Copy link
Author

@nikitakaraevv Sincere thanks for your response. Suppose a video with 100 frames, First, give 5 frames as input of network and then gives output; and the output updates when more frames given to the network frame by frame. In other words, does it support tracked result updating frame by frame.

@nikitakaraevv
Copy link
Contributor

Hi @newforrestgump001, yes, that's how this demo works, except for the result is currently updated every four frames, not every frame.

@newforrestgump001
Copy link
Author

@nikitakaraevv Thanks a lot, I got it.

@lutianye
Copy link

Hi @nikitakaraevv Could we achieve real-time inference by conducting the inference process every four frames? In other words, would it be feasible to perform inference on frames one to four, and then repeat the process on frames five to eight, and so forth? Additionally, would it be necessary to reinitialize the model for each set of four frames? Lastly, could you please share any examples of C++ deployment code that utilizes TensorRT? Your response would be greatly appreciated.

@nikitakaraevv
Copy link
Contributor

Hi @lutianye, yes, that's exactly how the online demo works: it initialises the model, waits until it has access to the first 8 frames (the sliding window size is 8 frames, but the step is 4 frames), and then processes 4 new frames at a time while always taking only 8 frames as input at every step.
The model is not reinitialised after every step.

Unfortunately, we don't yet have any examples of C++ deployment, but if you let me know what your use case is, we might be able to help you. You can leave your email address or contact me via nikita@robots.ox.ac.uk, and we can discuss it in more detail.

@lutianye
Copy link

Hi @nikitakaraevv Thank you for your prompt response. I understand what you mean. Our objective is to capture images from a camera while simultaneously performing tracking, thus we cannot afford to feed all frames to the model. Instead, we need to capture and input frames in real-time. I am considering developing a C++ demo leveraging TensorRT, however, I am concerned that TensorRT might not support certain operators. Please forgive me for the delay in my response.My email address is ganzhijie@gmail.com.

@dat-nguyenvn
Copy link

Hi, any update for TensorRT? Thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants