Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow tracker to run on more frames automatically #5686

Open
2 tasks done
LucaBallan96 opened this issue Feb 9, 2023 · 8 comments
Open
2 tasks done

Allow tracker to run on more frames automatically #5686

LucaBallan96 opened this issue Feb 9, 2023 · 8 comments
Labels
enhancement New feature or request

Comments

@LucaBallan96
Copy link

LucaBallan96 commented Feb 9, 2023

My actions before raising this issue

Steps to Reproduce (for bugs)

  1. Pull repo and start up local CVAT instance
  2. Set up SiamMask or TransT tracker as serverless function with Nuclio
  3. Load video and annotate first frame

Expected Behaviour

We want a tracker to automatically annotate a number of consecutive frames, once the first one is annotated. Automatic annotation with a detector, for example, works like that.

Current Behaviour

Need to go manually frame by frame to run tracker. That means, click 'next frame' 1 by 1, till the end of the video.
Issue #2949 mentioned this already, but the solution is not clear. Can 'tracker frames' be set with an attribute in function.yaml, or somewhere else?

@dschoerk
Copy link
Contributor

I've been working on the TransT implementation in CVAT. At the time of implementation, the serverless tracking functions were designed to only track one frame at a time. As a result, the UI is sending a web request for each tracked frame and tracking multiple frames would be as efficient as stepping forward manually by holding the 'f' key. A low hanging fruit to improve usability a little bit would be to implement tracking multiple frame when pressing 'v' to step forward x frames at a time. However, this wouldn't improve the efficiency of the tracker.

@SpecLad SpecLad added the enhancement New feature or request label Mar 24, 2023
@laisimiao
Copy link

@dschoerk Thank you for your contribution. But after i build and deploy transt, it gives a error when use AI tools:

Tracker initialization error Error:
 Request failed with status code 500. 
"500 Server Error: Internal Server Error for url: http://host.docker.internal:49153/".

Here are my return:
image

@dschoerk
Copy link
Contributor

@dschoerk Thank you for your contribution. But after i build and deploy transt, it gives a error when use AI tools:

Tracker initialization error Error:
 Request failed with status code 500. 
"500 Server Error: Internal Server Error for url: http://host.docker.internal:49153/".

Here are my return:
image

Check the docker logs.

@leosteets
Copy link

I've been working on the TransT implementation in CVAT. At the time of implementation, the serverless tracking functions were designed to only track one frame at a time. As a result, the UI is sending a web request for each tracked frame and tracking multiple frames would be as efficient as stepping forward manually by holding the 'f' key. A low hanging fruit to improve usability a little bit would be to implement tracking multiple frame when pressing 'v' to step forward x frames at a time. However, this wouldn't improve the efficiency of the tracker.

Hi, thanks for the update there, I was wondering why the 'v' doesn't work like the Tracker with TrackerMIL? When I press 'v' and step 10 frames, the current new frame is not processed with TransT, I don't think it has to process all the 10 frames since the interpolation in my case solves most of the cases.

Do you know how can I make it work?

@dschoerk
Copy link
Contributor

I've been working on the TransT implementation in CVAT. At the time of implementation, the serverless tracking functions were designed to only track one frame at a time. As a result, the UI is sending a web request for each tracked frame and tracking multiple frames would be as efficient as stepping forward manually by holding the 'f' key. A low hanging fruit to improve usability a little bit would be to implement tracking multiple frame when pressing 'v' to step forward x frames at a time. However, this wouldn't improve the efficiency of the tracker.

Hi, thanks for the update there, I was wondering why the 'v' doesn't work like the Tracker with TrackerMIL? When I press 'v' and step 10 frames, the current new frame is not processed with TransT, I don't think it has to process all the 10 frames since the interpolation in my case solves most of the cases.

Do you know how can I make it work?

I believe this is handled in the CVAT UI code, the integration for OpenCV based trackers (like TrackerMIL) is a little different from the Nuclio based AI tools (interactor, tracker, detector).

@hoangdung3498
Copy link

@dschoerk Thank you for your contribution. But after i build and deploy transt, it gives a error when use AI tools:

Tracker initialization error Error:
 Request failed with status code 500. 
"500 Server Error: Internal Server Error for url: http://host.docker.internal:49153/".

Here are my return: image

Do you resolve it ?

@rohit901
Copy link

@dschoerk thank you for the integration. However it seems to take a long time for processing each frame, and also running it frame by frame seems slow. Can you tell me how is this better than the standard tracking with interpolation?

@dschoerk
Copy link
Contributor

@dschoerk thank you for the integration. However it seems to take a long time for processing each frame, and also running it frame by frame seems slow. Can you tell me how is this better than the standard tracking with interpolation?

It's vastly more accurate than interpolation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

7 participants