Recursively writes descriptions of video scenes using Large Language Models and Image Captioners
-
Updated
Mar 12, 2024 - Python
Recursively writes descriptions of video scenes using Large Language Models and Image Captioners
A labeler-app for video labeling, for example for machine learning. For timewindows (|....|) and pointactivities (.|.)
Counts number of objects (instances), their figures, and number of frames that have object of specific class.
Add a description, image, and links to the video-labeling topic page so that developers can more easily learn about it.
To associate your repository with the video-labeling topic, visit your repo's landing page and select "manage topics."