[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Updated Jul 5, 2024 · Python
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
PyTorch Implementation of Consensus-based Sequence Training for Video Captioning
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
A tool for downloading from public image boards (those that allow scraping), previewing your images and tags, and editing them. Additional tabs let you download other code repositories as well as state-of-the-art diffusion and auto-tag/caption models. Custom datasets can be added!
M-VAD Names Dataset. Multimedia Tools and Applications (2019)
Caption generator for live camera feed
Video Search using Natural Language
Generate TikTok- and Instagram-tailored captions and hashtags for your videos using the power of some super creative robots up in the clouds ☁️ 🤖 💬 ☁️
A multilingual automatic speech recognition and video captioning tool using Faster Whisper. Supports real-time translation to English. Runs on a consumer-grade CPU.
Automated Wistia video captioning tool
Some scripts used to convert AD scripts to many formats
A simple SRT generator using Faster Whisper