Video captioning using SCN-LSTM models with S2VT baseline
-
Updated
Mar 15, 2023 - Python
Video captioning using SCN-LSTM models with S2VT baseline
🔍 Shotluck Holmes: A family of small-scale LLVMs for shot-level video understanding
Master Thesis on Multimodal Video Captioning, done at Huawei's Research Center in Amsterdam.
Video Captioning using Scene Change Detection and Image Captioning
AI-based Video summarizer along with captioning.
Event based Sign-Language-Translation
Official code for Global Semantic Descriptors for Zero-Shot Action Recognition (IEEE Signal Processing Letters 2022)
AI based Video summarizer along with captioning.
Second-place solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2022 workshop)
Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
A PyTorch implementation of the paper Thinking Hallucination for Video Captioning.
Implementation of Encoder-Decoder Model for Video Captioning in Tensorflow
Official implementation of state-aware video procedural captioning (ACM MM 2021)
Official code for Tell Me What You See: A Zero-Shot Action Recognition Method Based on Natural Language Descriptions (Multimedia Tools and Applications 2024)
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
A PyTorch implementation of EmpiricalMVM
Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
S2VT with Attention
Add a description, image, and links to the video-captioning topic page so that developers can more easily learn about it.
To associate your repository with the video-captioning topic, visit your repo's landing page and select "manage topics."