Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
-
Updated
Oct 27, 2023 - Jupyter Notebook
Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
Source code for Delving Deeper into the Decoder for Video Captioning
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
Python implementation of extraction of several visual features representations from videos
Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
LSTM RNN and Transformer networks video captioning on MSVD and MSR-VTT using attributes and SVOS
Code implementation & CLI tool for the paper: "Graph Based Temporal Aggregation for Video Retrieval"
Add a description, image, and links to the msr-vtt topic page so that developers can more easily learn about it.
To associate your repository with the msr-vtt topic, visit your repo's landing page and select "manage topics."