Video captioning

Source code for Video Captioning

Requirements

This code requires tensorflow1.1.0. The evaluation code is in Python, and you need to install coco-caption evaluation if you want to evaluate the model.

Download Dataset

MSVD
MSR-VTT

Preprocess data

1. Extract all frames from videos

It needs to extract the frames by using cpu_extract.py. Then use read_certrain_number_frame.py to uniformly sample 5 frames from all frames of a video. At last use the tf_feature_extract.py to extract the inception-resnet-v2 features of frame.

2.Evaluate models

use the *_s2vt.py. Before that, it needs to change the model path of evaluation function and some global parameters in the file. For example,

python tf_s2vt.py --gpu 0 --task evaluate

The MSVD models can be downloaded from here The MSR-VTT models can be downloaded from here

These processes are a little complicated, please feel free to ask me if you have some questions.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
attention_models		attention_models
msr-vtt		msr-vtt
msvd		msvd
pyciderevalcap		pyciderevalcap
pycocoevalcap		pycocoevalcap
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
read-certain-num-frames.py		read-certain-num-frames.py
read-certain-num-frames.py~		read-certain-num-frames.py~

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video captioning

Requirements

Download Dataset

Preprocess data

1. Extract all frames from videos

2.Evaluate models

About

Releases

Packages

Languages

License

adwardlee/video_to_text

Folders and files

Latest commit

History

Repository files navigation

Video captioning

Requirements

Download Dataset

Preprocess data

1. Extract all frames from videos

2.Evaluate models

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages