Convolutional Network In Activity Recognition

This project implement some two-stream convolutional network.

Origin Two-Stream
TSN
DTPP

Data

UCF101

UCF101 contains 101 actions, 13320 video clips.The dataset can be download hereUCF Dataset. About 6.93GB.|

Video -> img

ffmpeg can capture the video's image in one line. Opencv can also do this.

    'ffmpeg -i \"{}\" -vf scale=-1:240 \"{}/image_%05d.jpg\"'.format(video_file_path, dst_directory_path)

details can be find in video_jpg_ucf101_hmdb51.py

img -> flow

FlowNet2.0 is used here to get the flow. Use Docker to finish this part. I use the two job below.
NVIDIA-flownet2-python
lmb

flow -> img

every flo change into two img, u and v.

Transfer Learning

Models

This part include the basebone model in the network.

Four models include here.

bninception
INceptionv4
ResNet
Inceptionv3

caffe to pytorch

caffe to torch

torch to pytorch

spatial_convnet

motion_convnet

fusion

average_fusion and svm_fusion include here.

Reference

These module is based on pytorch.
Pretrained module is based on Cadene
This origin project is based on jerryhuang's project. two-stream-action-recognition

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
checkpoints		checkpoints
data		data
models		models
runs/May03_19-38-38_awiny		runs/May03_19-38-38_awiny
utils		utils
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
all_scalars.json		all_scalars.json
config.py		config.py
config.pyc		config.pyc
main_average_fusion.py		main_average_fusion.py
main_svm_fusion.py		main_svm_fusion.py
opts.py		opts.py
resnet101_motion.py		resnet101_motion.py
resnet101_tsn_spatial.py		resnet101_tsn_spatial.py
wget-log		wget-log

maodong2056/activity_recognition

Folders and files

Latest commit

History

Repository files navigation