CNN LSTM

Implementation of a CNN LSTM with a ResNet backend for video classification.

Getting Started

Prerequisites

PyTorch (ver. 0.4+ required)
FFmpeg, FFprobe
Python 3

Try on your own dataset

mkdir data
mkdir data/video_data

Put your video dataset inside data/video_data. It should have the following structure, with one folder per class:

+ data 
    + video_data    
            - bowling
            - walking
            + running 
                    - running0.avi
                    - running.avi
                    - running1.avi
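
Before generating images, it can help to confirm that the folder layout matches the structure above. The following is a minimal sketch, not part of this repository: the helper name `list_videos_by_class` and the accepted extensions are assumptions made for illustration.

```python
from pathlib import Path

# Assumption: extensions to treat as videos; adjust to your own dataset.
VIDEO_EXTENSIONS = {".avi", ".mp4"}


def list_videos_by_class(root):
    """Map each class folder under `root` to its sorted video file names."""
    root = Path(root)
    classes = {}
    for class_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        videos = sorted(
            f.name
            for f in class_dir.iterdir()
            if f.is_file() and f.suffix.lower() in VIDEO_EXTENSIONS
        )
        classes[class_dir.name] = videos
    return classes


if __name__ == "__main__":
    root = Path("data/video_data")
    if root.is_dir():
        # Print one line per class so empty folders are easy to spot.
        for label, videos in list_videos_by_class(root).items():
            print(f"{label}: {len(videos)} video(s)")
```

A class folder that reports zero videos usually means the files sit one level too deep or use an extension not listed in `VIDEO_EXTENSIONS`.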

Generate Images from the Video dataset

./utils/generate_data.sh
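
The contents of generate_data.sh are not reproduced here. As a rough Python illustration of what frame extraction with FFmpeg typically looks like, the sketch below writes one JPEG per frame into data/image_data, which is the folder the training command reads from. The `image_%05d.jpg` naming pattern, the `<class>/<video name>/` output layout, and both function names are assumptions, not the script's actual behaviour.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_command(video_path, frames_dir):
    """Return the ffmpeg argument list that writes one JPEG per frame."""
    frames_dir = Path(frames_dir)
    # Assumption: image_%05d.jpg is an illustrative naming pattern.
    return ["ffmpeg", "-i", str(video_path), str(frames_dir / "image_%05d.jpg")]


def extract_all(video_root="data/video_data", image_root="data/image_data"):
    """Mirror <class>/<video> under image_root and extract frames for each video."""
    for video in sorted(Path(video_root).glob("*/*.avi")):
        # Assumption: one output folder per video, grouped by class.
        out_dir = Path(image_root) / video.parent.name / video.stem
        out_dir.mkdir(parents=True, exist_ok=True)
        subprocess.run(build_ffmpeg_command(video, out_dir), check=True)
```

Calling `extract_all()` requires FFmpeg on the PATH; `check=True` makes the run stop at the first video FFmpeg cannot decode.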

Train

Once you have created the dataset, start training:

python main.py --use_cuda --gpu 0 --batch_size 8 --n_epochs 100 --num_workers 0  --annotation_path ./data/annotation/ucf101_01.json --video_path ./data/image_data/  --dataset ucf101 --sample_size 150 --lr_rate 1e-4 --n_classes <num_classes>
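
To make the flags in the command above easier to read, here is a hedged sketch of how they could be declared with argparse. The flag names come from the command and from the resume option in the note below; the types, defaults, and help text are assumptions and may differ from what opts.py actually defines.

```python
import argparse


def build_parser():
    """Build a parser for the flags used in the training command (sketch)."""
    parser = argparse.ArgumentParser(description="CNN LSTM training options (sketch)")
    parser.add_argument("--use_cuda", action="store_true", help="train on a GPU")
    parser.add_argument("--gpu", type=int, default=0, help="GPU index")
    parser.add_argument("--batch_size", type=int, default=8)
    parser.add_argument("--n_epochs", type=int, default=100)
    parser.add_argument("--num_workers", type=int, default=0, help="data loader workers")
    parser.add_argument("--annotation_path", type=str, help="annotation JSON file")
    parser.add_argument("--video_path", type=str, help="folder of extracted frames")
    parser.add_argument("--dataset", type=str, default="ucf101")
    parser.add_argument("--sample_size", type=int, default=150)
    parser.add_argument("--lr_rate", type=float, default=1e-4, help="learning rate")
    parser.add_argument("--n_classes", type=int, help="number of class folders")
    parser.add_argument("--resume_path", type=str, default="", help="checkpoint to resume from")
    return parser


if __name__ == "__main__":
    # Parse an abbreviated version of the training command.
    args = build_parser().parse_args(["--use_cuda", "--n_classes", "4"])
    print(args)
```

Note that `--n_classes` should match the number of class folders under data/video_data.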

Note

All weights are saved to the snapshots folder.
To resume training from a checkpoint, pass:

--resume_path <path-to-model>

TensorBoard Visualisation (training on 4 labels from the UCF-101 dataset)

Inference

python inference.py  --annotation_path ./data/annotation/ucf101_01.json  --dataset ucf101 --model cnnlstm --n_classes <num_classes> --resume_path <path-to-model.pth>

License

This project is licensed under the MIT License
