GitHub - Ruiyang-061X/TSN: Temporal Segment Network implemented by PyTorch for action recognition.

TSN

Introduction

TSN is the abstraction of Temporal Segment Network. It can do action recognition. Given a video containing an action, it can recognize the action that is happening. Action recognition is basically video classification. The way TSN does action recognition is very similiar to image classification. First, it picks several segments of the video, which are actually a set of images. Then it does 'image classification' on each of the segment, and averages the results to get the result of the video. Then it can determine the label of the video.

The paper where TSN comes from is Temporal Segment Networks: Towards Good Practices for Deep Action Recognition. The codes in this repository are basically based on yjxiong /tsn-pytorch, which is also the origin code of the paper. yjxiong/temporal-segment-networks is the caffe version code.

Dependency

Ubuntu
PyTorch 1.4.0
Pillow 7.0.0
a GPU

How To Use

The model in this repository is trained on ucf101, so you need to download ucf101 first, you can download it from here. Then you need to prepare the dataset following the instructions in yjxiong/temporal-segment-networks. The two most important things are frames and video lists.

Download this repository.

 git clone --recursive https://github.com/Ruiyang-061X/TSN.git

Using the frames and video lists, you can train the model. Excute the following command to train the model. The trained models are saved in trained_model/
```
 python3 train.py --dataset ucf101 --modality RGB --trainset YOUR_TRAIN_VIDEO_LIST --validationset YOUR_VALIDATION_VIDEO_LIST --base_model BNInception --n_segment 3 --consensus_type avg --dropout 0.8 --epoch 80 --batch_size 4 --lr 0.001 --lr_step 30 60 --clip_gradient 20
```
You can change --modality RGB to --modality RGBDiff or --modality Flow to train on the RGBDiff or optical flow version of the dataset.

Result

The following results are results on ucf101.

name	base_model	modality	accuracy@1	accuracy@5
BNInception_RGB	BNInception	RGB	69.05	84.40

The trained model can be downloaded from BaiduNetdisk, the code is 1rr9.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
tf_model_zoo @ 9788c67		tf_model_zoo @ 9788c67
.gitmodules		.gitmodules
README.md		README.md
dataset.py		dataset.py
train.py		train.py
transform.py		transform.py
tsn.py		tsn.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TSN

Introduction

Dependency

How To Use

Result

About

Releases

Packages

Contributors 2

Languages

Ruiyang-061X/TSN

Folders and files

Latest commit

History

Repository files navigation

TSN

Introduction

Dependency

How To Use

Result

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages