
SCS Framework

Source Code

Usage

  1. Dependencies:

    • Python 2.7
    • Pytorch >= 0.4
    • torchvision
    • Numpy
    • Pillow
    • tqdm
  2. Download Kinetics-400 from the official website or from the copy provided by facebookresearch/video-nonlocal-net, and organize the image files (extracted from the videos) in the same layout as UCF101 and HMDB (a frame-extraction sketch follows the layout below):

    Dataset
    ├── train_frames
    │   ├── action0
    │   │   ├── video0
    |   |   |   ├── frame0
    ├── test_frames
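    This repository does not ship a frame-extraction script; the sketch below is one possible way to produce the layout above with ffmpeg. The input directory Dataset/train_videos and the frame naming pattern are assumptions, not conventions defined by this repository, so adapt them to whatever your dataloader expects.

    # Sketch only: dump JPEG frames into Dataset/train_frames/<action>/<video>/.
    # The source tree Dataset/train_videos/<action>/<video>.mp4 is an assumption.
    import os
    import subprocess

    def extract_frames(video_root, frame_root):
        for action in sorted(os.listdir(video_root)):
            for video in sorted(os.listdir(os.path.join(video_root, action))):
                name = os.path.splitext(video)[0]
                out_dir = os.path.join(frame_root, action, name)
                if not os.path.isdir(out_dir):
                    os.makedirs(out_dir)
                subprocess.check_call([
                    "ffmpeg", "-i", os.path.join(video_root, action, video),
                    "-qscale:v", "2",  # high-quality JPEG output
                    os.path.join(out_dir, "frame%05d.jpg"),
                ])

    extract_frames("Dataset/train_videos", "Dataset/train_frames")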
    
  3. Extract optical flow from the original RGB frames. Note that the stride between the two RGB frames used to compute the optical flow needs to be the same as the stride of the original inputs. The optical flow has only two channels (horizontal and vertical), but we still save it as a JPEG, padding the third channel with zeros. Store the optical flow in train_ofs (an extraction sketch follows the layout below, and a data-pairing sketch appears at the end of this section):

     Dataset
    ├── train_frames
    │   ├── action0
    │   │   ├── video0
    |   |   |   ├── frame0
    ├── train_ofs
    │   ├── action0
    │   │   ├── video0
    |   |   |   ├── frame0
    ├── test_frames
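    The repository does not include an optical-flow extractor either; the sketch below uses OpenCV's Farneback dense flow as one possible implementation. The clipping bound, the rescaling to 0-255, and the stride default are assumptions to be adapted; only the two-channel-plus-zero-padding JPEG convention comes from the step above.

    # Sketch: Farneback dense optical flow for frame pairs at a given stride.
    # The stride must match the frame stride used as input to the model.
    import os
    import cv2
    import numpy as np

    def flow_to_jpg(prev_path, next_path, out_path, bound=20.0):
        prev = cv2.imread(prev_path, cv2.IMREAD_GRAYSCALE)
        nxt = cv2.imread(next_path, cv2.IMREAD_GRAYSCALE)
        flow = cv2.calcOpticalFlowFarneback(prev, nxt, None,
                                            0.5, 3, 15, 3, 5, 1.2, 0)
        flow = np.clip(flow, -bound, bound)                     # limit large displacements
        flow = ((flow + bound) / (2 * bound) * 255.0).astype(np.uint8)
        h, w = flow.shape[:2]
        out = np.zeros((h, w, 3), dtype=np.uint8)
        out[:, :, 0] = flow[:, :, 0]                            # horizontal (h) component
        out[:, :, 1] = flow[:, :, 1]                            # vertical (v) component
        cv2.imwrite(out_path, out)                              # third channel stays 0

    def extract_video_flow(frame_dir, flow_dir, stride=1):
        if not os.path.isdir(flow_dir):
            os.makedirs(flow_dir)
        frames = sorted(os.listdir(frame_dir))
        for i in range(len(frames) - stride):
            flow_to_jpg(os.path.join(frame_dir, frames[i]),
                        os.path.join(frame_dir, frames[i + stride]),
                        os.path.join(flow_dir, frames[i]))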
    
  4. This standalone model includes only the action recognition task:

    a. Run the following command to train.

    # start from scratch
    python main.py --train 
    
    # start from our pre-trained model
    python main.py --model_path [path_to_model] --model_name [model's name] --resume --train
    

    b. Run the following command to test.

    python main.py --test
    
  5. Action recognition results on standalone RNN models:

    | Architecture               | Kinetics | UCF-101 | HMDB-51 |
    | -------------------------- | -------- | ------- | ------- |
    | Shallow LSTM with Backbone | 53.9     | 86.8    | 49.7    |
    | C3D                        | 56.1     | 79.9    | 49.4    |
    | Two-Stream                 | 62.8     | 93.8    | 64.3    |
    | 3D-Fused                   | 62.3     | 91.5    | 66.5    |
    | Deep RBM without Backbone  | 60.2     | 91.9    | 61.7    |
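
For reference, the sketch below shows one way a loader could pair each RGB frame with its optical-flow JPEG given the layout from steps 2 and 3. The class name, the PIL-based loading, and the label indexing are illustrative; this is not the dataloader used by main.py.

    # Illustrative pairing of train_frames/<action>/<video>/<frame> with the
    # matching file under train_ofs; not the loader shipped with main.py.
    import os
    from PIL import Image
    from torch.utils.data import Dataset

    class FrameFlowDataset(Dataset):
        def __init__(self, root, split="train"):
            self.frame_root = os.path.join(root, split + "_frames")
            self.flow_root = os.path.join(root, split + "_ofs")
            self.samples = []
            for action in sorted(os.listdir(self.frame_root)):
                for video in sorted(os.listdir(os.path.join(self.frame_root, action))):
                    video_dir = os.path.join(self.frame_root, action, video)
                    for frame in sorted(os.listdir(video_dir)):
                        # keep only frames whose optical flow exists
                        # (the last frame(s) of each video have none)
                        if os.path.exists(os.path.join(self.flow_root, action, video, frame)):
                            self.samples.append((action, video, frame))
            self.classes = sorted(set(s[0] for s in self.samples))

        def __len__(self):
            return len(self.samples)

        def __getitem__(self, idx):
            action, video, frame = self.samples[idx]
            rgb = Image.open(os.path.join(self.frame_root, action, video, frame)).convert("RGB")
            flow = Image.open(os.path.join(self.flow_root, action, video, frame)).convert("RGB")
            return rgb, flow, self.classes.index(action)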

Demo

  1. Dependencies:

    • Python 3.5
    • Pytorch >= 1.1.0
    • torchvision
    • Numpy
    • Pillow
    • tqdm
    • PyQt5
  2. Usage

    1. Download the pre-trained model from Google Drive and put it into Demo/Code/
    2. Run the demo by:
    python main.py
    
    3. After the main window appears:

      (screenshot: main window)

      1. Click "Select Image":

        (screenshot: image selected)

      2. Click "Choose Object" and drag out a bounding box around the target:

        (screenshot: bounding box drawn around the target)

      3. Click "Annotate" and wait a moment (about 5 s on an i7 CPU):

        (screenshot: annotation result)

        (Because we automatically assign the bottom-left corner as the start point, the result may be suboptimal in some scenes.)

About

Code and demo for the paper “Complex Sequential Understanding through the Awareness of Spatial and Temporal Concepts”, submitted to Nature Machine Intelligence.
