This software project accompanies the research paper "Video Frame Interpolation via Structure Motion based Iterative Feature Fusion".
This work proposes an end-to-end structure-motion based iterative fusion method for video frame interpolation.
In this project, we propose a video frame interpolation method via structure-motion based iterative fusion, which aims to produce results with a clear and plausible appearance. To achieve this goal, a two-stage framework is established. Given two adjacent frames, the first stage encodes the images with structure-based and motion-based learning branches, respectively. The second stage then introduces a temporal information alignment unit and a spatial feature based rectifier unit, which provide further enhancement based on the adjacent frames and hierarchical context. An iterative learning structure integrates the spatial and temporal feature based optimization, and hence generates video results of higher quality. To learn more about this work, please refer to the paper [3] below.
The code structure of this project is listed below.
Datas/
    vimeo_90k_interpolation.py      # dataloader for Vimeo-90k
Models/
    SmiffNet.py                     # network definition of the proposed SMIFF
    my_package/                     # C++/CUDA functions for projection calculation
Utils/                              # other helper functions
    average_meter.py                # evaluation metrics
    loss_functions.py               # loss functions
    lr_scheduler.py                 # learning-rate scheduler
inference.py                        # inference pipeline
train.py                            # training pipeline
The following dependencies are required:
- pytorch: version==1.2.0
- mmdetection: version==1.1
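For reference, one way to set up such an environment is sketched below. The torchvision pairing and the mmdetection install route are assumptions (mmdetection v1.x predates its PyPI packaging and is normally built from source), so adapt the steps to your CUDA toolkit:

pip install torch==1.2.0 torchvision==0.4.0
# Build mmdetection v1.1 from source (assumed tag name; check the repo's releases)
git clone -b v1.1.0 https://github.com/open-mmlab/mmdetection.git
cd mmdetection
pip install -v -e .
cd ../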
- Clone this repo:
git clone git@github.pie.apple.com:weston-li/SMIFF.git
- Build the dependencies:
cd my_package/
sh build.sh
cd ../
cd Models/correlation_package_pytorch1_0/
sh build.sh
cd ../../
- Prepare your training dataset. The Vimeo-90k triplet set is widely used; you can download it from http://toflow.csail.mit.edu/index.html#triplet
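For example (the direct archive URL below is taken from the ToFlow page and may have changed since):

wget http://data.csail.mit.edu/tofu/dataset/vimeo_triplet.zip
unzip vimeo_triplet.zip
# The archive unpacks to vimeo_triplet/, containing the frame folders in
# sequences/ plus the split files tri_trainlist.txt and tri_testlist.txt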
- Set your path to the dataset and to the txt files for "train_list" and "test_list" in "train.sh"
- Download the pretrained weights from "https://www.icloud.com.cn/iclouddrive/0aaPyenXEametIepzSqXARpPQ#SMIFF_weights" into "Weights/", and set the weight path in "train.sh" as well.
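A "train.sh" along these lines is a reasonable starting point; the flag names and the weight filename here are assumptions, so check the argument definitions in train.py for the exact ones:

# Hypothetical train.sh; adjust flag names to match train.py's options
python train.py \
    --data_root ./vimeo_triplet/sequences \
    --train_list ./vimeo_triplet/tri_trainlist.txt \
    --test_list ./vimeo_triplet/tri_testlist.txt \
    --pretrained ./Weights/SMIFF_weights.pth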
- Run the training process by:
sh train.sh
- Prepare your inference dataset. A video should be split into frames, and each pair of adjacent frames should be put into its own folder; a sketch for producing this layout follows the listing below. The dataset should be laid out as:
video/
    1/
        frame_00.png
        frame_02.png
    ...
    n/
        frame_00.png
        frame_02.png
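If you are starting from a raw video, something like the following (using ffmpeg, with hypothetical file names) produces that layout:

# Extract all frames, then copy each consecutive pair into its own folder
mkdir -p frames video
ffmpeg -i input.mp4 frames/frame_%04d.png
i=1
prev=""
for f in frames/*.png; do
    if [ -n "$prev" ]; then
        mkdir -p "video/$i"
        cp "$prev" "video/$i/frame_00.png"
        cp "$f" "video/$i/frame_02.png"
        i=$((i+1))
    fi
    prev="$f"
done

The gap in the numbering (frame_00, frame_02) presumably leaves frame_01 for the interpolated middle frame.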
- Set your path to the dataset in "inference.sh"
- Download the trained weights from "https://www.icloud.com.cn/iclouddrive/0aaPyenXEametIepzSqXARpPQ#SMIFF_weights" into "Weights/", and set the weight path in "inference.sh" as well.
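As with training, an "inference.sh" of roughly this shape should work; the flag names are assumptions to be checked against inference.py:

# Hypothetical inference.sh; adjust flag names to match inference.py's options
python inference.py \
    --data_root ./video \
    --weights ./Weights/SMIFF_weights.pth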
- Run the inference process by:
sh inference.sh
This work mainly builds on DAIN [1] and FeatureFlow [2]. If you follow up on this work [3], please cite the following papers:
[1] Bao W, Lai W S, Ma C, et al. Depth-aware video frame interpolation[C]//CVPR 2019.
[2] Gui S, Wang C, Chen Q, et al. FeatureFlow: Robust video interpolation via structure-to-texture generation[C]//CVPR 2020.
[3] Li X, Cao M, Tang Y, et al. Video frame interpolation via structure motion based iterative feature fusion[C]//SID 2021.