Dynamic Frame Interpolation in Wavelet Domain

The official PyTorch implementation of WaveletVFI (TIP 2023).

Authors: Lingtong Kong, Boyuan Jiang, Donghao Luo, Wenqing Chu, Ying Tai, Chengjie Wang, Jie Yang

Abstract

Video frame interpolation is an important low-level vision task that increases frame rate for a smoother visual experience. Existing methods have achieved great success by employing advanced motion models and synthesis networks. However, the spatial redundancy in synthesizing the target frame has not been fully explored, which can result in a large amount of inefficient computation. Moreover, how far computation can be compressed in frame interpolation depends heavily on both texture distribution and scene motion, so selecting a good compression degree requires understanding the spatial-temporal information of each input frame pair. In this work, we propose a novel two-stage frame interpolation framework, termed WaveletVFI, to address these problems. It first estimates intermediate optical flow with a lightweight motion perception network. A wavelet synthesis network then uses flow-aligned context features to predict multi-scale wavelet coefficients with sparse convolution for efficient target frame reconstruction, where the sparse valid masks that control computation at each scale are determined by a crucial threshold ratio. Instead of setting a fixed value as in previous methods, we find that embedding a classifier in the motion perception network to learn a dynamic threshold for each sample achieves greater computation reduction with almost no loss of accuracy. On common high-resolution and animation frame interpolation benchmarks, the proposed WaveletVFI reduces computation by up to 40% while maintaining similar accuracy, making it more efficient than other state-of-the-art methods.

Framework

Overall framework of WaveletVFI, which interpolates frames dynamically in the wavelet domain.
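
As a rough illustration of how a threshold ratio can turn wavelet coefficients into a sparse valid mask, the dependency-free Python sketch below applies a one-level 2D Haar transform to a tiny image and marks the positions whose high-frequency coefficients are large relative to the peak magnitude. The function names, the choice of the Haar wavelet, and the masking rule are assumptions made for illustration only. They are not taken from models/WaveletVFI.py, where the coefficients are predicted by a network with sparse convolution, and the exact rule used there may differ.

```python
def _zeros(rows, cols):
    return [[0.0] * cols for _ in range(rows)]


def haar2d(img):
    """One-level orthonormal 2D Haar transform of an even-sized 2D list.

    Returns (ll, lh, hl, hh), each with half the height and width of img.
    Sub-band naming conventions vary between libraries.
    """
    h, w = len(img), len(img[0])
    ll, lh, hl, hh = (_zeros(h // 2, w // 2) for _ in range(4))
    for i in range(0, h, 2):
        for j in range(0, w, 2):
            a, b = img[i][j], img[i][j + 1]
            c, d = img[i + 1][j], img[i + 1][j + 1]
            ll[i // 2][j // 2] = (a + b + c + d) / 2.0  # low-pass average
            lh[i // 2][j // 2] = (a - b + c - d) / 2.0  # left/right difference
            hl[i // 2][j // 2] = (a + b - c - d) / 2.0  # top/bottom difference
            hh[i // 2][j // 2] = (a - b - c + d) / 2.0  # diagonal difference
    return ll, lh, hl, hh


def sparse_valid_mask(bands, ratio):
    """Assumed rule: a position is valid if any high-frequency magnitude
    there exceeds ratio * (largest magnitude over all given bands)."""
    peak = max(abs(v) for band in bands for row in band for v in row)
    thresh = ratio * peak
    rows, cols = len(bands[0]), len(bands[0][0])
    return [[any(abs(band[i][j]) > thresh for band in bands)
             for j in range(cols)] for i in range(rows)]


def valid_fraction(mask):
    """Fraction of positions that would still need computation."""
    total = sum(len(row) for row in mask)
    return sum(v for row in mask for v in row) / total


# A flat image with a single bright pixel: only one 2x2 block has detail.
img = [[1, 1, 1, 1],
       [1, 1, 1, 1],
       [1, 1, 5, 1],
       [1, 1, 1, 1]]
ll, lh, hl, hh = haar2d(img)
mask = sparse_valid_mask((lh, hl, hh), ratio=0.5)
print(mask)                  # [[False, False], [False, True]]
print(valid_fraction(mask))  # 0.25
```

In this toy case three of the four positions carry no high-frequency detail, so a sparse operator would only need to visit a quarter of them. Raising the ratio prunes more positions at the cost of dropping finer detail.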

Preparation

This repository has been verified with Python 3.6/3.7 and PyTorch 1.9.1/1.10.1.
$ cd pytorch_wavelets && python setup.py install
$ pip install onnx imageio
Download the training and test dataset: Vimeo90K
Set the dataset path in the code to match its location on your machine.
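
Before training or evaluating, it can help to confirm that the dataset path points at a complete download. The sketch below assumes the standard Vimeo90K triplet layout (a sequences folder plus tri_trainlist.txt and tri_testlist.txt); whether this repository expects exactly that layout is an assumption, and missing_entries and the example path are hypothetical names, not part of the repository.

```python
import os

# Assumed layout of the Vimeo90K triplet release; adjust if your copy differs.
EXPECTED_ENTRIES = ("sequences", "tri_trainlist.txt", "tri_testlist.txt")


def missing_entries(root):
    """Return the expected entries that are absent under root."""
    return [name for name in EXPECTED_ENTRIES
            if not os.path.exists(os.path.join(root, name))]


if __name__ == "__main__":
    root = "/path/to/vimeo_triplet"  # hypothetical path, replace with yours
    missing = missing_entries(root)
    print("OK" if not missing else "Missing: " + ", ".join(missing))
```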

Evaluation

Download our pre-trained models from this link, then put the downloaded checkpoints file in the root directory of this repository.
Run the following script to evaluate on the Vimeo90K test set.

$ python benchmark/Vimeo90K.py

Training

Stage 1: pre-train WaveletVFI statically on the Vimeo90K training set.

$ python -m torch.distributed.launch --nproc_per_node=4 train_vimeo90k.py --world_size 4 --epochs 300 --batch_size 6 --lr_start 1e-4 --lr_end 1e-5
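
The --lr_start and --lr_end flags suggest that the learning rate decays from 1e-4 to 1e-5 over training. The sketch below shows one common way to do this, cosine annealing per step. That train_vimeo90k.py uses this exact schedule is an assumption, and cosine_lr is a hypothetical helper, not a function from the repository.

```python
import math


def cosine_lr(step, total_steps, lr_start=1e-4, lr_end=1e-5):
    """Cosine decay from lr_start at step 0 to lr_end at total_steps.

    Assumed schedule for illustration; check train_vimeo90k.py for the
    schedule the repository actually applies.
    """
    progress = min(max(step / total_steps, 0.0), 1.0)
    return lr_end + (lr_start - lr_end) * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    total = 1000
    for step in (0, 250, 500, 750, 1000):
        print(step, cosine_lr(step, total))
```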

Stage 2: load the WaveletVFI weights pre-trained in Stage 1 by uncommenting model.load_state_dict(...) in train_vimeo90k.py, set proper weighting coefficients in models/WaveletVFI.py, and then train WaveletVFI dynamically on the Vimeo90K training set.

$ python -m torch.distributed.launch --nproc_per_node=4 train_vimeo90k.py --world_size 4 --epochs 100 --batch_size 6 --lr_start 1e-4 --lr_end 1e-5 --dynamic 'true'
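
In the dynamic setting, a classifier chooses a threshold ratio for each sample instead of using one fixed value. The sketch below shows only the inference-time idea: turn classifier logits into probabilities and pick the most likely candidate. The candidate ratios and the names softmax and select_ratio are hypothetical; the set of thresholds and the training procedure used in models/WaveletVFI.py are not reproduced here.

```python
import math

# Hypothetical candidate threshold ratios; the repository's set may differ.
CANDIDATE_RATIOS = (0.0, 0.005, 0.01, 0.02, 0.04)


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def select_ratio(logits, candidates=CANDIDATE_RATIOS):
    """Return (ratio, probability) of the most likely candidate (argmax)."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return candidates[best], probs[best]


if __name__ == "__main__":
    # One logit per candidate, as a per-sample classifier head might emit.
    print(select_ratio([0.1, 2.0, 0.3, -1.0, 0.0]))
```

An argmax is not differentiable, so training such a discrete choice end to end typically needs a relaxation; Gumbel-softmax is one common option. This sketch does not cover that step.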

Visualization

Predicted target frames and multi-scale sparse valid masks on diverse datasets.

Citation

When using any parts of the Software or the Paper in your work, please cite the following paper:

@Article{Kong_2023_TIP,
  author={Kong, Lingtong and Jiang, Boyuan and Luo, Donghao and Chu, Wenqing and Tai, Ying and Wang, Chengjie and Yang, Jie},
  journal={IEEE Transactions on Image Processing}, 
  title={Dynamic Frame Interpolation in Wavelet Domain}, 
  year={2023},
  doi={10.1109/TIP.2023.3315151}
}
