Advancing Learned Video Compression with In-loop Frame Prediction

Our other works on learned video compression:

Perceptual Learned Video Compression (PLVC) (IJCAI 2022) [Paper] [Codes]
Hierarchical Learned Video Compression (HLVC) (CVPR 2020) [Paper] [Codes]
Recurrent Learned Video Compression (RLVC) (IEEE J-STSP 2021) [Paper] [Codes]
OpenDVC: An open source implementation of DVC [Codes] [Technical report]

Advancing Learned Video Compression with In-loop Frame Prediction

The project page for the paper:

Ren Yang, Radu Timofte and Luc Van Gool, "Advancing Learned Video Compression with In-loop Frame Prediction", IEEE Transactions on Circuits and Systems for Video Technology (IEEE T-CSVT), 2022. [Paper]

If our paper and codes are useful for your research, please cite:

@article{yang2022advancing,
  title={Advancing Learned Video Compression with In-loop Frame Prediction},
  author={Yang, Ren and Timofte, Radu and Van Gool, Luc},
  journal={IEEE Transactions on Circuits and Systems for Video Technology},
  year={2022},
  publisher={IEEE}
}

If you have questions or find bugs, please contact:

Ren Yang @ ETH Zurich, Switzerland

Email: r.yangchn@gmail.com

Codes

Preperation

We feed RGB images into the our encoder. To compress a YUV video, please first convert to PNG images with the following command.

ffmpeg -pix_fmt yuv420p -s WidthxHeight -i Name.yuv -vframes Frame path_to_PNG/f%03d.png

Note that, our RLVC codes currently only support the frames with the height and width as the multiples of 16. Therefore, when using these codes, if the height and width of frames are not the multiples of 16, please first crop frames, e.g.,

ffmpeg -pix_fmt yuv420p -s 1920x1080 -i Name.yuv -vframes Frame -filter:v "crop=1920:1072:0:0" path_to_PNG/f%03d.png

We uploaded a prepared sequence BasketballPass here as a test demo, which contains the PNG files of the first 100 frames.

Dependency

Tensorflow 1.12

(Since we train the models on tensorflow-compression 1.0, which is only compatibable with tf 1.12, the pre-trained models are not compatible with higher versions.)
Tensorflow-compression 1.0 (Download link)

(After downloading, put the folder "tensorflow_compression" to the same directory as the codes.)
SciPy 1.2.0

(Since we use misc.imread, do not use higher versions in which misc.imread is removed.)
Pre-trained models (Download link)

(Download the folder "model" to the same directory as the codes.)
VTM (Download link)

(In our PSNR model, we use VVC to compress I-frames. Please compile VTM and put the folder "VVCSoftware_VTM" in the same directory as the codes.)

Test code

The augments in the ALVC test code (ALVC.py) include:

--path, the path to PNG files;

--l, lambda value. The pre-trained PSNR models are trained by 4 lambda values, i.e., 256, 512, 1024 and 2048, with increasing bit-rate/PSNR.

For example:

python ALVC.py --path BasketballPass

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
BasketballPass		BasketballPass
.gitignore		.gitignore
ALVC.py		ALVC.py
CNN_img.py		CNN_img.py
CNN_recurrent.py		CNN_recurrent.py
Extrapolation.py		Extrapolation.py
Interpolation_Compression.py		Interpolation_Compression.py
MC_network_inter.py		MC_network_inter.py
README.md		README.md
Recurrent_AutoEncoder_Extrapolation.py		Recurrent_AutoEncoder_Extrapolation.py
Recurrent_Prob_Model.py		Recurrent_Prob_Model.py
arithmeticcoding.py		arithmeticcoding.py
func.py		func.py
functions.py		functions.py
functions_inter.py		functions_inter.py
helper.py		helper.py
helper2.py		helper2.py
inv_flow.py		inv_flow.py
mc_func.py		mc_func.py
motion.py		motion.py
ms_ssim_np.py		ms_ssim_np.py
rec_exp.py		rec_exp.py
sepconv_inter.py		sepconv_inter.py
sepconv_inter_enc.py		sepconv_inter_enc.py

RenYang-home/ALVC

Folders and files

Latest commit

History

Repository files navigation

Advancing Learned Video Compression with In-loop Frame Prediction

Codes

Preperation

Dependency

Test code

About

Resources

Stars

Watchers

Forks

Languages