Skip to content

A Hierarchical Deep Learning Approach for In-loop Filtering in Video Coding

Notifications You must be signed in to change notification settings

IVC-Projects/cnn_In-loop_filter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A Hierarchical Deep Learning Approach for In-loop Filtering in Video Coding


The paper proposes a deep learning based systematic approach that includes an effective Convolutional Neural Network (CNN) structure, a hierarchical training strategy, and a video codec oriented switchable mechanism. In brief, the contributions of this work are as follows:

  • A novel network,named as Squeezeand-Excitation Filtering CNN (SEFCNN), is designed, which is comprised of two subnets: Feature EXtracting (FEX) net and Feature ENhancing (FEN) net. The FEX is a stack of convolutional layers characterizing the spatial and channel-wise correlation, while the FEN is a squeeze-andexcitation net that fully explores the relationship between channels.
  • A hierarchical model training strategy is developed. During the encoding, (a) different Quantization Parameters (QPs) cause different levels of artifacts; (b) different frame types employ different coding tools and thus exhibit different artifact proprieties. In contrast to prior researches that design a single powerful network for all kinds of artifacts, we propose to hierarchically deploy two subnets for different coding scenarios.
  • When incorporating CNN model into video encoder, we conduct an adaptive mechanism that switches between the CNN-based and the traditional methods to selectively enhance some frames or some regions of a frame. Compared to previous work that applies one model to every single frame, our approach takes advantage of coding reference structure and obtains the superiority in both encoder computational complexity and overall coding efficiency.

State-of-the-art solutions

State-of-the-art solutions

Network Structure

Network Structure

Hierarchical CNN Models for Different Frame Types

Hierarchical CNN Models for Different Frame Types fig 6

visual comparisons

visual comparisons

EXPERIMENTAL RESULTS

CODING PERFORMANCE COMPARED WITH VRCNN (BD-RATE) CODING PERFORMANCE OF THE PROPOSED SEFCNN COMPARED TO VRCNN AND RHCNN (QP = 37) CODING PERFORMANCE COMPARED WITH WORK [39] (BD-RATE COMPARED TO HM12.0)  PSNR histogram statistics of different network structures on common test sequences. CODING PERFORMANCE COMPARED WITH WORK [40] (BD-RATE COMPARED TO HM7.0) Visualization of the parameter quantity versus PSNR gain in various network deployments. AVERAGE ENCODING TIME COMPARED TO HM16.9 (SECOND/FRAME)

About

A Hierarchical Deep Learning Approach for In-loop Filtering in Video Coding

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published