Skip to content

Aitical/CFSR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach

Gang Wu, Junjun Jiang, Junpeng Jiang, and Xianming Liu

AIIA Lab, Harbin Institute of Technology.


paper | results | pretrained models

Hits

This repository is the official PyTorch implementation of "Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach"

Recent progress in single-image super-resolution (SISR) has achieved remarkable performance, yet the computational costs of these methods remain a challenge for deployment on resource-constrained devices. Especially for transformer-based methods, the self-attention mechanism in such models brings great breakthroughs while incurring substantial computational costs. To tackle this issue, we introduce the Convolutional Transformer layer (ConvFormer) and the ConvFormer-based Super-Resolution network (CFSR), which offer an effective and efficient solution for lightweight image super-resolution tasks. In detail, CFSR leverages the large kernel convolution as the feature mixer to replace the self-attention module, efficiently modeling long-range dependencies and extensive receptive fields with a slight computational cost. Furthermore, we propose an edge-preserving feed-forward network, simplified as EFN, to obtain local feature aggregation and simultaneously preserve more high-frequency information. Extensive experiments demonstrate that CFSR can achieve an advanced trade-off between computational cost and performance when compared to existing lightweight SR methods. Compared to state-of-the-art methods, e.g. ShuffleMixer, the proposed CFSR achieves \textit{0.39 dB} gains on Urban100 dataset for $\times2$ SR task while containing\textit{ 26% }and \textit{31%} fewer parameters and FLOPs, respectively.

Results

Results of x2, x3, and x4 SR tasks are available at Google Drive

Method Scale Params FLOPs Set5 (PSNR/SSIM) Set14 (PSNR/SSIM) B100 (PSNR/SSIM) Urban100 (PSNR/SSIM) Manga109 (PSNR/SSIM)
VDSR $666 \mathrm{~K}$ $612.6 \mathrm{G}$ $31.35 / 0.8838$ $28.01 / 0.7674$ $27.29 / 0.7251$ $25.18 / 0.7524$ $28.83 / 0.8870$
LapSRN $813 \mathrm{~K}$ $149.4 \mathrm{G}$ $31.54 / 0.8852$ $28.09 / 0.7700$ $27.32 / 0.7275$ $25.21 / 0.7562$ $29.09 / 0.8900$
IDN $553 \mathrm{~K}$ $32.3 \mathrm{G}$ $31.82 / 0.8903$ $28.25 / 0.7730$ $27.41 / 0.7297$ $25.41 / 0.7632$ $29.41 / 0.8942$
CARN $592 \mathrm{~K}$ $90.9 \mathrm{G}$ $32.13 / 0.8937$ $28.60 / 0.7806$ $27.58 / 0.7349$ $26.07 / 0.7837$ $30.47 / 0.9084$
SRResNet $1,518 \mathrm{~K}$ $146 \mathrm{G}$ $32.17 / 0.8951$ $28.61 / 0.7823$ $27.59 / 0.7365$ $26.12 / 0.7871$ $30.48 / 0.9087$
IMDN $715 \mathrm{~K}$ $40.9 \mathrm{G}$ $32.21 / 0.8948$ $28.58 / 0.7811$ $27.56 / 0.7353$ $26.04 / 0.7838$ $30.45 / 0.9075$
LatticeNet $777 \mathrm{~K}$ $43.6 \mathrm{G}$ $32.18 / 0.8943$ $28.61 / 0.7812$ $27.57 / 0.7355$ $26.14 / 0.7844$ $-/-$
LAPAR-A $\times 4$ $659 \mathrm{~K}$ $94.0 \mathrm{G}$ $32.15 / 0.8944$ $28.61 / 0.7818$ $27.61 / 0.7366$ $26.14 / 0.7871$ $30.42 / 0.9074$
SMSR $1006 \mathrm{~K}$ $41.6 \mathrm{G}$ $32.12 / 0.8932$ $28.55 / 0.7808$ $27.55 / 0.7351$ $26.11 / 0.7868$ $30.54 / 0.9085$
ECBSR $603 \mathrm{~K}$ $34.7 \mathrm{G}$ $31.92 / 0.8946$ $28.34 / 0.7817$ $27.48 / 0.7393$ $25.81 / 0.7773$ $-/-$
PAN $272 \mathrm{~K}$ $28.2 \mathrm{G}$ $32.13 / 0.8948$ $28.61 / 0.7822$ $27.59 / 0.7363$ $26.11 / 0.7854$ $30.51 / 0.9095$
DRSAN $410 \mathrm{~K}$ $30.5 \mathrm{G}$ $32.15 / 0.8935$ $28.54 / 0.7813$ $27.54 / 0.7364$ $26.06 / 0.7858$ $-/-$
DDistill-SR $434 \mathrm{~K}$ $33.0 \mathrm{G}$ $32.23 / 0.8960$ $28.62 / 0.7823$ $27.58 / 0.7365$ $26.20 / 0.7891$ $30.48 / 0.9090$
RFDN $550 \mathrm{~K}$ $23.9 \mathrm{G}$ $32.24 / 0.8952$ $28.61 / 0.7819$ $27.57 / 0.7360$ $26.11 / 0.7858$ $30.58 / 0.9089$
ShuffleMixer $411 K$ $28.0 \mathrm{G}$ $32.21 / 0.8953$ $28.66 / 0.7827$ $27.61 / 0.7366$ $26.08 / 0.7835$ $30.65 / 0.9093$
CFSR (Ours) $307 \mathrm{~K}$ $17.5 \mathrm{G}$ $32.33/0.8964$ $28.73 / 0.7842$ $27.63 / 0.7381$ $26.21/0.7897$ $30.72 / 0.9111$

TODO

  • Add implementation code
  • Add pretrained model

About

Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published