TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting

Paper: arXiv:2306.11528

Code and datasets for the paper TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting.

Abstract: Image inpainting for completing complicated semantic environments and diverse hole patterns of corrupted images is challenging even for state-of-the-art learning-based inpainting methods trained on large-scale data. A reference image capturing the same scene as a corrupted image offers informative guidance for completing the corrupted image, as it shares similar texture and structure priors with the holes of the corrupted image. In this work, we propose a transformer-based encoder-decoder network, named TransRef, for reference-guided image inpainting. Specifically, the guidance is conducted progressively through a reference embedding procedure, in which the reference features are successively aligned and fused with the features of the corrupted image. For precise utilization of the reference features for guidance, a reference-patch alignment (Ref-PA) module is proposed to align the patch features of the reference and corrupted images and harmonize their style differences, while a reference-patch transformer (Ref-PT) module is proposed to refine the embedded reference feature. Moreover, to facilitate research on reference-guided image restoration tasks, we construct a publicly accessible benchmark dataset containing 50K pairs of input and reference images. Both quantitative and qualitative evaluations demonstrate the efficacy of the reference information and the superiority of the proposed method over state-of-the-art methods in completing complex holes.

Usage Instructions

Environment

Please install Anaconda and PyTorch. For the other dependencies, please refer to requirements.txt.

conda create -n TransRef python=3.8
conda activate TransRef
git clone https://github.com/Cameltr/TransRef.git
cd TransRef
pip install -r requirements.txt
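
As an optional sanity check (not part of the original setup steps), you can verify that PyTorch is installed and can see a GPU:

python -c "import torch; print(torch.__version__, torch.cuda.is_available())"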

Datasets

For reference-guided image inpainting, the similarity between the corrupted image and its reference image is of great significance to the inpainting result. However, to the best of our knowledge, no publicly available dataset exists for this new task. In this work, we construct a new dataset, DPED50K, extended from the DPED10K dataset we previously proposed in RGTSI. It is built on the DPED dataset, which consists of real-world photos captured by three different mobile phones and one high-end reflex camera.

  • Please download the DPED50K dataset from Baidu Netdisk (password: pxl2) or Google Cloud.

  • TransRef is trained and tested on the irregular mask dataset provided by Liu et al. You can download the publicly available Irregular Mask Dataset from their website.

  • Create a folder, unzip the datasets into it, and then edit the path to the folder in options/base_options.py (a possible layout is sketched below).
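
For example, one possible way to organize the downloads is sketched below. The folder and archive names are only illustrative assumptions; use whatever layout you prefer and point the corresponding paths in options/base_options.py (or the command-line flags shown in the training command) to it.

# Hypothetical layout; adjust names to match your actual downloads
mkdir -p datasets/DPED50K datasets/irregular_mask
unzip DPED50K.zip -d datasets/DPED50K
unzip irregular_mask.zip -d datasets/irregular_mask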

Pre-trained weight and test model

We will release the pre-trained model after the paper is accepted.

Training and Testing

# To train on your own dataset, for example:
python train.py --de_root=[the path of ground truth images] --mask_root=[the path of mask images] --ref_root=[the path of reference images]

There are many options you can specify. Please run python train.py --help or see the files in the options folder.

Training logs are written to logs/[name] and can be monitored with TensorBoard.
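
For example, assuming the default logs/ directory, the training curves can be viewed with:

tensorboard --logdir ./logs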

# To test on your own dataset, for example:
python test.py

Please edit the path of the test images in test.py when testing on your own dataset.

Citation

If you find our code or datasets helpful for your research, please cite:

@article{TransRef,
    title={TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting},
    author={Liao, Liang and Liu, Taorong and Chen, Delin and Xiao, Jing and Wang, Zheng and Lin, Chia-Wen and Satoh, Shin’Ichi},
    year={2023},
    journal={arXiv preprint arXiv:2306.11528},
}

@inproceedings{RGTSI,
    title={Reference-guided texture and structure inference for image inpainting},
    author={Liu, Taorong and Liao, Liang and Wang, Zheng and Satoh, Shin’Ichi},
    booktitle={2022 IEEE International Conference on Image Processing (ICIP)},
    pages={1996--2000},
    year={2022},
    organization={IEEE}
}
