WACV2024-DocReal

The source code for the WACV'2024 paper titled "DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction".

Introduction

This Github repository contains the first Chinese distorted image benchmark, i.e., DocReal, designed for a wide range of Chinese real-life document image scenarios.

3 Scenarios: DocReal includes 10 subscenarios across 3 significant scenarios: work, study, and daily life. The work scenario includes tables, contracts, and notices, while the study scenario includes papers, books, tests, and notes. The daily life scenario includes receipts, certificates, and newspapers. All these sub-scenarios cover the vast majority of usage scenarios in Chinese people’s daily lives..
5 Deformations: Each sub-scenario contains five different contents, and for each content, there are four distorted images with varying types of deformation, shooting angles, and shooting distances. The deformation types include curled, perspective, skew, folded, and flat, commonly encountered in real-life document images.

In total, DocReal contains 200 images within five classes, each representing a different deformation type to facilitate qualitative comparisons, providing a comprehensive and diverse dataset for researchers in the field of document image dewarping.

!!! You can download DocReal through the following link.

Google Drive: https://drive.google.com/file/d/1fxdUMMCQoTxc-THv1LmO-Ye8XjMyPK--/view?usp=sharing

Method

Performance Evaluation

Our method achieves the SOTA performance in terms of image similarity and OCR performance.

Our method effectively removes background and improves text readability, while other methods face challenges with residual background and reduced textability.

Rectified Results

This work was done during Fangchen Yu’s internship at vivo AI Lab. Due to commercial restrictions, we are unable to provide the source code currently. Instead, we will provide a demo API in the future, allowing for further exploration and optimization by the research community.

You can obtain our rectified results for DocUNet and DocReal through the following link.

Google Drive: https://drive.google.com/file/d/1i0LMai40OBl-tit92KKFNg9CnXXKZv6d/view?usp=sharing

Citation

If you find this code useful for your research, please use the following BibTeX entry.

@inproceedings{yu2024docreal,
  title={DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction},
  author={Yu, Fangchen and Xie, Yina and Wu, Lei and Wen, Yafei and Wang, Guozhi and Ren, Shuai and Chen, Xiaoxin and Mao, Jianfeng and Li, Wenye},
  booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision},
  pages={665--674},
  year={2024}
}

Contact

If you have any problems or questions, please contact the author: Fangchen Yu (email: fangchenyu@link.cuhk.edu.cn)

Name	Name	Last commit message	Last commit date
Latest commit CUHKSZ-Yu update Dec 26, 2023 3e994cf · Dec 26, 2023 History 12 Commits
Fig	Fig	update	Dec 26, 2023
LICENSE	LICENSE	Initial commit	Jun 26, 2023
README.md	README.md	update	Dec 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WACV2024-DocReal

Introduction

Method

Performance Evaluation

Rectified Results

Citation

Contact

About

Releases

Packages

License

SciYu/DocReal

Folders and files

Latest commit

History

Repository files navigation

WACV2024-DocReal

Introduction

Method

Performance Evaluation

Rectified Results

Citation

Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages