DIQA_CNN

PyTorch 0.4.1 implementation of the following paper: Le Kang, et al. "A DEEP LEARNING APPROACH TO DOCUMENT IMAGE QUALITY ASSESSMENT." 2014 ICIP.

The SOC dataset can be downloaded from DIQA: Document Image Quality Assessment Datasets

Note

Download the dataset, put all images in a single directory, and set that directory as root in config.yaml.

The ground truth for the dataset has been pre-processed and saved as an Excel file at ./data/gt_files/SOC_gt.xlsx

The ground truth file contains:

img_name: the image name
img_set: the index of the reference image from which the current degraded image was generated
acc_f: OCR accuracy by ABBYY Finereader
acc_t: OCR accuracy by Tesseract
acc_o: OCR accuracy by Omnipage
acc_avg: average accuracy of the three OCR engines above
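
The column layout above can be sanity-checked with a short script. The following is a minimal sketch, not code from this repository: it assumes acc_avg is the plain arithmetic mean of acc_f, acc_t and acc_o, and the helper names and sample rows are hypothetical. Real rows could be obtained, for example, with pandas.read_excel("./data/gt_files/SOC_gt.xlsx").to_dict("records"), if pandas and an Excel reader are installed.

```python
from statistics import mean

# Per-engine accuracy columns, as named in the ground truth file.
OCR_COLUMNS = ("acc_f", "acc_t", "acc_o")


def average_accuracy(row):
    """Arithmetic mean of the three per-engine OCR accuracies in one row.

    Assumption: this is how acc_avg was computed; the README only says
    "average accuracy of the three OCR engines".
    """
    return mean(row[col] for col in OCR_COLUMNS)


def group_by_reference(rows):
    """Map each img_set index to the names of the degraded images derived from it."""
    groups = {}
    for row in rows:
        groups.setdefault(row["img_set"], []).append(row["img_name"])
    return groups


# Hypothetical rows; names and values are made up for illustration only.
rows = [
    {"img_name": "a.jpg", "img_set": 1, "acc_f": 0.5, "acc_t": 0.25, "acc_o": 0.75},
    {"img_name": "b.jpg", "img_set": 1, "acc_f": 1.0, "acc_t": 1.0, "acc_o": 1.0},
    {"img_name": "c.jpg", "img_set": 2, "acc_f": 0.0, "acc_t": 0.5, "acc_o": 1.0},
]

averages = [average_accuracy(row) for row in rows]
groups = group_by_reference(rows)
```

Grouping by img_set is useful when splitting the data, because all degraded versions of one reference image can then be kept on the same side of a train/validation split.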

Details on how this dataset was created:

A Dataset for Quality Assessment of Camera Captured Document Images

Training and validating

python main.py --batch_size=128 --epochs=500 --lr=0.001

Before training, the root in config.yaml must be specified.
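
For readers unfamiliar with the flag syntax in the command above, here is a minimal sketch of how such options are commonly parsed with argparse. It is an illustration only and may not match what main.py actually does; the function name build_parser and the default values are assumptions, with the defaults simply copied from the example command.

```python
import argparse


def build_parser():
    """Build a parser for the three flags used in the example command.

    Hypothetical helper: main.py may define its options differently.
    """
    parser = argparse.ArgumentParser(
        description="Train and validate a DIQA CNN (illustrative sketch)"
    )
    parser.add_argument("--batch_size", type=int, default=128,
                        help="number of samples per mini-batch")
    parser.add_argument("--epochs", type=int, default=500,
                        help="number of passes over the training set")
    parser.add_argument("--lr", type=float, default=0.001,
                        help="learning rate")
    return parser


# Parse the same arguments as the command shown above.
args = build_parser().parse_args(
    ["--batch_size=128", "--epochs=500", "--lr=0.001"]
)
print(args.batch_size, args.epochs, args.lr)
```

Both --lr=0.001 and --lr 0.001 are accepted by argparse, so either spelling works on the command line.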

demo_DIQA

python demo_DIQA.py

When a DIQA model has been trained, demo_DIQA.py can be used to predict the quality of a document image directly.

Before running demo_DIQA.py, the model_path and img_path must be specified.
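
The cited paper scores small image patches and averages the patch scores into one document-level score. The sketch below illustrates that averaging step in plain Python. It is not taken from demo_DIQA.py: the function names, the non-overlapping patch grid, the default patch size and the stand-in scorer are all assumptions, and a trained model would take the place of score_patch.

```python
def iter_patches(image, patch_size):
    """Yield non-overlapping square patches from a 2D list of pixel values."""
    height = len(image)
    width = len(image[0]) if height else 0
    for top in range(0, height - patch_size + 1, patch_size):
        for left in range(0, width - patch_size + 1, patch_size):
            yield [row[left:left + patch_size]
                   for row in image[top:top + patch_size]]


def predict_image_quality(image, score_patch, patch_size=48):
    """Average the per-patch scores into a single score for the whole image.

    score_patch is any callable mapping a patch to a number; here it is a
    placeholder for a trained model (assumption, not the repository's API).
    """
    scores = [score_patch(patch) for patch in iter_patches(image, patch_size)]
    if not scores:
        raise ValueError("image is smaller than one patch")
    return sum(scores) / len(scores)


# Toy 4x4 "image" and a stand-in scorer that returns the mean pixel value.
toy_image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [1, 1, 0, 0],
    [1, 1, 0, 0],
]


def mean_pixel(patch):
    return sum(sum(row) for row in patch) / (len(patch) * len(patch[0]))


toy_score = predict_image_quality(toy_image, mean_pixel, patch_size=2)
```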

Requirements

PyTorch 0.4.1
pytorch/ignite
