❓ What is this

This repository includes a workflow that addresses common problems digital libraries encounter when archiving images of corpora. The common problem types are:

Rotation correction in multiples of 90 degrees.
Cropping the images so that a small fraction, or none, of the background remains.
Safely splitting images into two, in case the image set contains two-pagers.

The code is not wrapped in a UI yet, but the functionalities are all present (of course, future updates will be expected), even including QA and evaluation methods. This workflow uses ⚙️methods⚙️ such as

Pre-processing using the EAST AI model,
CNN categorization,
Radon transform,
Fourier transform,
adaptive binarization,
opening & closing,
edge detection.

📄 The Image correction workflow

The image below illustrates the five-step workflow used in this project, along with their corresponding visual representations at each stage.

Each Python script in this repository is named according to the step it implements, making it easy to follow the full pipeline from start to finish.

Each script handles a distinct part of the image processing pipeline, from data loading and preprocessing to rotation correction and result export.

🤔 Why not merge all Python scripts into one automatic script?

Some steps make mistakes, so human intervention, like QA steps, needs to be done periodically. This is why step 4 has two branches, as shown in the picture below. One of them is used to split the image, while the other one is used to merge the images that have been processed incorrectly.

❗ More about splitting pages

The image below shows two types of processing scripts for splitting and merging images, respectively.

"Automatic" scripts are configured in a way so that the user only needs to drag the image in question into the processing folder, and the script will handle them one by one. You may want to use this type of script because the computer can process the image concurrently with the user when the user is selecting images to be processed. Yes, selecting the two- paged images are manual, and I haven't found a perfect automatic way to identify them.

"Manual" scripts are to be used when you already know or have constructed a folder with two-page images only. Read the script comments for more information.

You will also see a script named "4 add_left_or_right_margin.py". This is used to add the left vertical portion of the right image to the left image and vice versa. You may want to use this whenever the texts are too close to the central crease, or when the splitting lines are too close to the texts. After applying it, the reader can be more confident that the cut did not accidentally split texts on the pages.

What do you do after identifying and merging the poorly cut pages? This online batch cropping tool is extremely fast and useful for this type of manual workflow. https://www.imgtools.co/crop-image Remember to duplicate the images before uploading and splitting them on this website so that you can cut the left and right versions of the page at once.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
1_gather_images.py		1_gather_images.py
2.1_CNN_4_categories_ver2.py		2.1_CNN_4_categories_ver2.py
2.1_CNN_4_validate.py		2.1_CNN_4_validate.py
2.1_CNN_finetune.py		2.1_CNN_finetune.py
2.1_retrieve_h5_param.py		2.1_retrieve_h5_param.py
2.1_sample_selection2.py		2.1_sample_selection2.py
2.1_slice_text2.py		2.1_slice_text2.py
2.2_300_rever_engineer_7patienceGray.h5		2.2_300_rever_engineer_7patienceGray.h5
2.2_method1_radonOnly.py		2.2_method1_radonOnly.py
2.2_method3_CNN_rotation.py		2.2_method3_CNN_rotation.py
2.2_radonMethod_successRate_test.py		2.2_radonMethod_successRate_test.py
3_cropping.py		3_cropping.py
4_add_left_or_right_margin.py		4_add_left_or_right_margin.py
4_automatic_merge.py		4_automatic_merge.py
4_automatic_rough_split.py		4_automatic_rough_split.py
4_automatic_split.py		4_automatic_split.py
4_merge_given_folder.py		4_merge_given_folder.py
4_split_given_folder.py		4_split_given_folder.py
5_folder_tree_reconstruction.py		5_folder_tree_reconstruction.py
LICENSE		LICENSE
README.md		README.md
experimental or visualizations.zip		experimental or visualizations.zip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

❓ What is this

📄 The Image correction workflow

🤔 Why not merge all Python scripts into one automatic script?

❗ More about splitting pages

📂 File Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

❓ What is this

📄 The Image correction workflow

🤔 Why not merge all Python scripts into one automatic script?

❗ More about splitting pages

📂 File Structure

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages