Text_alignment_and_segmentation

This project was part of my Master's Thesis Project during spring 2023

The system takes an image of a handwritten page document as input and segments and aligns the image to a ground truth. In the case that a ground truth is not available, the algorithm allows for manual transcription of the segmentation. Where the segmentation fails to recognize text, it is possible to correct the boxes during the process. Bayesian optimisation is used for automatically setting reasonable parameters. The resulting report from the thesis project can be found at:

http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-506000

Prerequisites

To be able to run the pipeline in its entirety, see requirements.txt for required packages.

Usage

Clone this repository by using:

> git clone https://github.com/PhilipMacCormack/Text_alignment_and_segmentation

cd into Text_alignment_and_segmentation:
Install packages from requirements.txt
Open main.py
Input parameters in the script
Run main.py

> python main.py

Follow the procedure from the terminal until finish

Output

The output from the algorithm can be found in Text_alignment_and_segmentation/Results/{file}. The output consists of several saved images from the process as well as an xml file containing the final alignment of the document. Individual line, and word images from the segmentation can also be found, in results/{file}/lines.

References

This algorithm is partly based upon work from:

https://github.com/KadenMc/PreprocessingHTR

https://github.com/harshavkumar/word_segmentation

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Bayesian		Bayesian
Method_1		Method_1
Method_2		Method_2
results		results
.gitattributes		.gitattributes
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text_alignment_and_segmentation

Prerequisites

Usage

Output

References

About

Releases

Packages

Languages

PhilipMacCormack/Text_alignment_and_segmentation

Folders and files

Latest commit

History

Repository files navigation

Text_alignment_and_segmentation

Prerequisites

Usage

Output

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages