ABL-HD

🌟 New! ABLkit released: A toolkit for Abductive Learning with high flexibility, easy-to-use interface, and optimized performance. Welcome to try it out!🚀

KESAR: Knowledge Enhanced Historical Document Sagmentation and Recognition

This is the code for KESAR, an Abductive Learning-based model training method for historical document segmentation and recognition.

Publication

"Knowledge-Enhanced Historical Document Segmentation and Recognition". En-Hao Gao, Yu-Xuan Huang, Wen-Chao Hu, Xin-Hao Zhu, Wang-Zhou Dai. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI’24), Vancouver, Canada, 2024, pp.8409-8416 (Oral).[paper]

Environment

conda create -n kesar python=3.7
conda activate kesar
pip install torch==1.11.0+cu113 torchvision==0.12.0+cu113 torchaudio==0.11.0 --extra-index-url https://download.pytorch.org/whl/cu113
pip install onnx onnxruntime-gpu python-opencv scipy scikit-image shapely tqdm segmentation_models_pytorch

Inference

You need to change 3 paths before inference.

Segmentation model path, Line 19 ~ 22 in TestModel.py
Word model path, Line 30 ~ 31 in TestModel.py
Image path, Line 116 in TestModel.py

After changing these paths, you can run the following commond to conduct inference:

python TestModel.py

Results will be saved into the "outputs/" folder.

Citation

@inproceedings{KESAR2024Gao,
  author     = {Gao, En-Hao and Huang, Yu-Xuan and Hu, Wen-Chao and Zhu, Xin-Hao and Dai, Wang-Zhou},
  title      = {Knowledge-Enhanced Historical Document Segmentation and Recognition},
  booktitle  = {Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI'24)},
  pages      = {8409--8416},
  year       = {2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
images		images
outputs		outputs
weights		weights
README.md		README.md
TestModel.py		TestModel.py
craft.py		craft.py
craft_utils.py		craft_utils.py
dictionary.txt		dictionary.txt
imgproc.py		imgproc.py
inference_boxes.py		inference_boxes.py
match_column_and_character.py		match_column_and_character.py
ordering_utils.py		ordering_utils.py
resnet50.py		resnet50.py
word_inference.py		word_inference.py
word_nn.py		word_nn.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ABL-HD

KESAR: Knowledge Enhanced Historical Document Sagmentation and Recognition

Publication

Environment

Inference

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ABL-HD

KESAR: Knowledge Enhanced Historical Document Sagmentation and Recognition

Publication

Environment

Inference

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages