Skip to content

Model and predictions for results presented in the paper "Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model"

Notifications You must be signed in to change notification settings

omni-us/research-das2018-joint-text-entities-results

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This repository includes the weights of the model and the corresponding predictions on the test set for one of the results presented in the paper Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model. The model and predictions correspond to the result in the last row of Table III in the paper.

The model weights are provided both as a Torch 7 file model.t7, which can be used with the original framework Laia used in the experiments, and also provided as a state dictionary model.pth that could be used in pytorch. The symbols.txt file lists the characters and semantic symbols for the model.

The test set prediction results are shared in page xml format. They include the recognized text, the predicted category and person for each word, and prediction confidences. The words also include bounding boxes, though these are only a crude approximation, which in general are not very accurate.

The original test set images and the xmls including the ground truth can be obtained from the website of the Information Extraction in Historical Handwritten Records challenge. After downloading the images, copy or symlink them to the results/ directory. Then you can use for example nw-page-editor to visualize the predictions. Also included are two css files to visualize the results highlighting with colors the different categories and persons. The different ways to visualize the results are the following:

nw-page-editor results/*.xml
nw-page-editor --css viz/nw-page-editor-category.css results/*.xml
nw-page-editor --css viz/nw-page-editor-person.css results/*.xml

Below are screenshots of an example visualization.

Full window visualization

Zoomed in word visualization

In the code/ directory you can a python scripts for converting the xmls into structured json files. This script requires the pagexml library to work.

If you use any of the files in this repository please include a link to this github project and cite the paper:

@inproceedings{Carbonell18_DAS,
  author    = {Manuel Carbonell and
               Mauricio Villegas and
               Alicia Forn\'es and
               Josep Llad\'os},
  title     = {Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model},
  booktitle = {13th {IAPR} Workshop on Document Analysis Systems, {DAS} 2018, Vienna,
               Austria, April 24-27, 2018},
  pages     = {399--404},
  year      = {2018},
  doi       = {10.1109/DAS.2018.52},
  isbn      = {978-1-5386-3346-5},
}

About

Model and predictions for results presented in the paper "Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model"

Resources

Stars

Watchers

Forks