About OCR_aligned and Lost or missing text #11

USTCHJY · 2018-07-07T02:25:42Z

Hi,
I'm working on the OCR post-correction tasks and Ochre really helps me a lot. But I still have some questions looking forward to your reply.
When using the Ochre for OCR post-correction tasks,we only have the OCR_input . So how can I get OCR_aligned from OCR_input without gs? Otherwise,how to deal with the Lost or missing text without aligned text?
Thanks!

jvdzwaan · 2018-07-09T08:08:14Z

The task ochre performs is a supervised machine learning task. So, without gold standard, you can't create aligned data or train a (supervised) model.

USTCHJY · 2018-07-09T08:26:21Z

Sorry,maybe I expressed not clearly.
I mean after supervised training(for training data,we must have gold standard),how can I use this trained ochre model for actual OCR post-correction tasks? Because for actual tasks,we usually don't have gold standard and desire to get corrected text which similiar to the gold standard. On this occasion,how can I get OCR_aligned from the raw OCR_input of the actual tasks?
Thanks！

jvdzwaan · 2018-07-17T10:37:29Z

The README specifies how to use a trained model to do post correction: https://github.com/KBNLresearch/ochre#ocr-post-correction

If you want to calculate performance for this text, you'd still need to have ground truth/gold standard.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About OCR_aligned and Lost or missing text #11

About OCR_aligned and Lost or missing text #11

USTCHJY commented Jul 7, 2018

jvdzwaan commented Jul 9, 2018

USTCHJY commented Jul 9, 2018

jvdzwaan commented Jul 17, 2018

About OCR_aligned and Lost or missing text #11

About OCR_aligned and Lost or missing text #11

Comments

USTCHJY commented Jul 7, 2018

jvdzwaan commented Jul 9, 2018

USTCHJY commented Jul 9, 2018

jvdzwaan commented Jul 17, 2018