OCR of transcriptions, batch import before crowd sourcing #29

martinantonmueller · 2019-07-01T12:52:13Z

I have 25.000 pages of newspaper clippings and 25.000 txt-files with an OCR of those clippings. It would be nice to be able to upload those transcriptions to the Scripto-field before crowd-sourcing. But I don't see how to address the relation uris between OMEKA and Scripto. Any idea?

jimsafley · 2019-07-03T00:49:09Z

I see no simple way to do this. One way is to write a script that iterates through the items and POSTs to admin/scripto/index/page-action using the following form data:

page_action: edit
page: transcription
item_id: [the item ID]
file_id: [the file ID]
wikitext: [OCR text]

martinantonmueller · 2019-07-03T07:42:24Z

Thank you! I'll check that out!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OCR of transcriptions, batch import before crowd sourcing #29

OCR of transcriptions, batch import before crowd sourcing #29

martinantonmueller commented Jul 1, 2019

jimsafley commented Jul 3, 2019 •

edited

martinantonmueller commented Jul 3, 2019

OCR of transcriptions, batch import before crowd sourcing #29

OCR of transcriptions, batch import before crowd sourcing #29

Comments

martinantonmueller commented Jul 1, 2019

jimsafley commented Jul 3, 2019 • edited

martinantonmueller commented Jul 3, 2019

jimsafley commented Jul 3, 2019 •

edited