Skip to content

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

License

Notifications You must be signed in to change notification settings

tboenig/17_fontmix_simple

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

17_fontmix_simple

Ground Truth with a collection of documents with the following characteristics: fonts blackletter and antiqua, ancient Greek, Hebrew, initials, with title page, colour chart

Metadata

Language:
grc, heb, deu
Format:
Page-XML
Time:
1600-1700
GT Type:
data_structure_and_text
License:
CC0 1.0
Transcription Guidelines:
OCR-D Ground Truth Guidelines https://ocr-d.de/en/gt-guidelines/trans/
Project:
OCR-D
Project-URL:
https://ocr-d.de/

Sources

The volume of transcriptions:

TextLine Page TxtRegion GraphRegion SepRegion
332 12 81 4 3

List of transcriptions

document TxtRegion ImgRegion LineDrawRegion GraphRegion TabRegion ChartRegion SepRegion MathRegion ChemRegion MusicRegion AdRegion NoiseRegion UnkownRegion CustomRegion TextLine Page
bohse_helicon_1696 35 3 2 121 5
weigel_gnothi02_1618 24 1 130 4
rollenhagen_reysen_1603 22 1 81 3

Extent

In this section they can insert additional information, instructions or notes.

About

This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published