Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add new gt metadata yml files #143

Open
wants to merge 9 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
50 changes: 50 additions & 0 deletions catalog/ocr-d/gt_structure_1_1_METADATA_htr_united.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,50 @@
schema: https://htr-united.github.io/schema/2023-06-27/schema.json
title: gt_structure_1_1
url: https://github.com/OCR-D/gt_structure_1_1
authors:
- name: Matthias
surname: Boenig
orcid: 0000-0003-4615-4753
roles:
- transcriber
- aligner
- project-manager
- quality-control
- digitization
- support
institutions: []
description: >-
The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D. Corrections and extensions can be reported, please use the Issues.
project-name: OCR-D
project-website: https://ocr-d.de/
language:
- deu
production-software: Aletheia
automatically-aligned: false
script:
- iso: Latn
- iso: Latf
script-type:
only-typed
time:
notAfter: '1920'
notBefore: '1600'
hands:
count: unknown
precision: exact
license:
name: CC0-1.0
url: https://creativecommons.org/licenses/zero/1.0/
format: Page-XML
volume:
- count: 0
metric: characters
- count: 1317
metric: files
- count: 0
metric: lines
- count: 9457
metric: regions
citation-file-link: https://github.com/OCR-D/gt_structure_1_1/blob/main/CITATION.cff
transcription-guidelines: >-
OCR-D-GT-Guideline, Part: Structure Ground Truth https://ocr-d.de/en/gt-guidelines/trans/structur_gt.html
50 changes: 50 additions & 0 deletions catalog/ocr-d/gt_structure_1_2_METADATA_htr_united.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,50 @@
schema: https://htr-united.github.io/schema/2023-06-27/schema.json
title: gt_structure_1_2
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
title: gt_structure_1_2
title: OCR-D gt_structure_1_2

url: https://github.com/OCR-D/gt_structure_1_2
authors:
- name: Matthias
surname: Boenig
orcid: 0000-0003-4615-4753
roles:
- transcriber
- aligner
- project-manager
- quality-control
- digitization
- support
institutions: []
description: >-
The repo gt_structure_1_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D. Corrections and extensions can be reported, please use the Issues.
project-name: OCR-D
project-website: https://ocr-d.de/
language:
- deu
production-software: Aletheia
automatically-aligned: false
script:
- iso: Latn
- iso: Goth
script-type:
only-typed
time:
notAfter: '1900'
notBefore: '1600'
hands:
count: unknown
precision: exact
license:
name: CC0-1.0
url: https://creativecommons.org/licenses/zero/1.0/
format: Page-XML
volume:
- count: 0
metric: characters
- count: 1336
metric: files
- count: 0
metric: lines
- count: 7224
metric: regions
citation-file-link: https://github.com/OCR-D/gt_structure_1_2/blob/main/CITATION.cff
transcription-guidelines: >-
OCR-D-GT-Guideline, Part: Structure Ground Truth https://ocr-d.de/en/gt-guidelines/trans/structur_gt.html
50 changes: 50 additions & 0 deletions catalog/ocr-d/gt_structure_1_3_METADATA_htr_united.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,50 @@
schema: https://htr-united.github.io/schema/2023-06-27/schema.json
title: gt_structure_1_3
url: https://github.com/OCR-D/gt_structure_1_3
authors:
- name: Matthias
surname: Boenig
orcid: 0000-0003-4615-4753
roles:
- transcriber
- aligner
- project-manager
- quality-control
- digitization
- support
institutions: []
description: >-
The repo gt_structure_1_3 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D. Corrections and extensions can be reported, please use the Issues.
project-name: OCR-D
project-website: https://ocr-d.de/
language:
- deu
production-software: Aletheia
automatically-aligned: false
script:
- iso: Latn
- iso: Latf
script-type:
only-typed
time:
notAfter: '1900'
notBefore: '1600'
hands:
count: unknown
precision: exact
license:
name: CC0-1.0
url: https://creativecommons.org/licenses/zero/1.0/
format: Page-XML
volume:
- count: 0
metric: characters
- count: 1315
metric: files
- count: 0
metric: lines
- count: 8996
metric: regions
citation-file-link: https://github.com/OCR-D/gt_structure_1_3/blob/main/CITATION.cff
transcription-guidelines: >-
OCR-D-GT-Guideline, Part: Structure Ground Truth https://ocr-d.de/en/gt-guidelines/trans/structur_gt.html
50 changes: 50 additions & 0 deletions catalog/ocr-d/gt_structure_1_4_METADATA_htr_united.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,50 @@
schema: https://htr-united.github.io/schema/2023-06-27/schema.json
title: gt_structure_1_4
url: https://github.com/OCR-D/gt_structure_1_4
authors:
- name: Matthias
surname: Boenig
orcid: 0000-0003-4615-4753
roles:
- transcriber
- aligner
- project-manager
- quality-control
- digitization
- support
institutions: []
description: >-
The repo gt_structure_1_4 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D. Corrections and extensions can be reported, please use the Issues.
project-name: OCR-D
project-website: https://ocr-d.de/
language:
- deu
production-software: Aletheia
automatically-aligned: false
script:
- iso: Latn
- iso: Latf
script-type:
only-typed
time:
notAfter: '1900'
notBefore: '1600'
hands:
count: unknown
precision: exact
license:
name: CC0-1.0
url: https://creativecommons.org/licenses/zero/1.0/
format: Page-XML
volume:
- count: 0
metric: characters
- count: 1256
metric: files
- count: 0
metric: lines
- count: 5675
metric: regions
citation-file-link: https://github.com/OCR-D/gt_structure_1_4/blob/main/CITATION.cff
transcription-guidelines: >-
OCR-D-GT-Guideline, Part: Structure Ground Truth https://ocr-d.de/en/gt-guidelines/trans/structur_gt.html
50 changes: 50 additions & 0 deletions catalog/ocr-d/gt_structure_2_1_METADATA_htr_united.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,50 @@
schema: https://htr-united.github.io/schema/2023-06-27/schema.json
title: gt_structure_2_1
url: https://github.com/OCR-D/gt_structure_2_1
authors:
- name: Matthias
surname: Boenig
orcid: 0000-0003-4615-4753
roles:
- transcriber
- aligner
- project-manager
- quality-control
- digitization
- support
institutions: []
description: >-
The repo gt_structure_2_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D. Corrections and extensions can be reported, please use the Issues.
project-name: OCR-D
project-website: https://ocr-d.de/
language:
- deu
production-software: Aletheia
automatically-aligned: false
script:
- iso: Latn
- iso: Latf
script-type:
only-typed
time:
notAfter: '1900'
notBefore: '1600'
hands:
count: unknown
precision: exact
license:
name: CC0-1.0
url: https://creativecommons.org/licenses/zero/1.0/
format: Page-XML
volume:
- count: 0
metric: characters
- count: 1614
metric: files
- count: 0
metric: lines
- count: 7642
metric: regions
citation-file-link: https://github.com/OCR-D/gt_structure_2_1/blob/main/CITATION.cff
transcription-guidelines: >-
OCR-D-GT-Guideline, Part: Structure Ground Truth https://ocr-d.de/en/gt-guidelines/trans/structur_gt.html
50 changes: 50 additions & 0 deletions catalog/ocr-d/gt_structure_2_2_METADATA_htr_united.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,50 @@
schema: https://htr-united.github.io/schema/2023-06-27/schema.json
title: gt_structure_2_2
url: https://github.com/OCR-D/gt_structure_2_2
authors:
- name: Matthias
surname: Boenig
orcid: 0000-0003-4615-4753
roles:
- transcriber
- aligner
- project-manager
- quality-control
- digitization
- support
institutions: []
description: >-
The repo gt_structure_2_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D. Corrections and extensions can be reported, please use the Issues.
project-name: OCR-D
project-website: https://ocr-d.de/
language:
- deu
production-software: Aletheia
automatically-aligned: false
script:
- iso: Latn
- iso: Latf
script-type:
only-typed
time:
notAfter: '1900'
notBefore: '1600'
hands:
count: unknown
precision: exact
license:
name: CC0-1.0
url: https://creativecommons.org/licenses/zero/1.0/
format: Page-XML
volume:
- count: 0
metric: characters
- count: 891
metric: files
- count: 0
metric: lines
- count: 3957
metric: regions
citation-file-link: https://github.com/OCR-D/gt_structure_2_2/blob/main/CITATION.cff
transcription-guidelines: >-
OCR-D-GT-Guideline, Part: Structure Ground Truth https://ocr-d.de/en/gt-guidelines/trans/structur_gt.html
50 changes: 50 additions & 0 deletions catalog/ocr-d/gt_structure_2_3_METADATA_htr_united.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,50 @@
schema: https://htr-united.github.io/schema/2023-06-27/schema.json
title: gt_structure_2_3
url: https://github.com/OCR-D/gt_structure_2_3
authors:
- name: Matthias
surname: Boenig
orcid: 0000-0003-4615-4753
roles:
- transcriber
- aligner
- project-manager
- quality-control
- digitization
- support
institutions: []
description: >-
The repo gt_structure_2_3 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D. Corrections and extensions can be reported, please use the Issues.
project-name: OCR-D
project-website: https://ocr-d.de/
language:
- deu
production-software: Aletheia
automatically-aligned: false
script:
- iso: Latn
- iso: Latf
script-type:
only-typed
time:
notAfter: '1900'
notBefore: '1600'
hands:
count: unknown
precision: exact
license:
name: CC0-1.0
url: https://creativecommons.org/licenses/zero/1.0/
format: Page-XML
volume:
- count: 0
metric: characters
- count: 1160
metric: files
- count: 0
metric: lines
- count: 5289
metric: regions
citation-file-link: https://github.com/OCR-D/gt_structure_2_3/blob/main/CITATION.cff
transcription-guidelines: >-
OCR-D-GT-Guideline, Part: Structure Ground Truth https://ocr-d.de/en/gt-guidelines/trans/structur_gt.html
Loading
Loading