Skip to content

Enhanced Stand-off TEI Annotation with StoReCo: A generic approach with the use of RDF.

Notifications You must be signed in to change notification settings

KardungLa/StoReCo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 

Repository files navigation

Enhanced Stand-off TEI Annotation with StoReCo: A generic approach with the use of RDF.

Author: Daniel Schlager "email at danielschlager.at"

Abstract

The Text Encoding Initiative (TEI) is the de facto standard for the representation of texts in digital form, especially in the digital humanities of high standing due to its diverse application spectrum. Due to the increasing number and depth of analysis possibilities and the need for overlapping annotations such as Entity Recognition, Part of Speech Tagging, syntactic analysis, etc., the hierarchical structure soon reaches the boundaries of what is achievable. The usual way of storing primary data and annotation layers in a file results in a non-well-formed XML instance. There are different approaches to stand-off notation recommended by the Text Encoding Initiative. However, the most recent approach using Xpointer seems to be too complicated, or there are not yet enough tools to make it more accessible to use. Also, the readability of the document in the XML source code suffers from it. The StoReCo called Standoff Markup Resolver and Compiler joins the XML/TEI document with one or more RDF sources, which can be available as an XML-RDF document, Turtle, N3, or triple store/Sparql endpoint and produces an annotated PDF, HTML or an interactive viewer. With this technique, an unlimited number of annotation levels can be introduced. The XML/TEI document is not enriched with additional markup elements and remains in the original for the producer. The link between TEI and RDF opens up new possibilities for use in Linked Open Data.

StoReCo CLI and Library

Please see https://github.com/KardungLa/StoReCo/tree/master/storeco for more about the StoReCo command line interface and Java Library.

StoReCo UI (not published yet)

Please see https://github.com/KardungLa/StoReCo/tree/master/storeco-ui for more about the User Interface.

StoReCo Server (not published yet)

Please see https://github.com/KardungLa/StoReCo/tree/master/storeco-server for more about the Backend Service.

License

Copyright © 2020-2021 Daniel Schlager

This program and the accompanying materials are made available under the terms of the Eclipse Public License 2.0 which is available at http://www.eclipse.org/legal/epl-2.0.

This Source Code may also be made available under the following Secondary Licenses when the conditions for such availability set forth in the Eclipse Public License, v. 2.0 are satisfied: GNU General Public License as published by the Free Software Foundation, either version 2 of the License, or (at your option) any later version, with the GNU Classpath Exception which is available at https://www.gnu.org/software/classpath/license.html.

About

Enhanced Stand-off TEI Annotation with StoReCo: A generic approach with the use of RDF.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published