Documentation

elisasibarani edited this page Jun 15, 2016 · 11 revisions

Overview

Krextor, the KWARC RDF Extractor, is an extensible XSLT-based framework for extracting RDF from XML. It supports multiple input languages as well as multiple output RDF notations. Krextor provides convenience templates that try to do “the right thing”™ in many common cases, so as to reduce the need to write repetitive code by hand. The Publications provide further background on the design, requirements, and use cases behind Krextor.

Semantics

The extracted RDF graph will in most cases be an outline of the semantic structure of an XML document, abstracting from the concrete syntax. It makes the knowledge contained in XML documents easier to exchange and interlink on the Semantic Web. Many tools support querying RDF with languages such as SPARQL. If the extracted RDF is backed by an expressive ontology, a reasoner can be used to infer additional knowledge from it.
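To make the idea of "an outline of the semantic structure" concrete, here is a minimal sketch in Python. It is not Krextor code (Krextor itself is written in XSLT) and does not use any Krextor API. The input document, the `BASE` and `VOCAB` namespaces, the `hasPart` property, and the rule "every element with an `id` becomes a resource" are all assumptions made for illustration only.

```python
import xml.etree.ElementTree as ET

# Hypothetical input; element and attribute names are invented for this sketch.
XML = """<document id="doc1">
  <section id="sec1">
    <title>Introduction</title>
  </section>
  <section id="sec2">
    <title>Results</title>
  </section>
</document>"""

BASE = "http://example.org/doc#"      # assumed base URI for resources
VOCAB = "http://example.org/vocab#"   # assumed vocabulary namespace
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def extract(element, parent_uri=None, triples=None):
    """Walk the XML tree and collect (subject, predicate, object) tuples.

    Assumed rule: an element with an id becomes a typed resource that is
    linked to its enclosing resource; an element without an id contributes
    its text as a property value of the enclosing resource.
    """
    if triples is None:
        triples = []
    uri = parent_uri
    ident = element.get("id")
    if ident is not None:
        uri = BASE + ident
        triples.append((uri, RDF_TYPE, VOCAB + element.tag.capitalize()))
        if parent_uri is not None:
            triples.append((parent_uri, VOCAB + "hasPart", uri))
    elif parent_uri is not None and element.text and element.text.strip():
        # Plain string object; a real extractor would mark this as a literal.
        triples.append((parent_uri, VOCAB + element.tag, element.text.strip()))
    for child in element:
        extract(child, uri, triples)
    return triples


triples = extract(ET.fromstring(XML))
for triple in triples:
    print(triple)
```

The resulting triples describe which sections the document contains and what their titles are, without any reference to the XML nesting or syntax that encoded that information.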

Supported Formats

Krextor comes with a number of extraction and output modules. Support for additional formats is easy to add. Please let us know if you have written an extraction or output module, test case, or documentation that you would like us to include in the Krextor default distribution.

Input Formats (Extraction Modules; varying stability)

The following input formats are already supported. Others are easy to add: copy an existing extraction module to get started.

Output Formats (all stable)

  • sequence of triples:
  • grouped triples (first by common subject, then by common predicate; implemented as post-processing of RXR for now):
    • turtle: Turtle ([source](https://github.com/EIS-Bonn/krextor/blob/master/src/xslt/output/turtle.xsl))
    • rdf-xml: RDF/XML ([source](https://github.com/EIS-Bonn/krextor/blob/master/src/xslt/output/rdf-xml.xsl))
    • rdfa: RDFa (for inclusion into XSLTs that render XML to XHTML+RDFa; [source](https://github.com/EIS-Bonn/krextor/blob/master/src/xslt/output/util/rdfa.xsl))
  • java: Java callback for every triple ([source](https://github.com/EIS-Bonn/krextor/blob/master/src/xslt/output/java.xsl))
  • none: no output; for testing ([source](https://github.com/EIS-Bonn/krextor/blob/master/src/xslt/output/none.xsl))
  • YourOwnOutput
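The difference between a flat sequence of triples and grouped triples (first by common subject, then by common predicate) can be sketched as follows. This is an illustrative Python sketch, not Krextor's XSLT post-processing. The function names `group_triples` and `to_turtle_like` and the example URIs are hypothetical, and the serializer only handles URI objects.

```python
def group_triples(triples):
    """Group (s, p, o) tuples first by subject, then by predicate.

    Insertion order is preserved, so subjects and predicates appear in the
    order in which they were first seen.
    """
    grouped = {}
    for s, p, o in triples:
        grouped.setdefault(s, {}).setdefault(p, []).append(o)
    return grouped


def to_turtle_like(grouped):
    """Serialize grouped triples using Turtle's ';' and ',' abbreviations.

    Assumption: every subject, predicate, and object is a URI.
    """
    blocks = []
    for s, predicates in grouped.items():
        parts = []
        for p, objects in predicates.items():
            object_list = ", ".join(f"<{o}>" for o in objects)
            parts.append(f"    <{p}> {object_list}")
        blocks.append(f"<{s}>\n" + " ;\n".join(parts) + " .")
    return "\n".join(blocks)


EX = "http://example.org/"  # hypothetical namespace
triples = [
    (EX + "a", EX + "knows", EX + "b"),
    (EX + "a", EX + "knows", EX + "c"),
    (EX + "a", EX + "likes", EX + "b"),
    (EX + "b", EX + "knows", EX + "a"),
]

grouped = group_triples(triples)
print(to_turtle_like(grouped))
```

Grouping pays off for the notations listed under "grouped triples", where statements that share a subject or predicate can be abbreviated instead of repeated in full.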

Usage

See Usage

Source code documentation

(generated using XSLTdoc)

  • [latest version (trunk)](https://github.com/EIS-Bonn/krextor/blob/master/doc/xsltdoc/index.html)

External documentation