Ferenda is a python library and framework for transforming unstructured document collections into structured Linked Data. It helps with downloading documents, parsing them to add explicit semantic structure and RDF-based metadata, finding relationships between documents, and republishing the results.
intro firststeps createdocrepos keyconcepts docmetadata elementclasses fsmparser citationparsing readers facets toc news wsgi restapi external-dbs testing advanced
api/documentrepository api/document api/documententry api/documentstore api/facet api/resourceloader api/tocpage api/tocpageset api/feed api/feedset api/elements api/elements-html api/describer api/transformer api/fsmparser api/citationparser api/uriformatter api/triplestore api/fulltextindex api/textreader api/pdfreader api/pdfanalyzer api/wordreader api/wsgiapp api/resources api/compositerepository
api/util api/citationpatterns api/uriformats api/manager api/testutil
api/decorators
api/errors
docrepo/keyword docrepo/mediawiki docrepo/sitenews docrepo/skeleton docrepo/static docrepo/tech docrepo/legal-eu docrepo/legal-se api/devel
changelog
genindex
modindex
search