Metablock reads RDF data and writes a possibly modified resource description back to a RDF target.
- SPARQL service endpoints
- RDF data files
- OAI data sources with XSLT lifting
- RDBMS with XSLT lifting
- Virtuoso RDF storage
- Jena TDB storage
- RDF data files
- Solr search index with XSLT transformation
- configured by lib/metablock.ttl
- compile:
CPATH=war/jetty/lib/servlet-api-3.1.jar:lib:lib/*
javac -cp $CPATH -d lib src/org/metablock/rest/*
jar cf lib/metablock.jar -C lib org
- start autobib daemon:
abd start
To build a Solr search index from a SPARQL service endpoint, three steps are required:
-
Resource Enumeration: List all resources that should be indexed,
-
Resource Dump: Query everything the triple store knows about a resource,
-
Resource Tranformation: Transform RDF/XML to solr index format.
Step 1. and 2. need a SPARQL query, step 3 works with XSLT.
The sparql queries and xslt transformations used so far are rather general,
but modelling of bibliographic resources may vary and require modification.
All configurations are done in turtle (see lib/metablock.ttl)
RDF data sources configured in lib/metablock.ttl can be tested with
java -jar metablock.jar -s [source] -t [target] -test
Copy from RDF source to RDF target
java -jar metablock.jar -s [source] -t [target]
PDF Analyzer: utilizes Grobid / CERMINE to extract metadata and bibliographical references from scientific articles
java -jar metablock.jar -s [source] -t [target] -e pdf
Reference Analysis: use external libraries to find citation context and determine citation polarity
java -jar metablock.jar -s [source] -t [target] -e sen
-
Index a directory of PDF files, enable pdf engine to extract metadata and write to VuFind:
java -jar metablock.jar -crawl -s files -t solr1 -e pdf Documents
-
Write DSpace metadata from DSpace REST API to Virtuoso triplestore (experimental):
java -jar metablock.jar -crawl -s dspace -t virt
-
Crawl OAI sources to a jena TDB store:
java -jar metablock.jar -crawl -s oai -t tdb
-
Build a search index for a TDB triple store
java -jar metablock.jar -crawl -s tdb -t solr1
Javadoc is available online.