The Aozora Bunko corpus Converter (ABC) is a library and system for converting texts and metadata from the public Aozora Bunko GitHub repository into TEI P5 XML and Linked Open Data.
Use either the build tool Boot or the official Clojure CLI tools.
Extracting LOD from a local clone of the Aozora Bunko repository:
clojure -m abc.core -i ../../Dependencies/aozorabunko -o dist
This will save a Turtle-formatted file aozora-bunko.ttl
under dist/
.
For CLI tools:
clj
For boot:
boot dev
And then connect with your favorite editor.
Run all tests:
clojure -Atest:runner
Continuously running test process for use during development:
boot watch deps-test bat-test
Copyright © 2018 Bor Hodošček
Distributed under the Eclipse Public License either version 1.0 or (at your option) any later version.