Skip to content

borh/abc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Aozora Bunko corpus Converter

The Aozora Bunko corpus Converter (ABC) is a library and system for converting texts and metadata from the public Aozora Bunko GitHub repository into TEI P5 XML and Linked Open Data.

Usage

Use either the build tool Boot or the official Clojure CLI tools.

CLI

Extracting LOD from a local clone of the Aozora Bunko repository:

clojure -m abc.core -i ../../Dependencies/aozorabunko -o dist

This will save a Turtle-formatted file aozora-bunko.ttl under dist/.

Interactive Access

For CLI tools:

clj

For boot:

boot dev

And then connect with your favorite editor.

Testing

Run all tests:

clojure -Atest:runner

Continuously running test process for use during development:

boot watch deps-test bat-test

License

Copyright © 2018 Bor Hodošček

Distributed under the Eclipse Public License either version 1.0 or (at your option) any later version.

Releases

No releases published

Packages

No packages published