Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)
Switch branches/tags
Nothing to show
Clone or download
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
dist added content in lib directory Sep 15, 2016
lib added content in lib directory Sep 15, 2016
resources/xsl hm Feb 16, 2016
src . Feb 23, 2016
README.md first commit Oct 7, 2015
build.xml hm Feb 16, 2016

README.md

OpenConvert

The OpenConvert tools output TEI from a number of input formats.

Using the command line

The OpenConvert distribution can be accessed at https://github.com/INL/OpenConvert.

The command line can be used as follows:

java -jar OpenConvert.jar -from -to

Options:

-from input format: text, TEI, alto, doc, docx, HTML

-to output format: TEI, text or folia

Arguments:

input filename, directory name or zip archive name (ending with .zip)

output filename, directory name or zip archive name (ending with .zip)

If the from and to flags are omitted, the conversion to be applied will be guessed from file name extensions.