Skip to content
Parses essay annotation into FoLiA XML and back
Branch: develop
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
config
conversions
tests
.gitignore
.travis.yml
README.md
__init__.py
requirements.in
requirements.txt

README.md

essay-annotation

Parses essay annotation into FoLiA XML

This project contains scripts to convert Word documents with a specific markup (called essay annotation) to FoLiA XML. After the conversion is complete, there are scripts available to output the results to .csv- or .html-format. You can find details on the scripts below.

Preprocessing

Word to plain text

Call conversions/docx2txt.py with the specified file to convert the document to plain text files.

Plain text to FoLiA

Call conversions/essay2xml.py to convert the plain-text files to FoLiA.

Conversions

FoLiA to HTML

Call conversions/folia2html.py to convert the plain-text files to FoLiA.

FoLiA to .csv

Call conversions/xml2csv.py to convert the plain-text files to FoLiA.

FoLiA to .txt

Call conversions/xml2txt.py to convert the plain-text files to FoLiA.

You can’t perform that action at this time.