A Python parser for ALTO XML files, a format commonly used for OCR output.
from alto import parse_file
alto = parse_file('path/to/alto/file.xml')
print(alto.extract_words())
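To illustrate what word extraction from an ALTO file involves, here is a minimal standard-library sketch. It does not use or reproduce this package's implementation. The names `words_from_alto`, `local_name`, and `SAMPLE_ALTO` are hypothetical and exist only for this example. It relies on the ALTO convention that each recognized word is a `String` element whose text is stored in the `CONTENT` attribute.

```python
import xml.etree.ElementTree as ET

# Hypothetical, hand-written ALTO fragment used only for this illustration.
SAMPLE_ALTO = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v3#">
  <Layout>
    <Page ID="p1">
      <PrintSpace>
        <TextBlock ID="b1">
          <TextLine ID="l1">
            <String CONTENT="Hello" HPOS="10" VPOS="10" WIDTH="50" HEIGHT="12"/>
            <SP WIDTH="5"/>
            <String CONTENT="world" HPOS="65" VPOS="10" WIDTH="50" HEIGHT="12"/>
          </TextLine>
        </TextBlock>
      </PrintSpace>
    </Page>
  </Layout>
</alto>
"""


def local_name(tag):
    """Strip the '{namespace}' prefix so the code works across ALTO versions."""
    return tag.rsplit("}", 1)[-1]


def words_from_alto(xml_text):
    """Return the CONTENT of every String element, in document order."""
    root = ET.fromstring(xml_text)
    return [
        element.attrib["CONTENT"]
        for element in root.iter()
        if local_name(element.tag) == "String" and "CONTENT" in element.attrib
    ]


if __name__ == "__main__":
    print(words_from_alto(SAMPLE_ALTO))
```

Matching on the local tag name rather than a fixed namespace is a design choice for this sketch: ALTO versions use different namespace URIs, and ignoring the prefix keeps the example short.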
Stable Release: pip install alto-xml
Development Head: pip install git+https://github.com/envinorma/alto.git
For full package documentation please visit envinorma.github.io/alto.
See CONTRIBUTING.md for information related to development.
MIT license