If you want to use Tsakorpus, download the most recent version from this repository.
Tsakorpus supports corpora with morphological annotation, special gloss search, multi-word search, subcorpus selection, automatic transliteration, word distribution charts, parallel corpora, and media-aligned corpora. Multiple interface languages are supported with Flask-Babel.
Tsakorpus was tested on Windows and Ubuntu. Its dependencies are the following:
- Elasticsearch 7.x (tested on 7.6-7.12)
- Python >= 3.5
- Python modules: elasticsearch 7.x, flask, lxml, ijson, Flask-Babel, xlsxwriter (you can install them from requirements.txt)
- for converting media-aligned corpora: ffmpeg
- recommended for deployment: apache2 with WSGI, or nginx
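
To see quickly whether a machine meets the Python-side requirements above, a small script along the following lines may help. It is only a sketch and not part of Tsakorpus: the function names are hypothetical, and the import names are assumptions based on how these packages are normally imported (for example, Flask-Babel as `flask_babel`). It does not check the Elasticsearch server itself.

```python
import importlib.util
import shutil
import sys

# Import names for the Python modules listed above. Assumption: Flask-Babel
# is imported as flask_babel; the others use their package names.
REQUIRED_MODULES = ["elasticsearch", "flask", "lxml", "ijson",
                    "flask_babel", "xlsxwriter"]


def python_is_recent_enough(version_info=sys.version_info, minimum=(3, 5)):
    """Return True if the (major, minor) interpreter version is >= minimum."""
    return tuple(version_info[:2]) >= minimum


def missing_modules(names):
    """Return the top-level module names from `names` that cannot be found."""
    return [name for name in names
            if importlib.util.find_spec(name) is None]


def ffmpeg_available():
    """Return True if an ffmpeg executable is on PATH.

    ffmpeg is only needed for converting media-aligned corpora.
    """
    return shutil.which("ffmpeg") is not None


if __name__ == "__main__":
    print("Python >= 3.5:", python_is_recent_enough())
    print("Missing modules:", missing_modules(REQUIRED_MODULES) or "none")
    print("ffmpeg on PATH:", ffmpeg_available())
```

The check only confirms that the modules can be found; it does not verify their versions, so the elasticsearch client may still need to be pinned to 7.x.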
The following resources are used by Tsakorpus, but do not need to be installed:
- jQuery library
- video.js media player
- videojs-youtube plugin
- bootstrap toolkit
- D3.js visualization library
- KioskBoard virtual keyboard
The software is distributed under the MIT license (see LICENSE).