Skip to content
/ BoAT Public

Boğaziçi University Annotation Tool for Dependency Parsing

Notifications You must be signed in to change notification settings

boun-tabi/BoAT

Repository files navigation

BoAT

Boğaziçi University Annotation Tool for Dependency Parsing

This repository contains an implementation of BoAT to easily annotate CoNLL-U files for dependency parsing. It also visualizes the parse trees.

Implementation

Requirements

Installation of Python

To check your python3 version, run python3 --version.

To install Python 3.6:

To work with a virtual Python environment (optional)

Go to the directory of BoAT-master and run the following:

python3 -m venv .venv
source .venv/bin/activate

Installation of the modules

Using requirements.txt, run pip3 install -r requirements.txt.

Implementation Notes

Required resources for validation (data/ folder and validate.py) are taken from github.com/universaldependencies/tools.

How to run?

  • Go to the directory of BoAT-master
  • Run with: python3 main.py

To use this tool for a specific language, giving the language as an argument (-l) will check your sentences through UD's validation. For example:

python3 main.py -l tr

You can give the conllu file directly by using the -f argument. For example:

python3 main.py -f example.conllu

How to use?

Check the user manual.

Citation

If you make use of the tool, please cite the following papers:

@inproceedings{enhance-altnlp,
  author = {Marşan, Büşra and Furkan Akkurt, Salih and Şen, Muhammet and Gürbüz, Merve and Güngör, Onur and Betül Özateş, Şaziye and Üsküdarlı, Suzan and Özgür, Arzucan and Güngör, Tunga and Öztürk, Balkız},
  title = {{Enhancements to the BOUN Treebank Reflecting the Agglutinative Nature of Turkish}},
  booktitle = {Proceedings of The International Conference and Workshop on Agglutinative Language Technologies as a challenge of Natural Language Processing (ALTNLP)},
  year = {2022},
  month = jun,
}
@article{turk2021resources,
  title={Resources for Turkish dependency parsing: Introducing the BOUN treebank and the BoAT annotation tool},
  author={T{\"u}rk, Utku and Atmaca, Furkan and {\"O}zate{\c{s}}, {\c{S}}aziye Bet{\"u}l and Berk, G{\"o}zde and Bedir, Seyyit Talha and K{\"o}ksal, Abdullatif and Ba{\c{s}}aran, Balk{\i}z {\"O}zt{\"u}rk and G{\"u}ng{\"o}r, Tunga and {\"O}zg{\"u}r, Arzucan},
  journal={Language Resources and Evaluation},
  pages={1--49},
  year={2021},
  publisher={Springer}
}
@inproceedings{turk-etal-2019-turkish,
    title = "{T}urkish Treebanking: Unifying and Constructing Efforts",
    author = {T{\"u}rk, Utku  and
      Atmaca, Furkan  and
      {\"O}zate{\c{s}}, {\c{S}}aziye Bet{\"u}l  and
      K{\"o}ksal, Abdullatif  and
      Ozturk Basaran, Balkiz  and
      Gungor, Tunga  and
      {\"O}zg{\"u}r, Arzucan},
    booktitle = "Proceedings of the 13th Linguistic Annotation Workshop",
    month = aug,
    year = "2019",
    address = "Florence, Italy",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/W19-4019",
    doi = "10.18653/v1/W19-4019",
    pages = "166--177",
}

About

Boğaziçi University Annotation Tool for Dependency Parsing

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages