Skip to content

An environment for querying and revising CoNLL-U datasets written in Python/Javascript

License

Notifications You must be signed in to change notification settings

alvelvis/Interrogat-rio

Repository files navigation

Interrogatório

Interrogatório is an environment for querying and editing annotated corpora in the CoNLL-U format. It is available in Portuguese and English, based on the user's web browser language. It was developed under the supervision of Prof. Cláudia Freitas and it's been used in several research projects, including Linguistic resources for Portuguese Natural Language Processing from the Linguistics Department of PUC-Rio (Brazil), project Petrolês, at the Applied Computational Intelligence Laboratory from PUC-Rio, and POeTiSA, at the Center for Artificial Intelligence (C4AI) of the University of São Paulo.

Live demo

Interrogatório is part of ET: a workstation for querying, editing and evaluating annotated corpora. Consider also installing Julgamento, an enviroment for evaluating annotated corpora, in the same folder as Interrogatório to integrate both.

If you wish to deploy Interrogatório to a web server, using Ubuntu/Apache2, check Deploying with Ubuntu/Apache2.

How to run on a Windows machine

  1. Go to releases

  2. Download the latest version Interrogat-rio.zip

  3. Extract the folder Interrogat-rio from the zip file

  4. Double click interrogatorio_for_windows.bat

How to install in a local server: 4-steps Tutorial

If you wish to run Interrogatório in a local server, a Linux computer (or Windows with Windows Subsystem for Linux), python3 and virtualenv are needed.

  1. The recommended way to download the repository is by cloning it. In a terminal, execute the following command:

    $ git clone https://github.com/alvelvis/Interrogat-rio.git --recursive

  2. Change to the newly created directory:

    $ cd Interrogat-rio

  3. After downloading the repository, run the command below, and it will then create a Python 3 virtual environment and install the requirements in order to run Interrogatório.

    $ sh install_interrogatorio.sh

  4. Then, whenever you intend to run Interrogatório locally, run the command below and it will also auto-update when necessary if you cloned the repo using Git:

    $ sh run_interrogatorio.sh

All set, you'll be able to access Interrogatório by the local page http://localhost:8000/. End the server by pressing "Ctrl+C" in the terminal window.

How to cite

@inproceedings{de-souza-freitas-2021-et,
    title = "{ET}: A Workstation for Querying, Editing and Evaluating Annotated Corpora",
    author = "de Souza, Elvis  and
      Freitas, Cl{\'a}udia",
    booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations",
    month = nov,
    year = "2021",
    address = "Online and Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.emnlp-demo.5",
    pages = "35--41",
    abstract = "In this paper we explore the functionalities of ET, a suite designed to support linguistic research and natural language processing tasks using corpora annotated in the CoNLL-U format. These goals are achieved by two integrated environments {--} Interrogat{\'o}rio, an environment for querying and editing annotated corpora, and Julgamento, an environment for assessing their quality. ET is open-source, built on different Python Web technologies and has Web demonstrations available on-line. ET has been intensively used in our research group for over two years, being the chosen framework for several linguistic and NLP-related studies conducted by its researchers.",
}

About

An environment for querying and revising CoNLL-U datasets written in Python/Javascript

Resources

License

Stars

Watchers

Forks