OPTSLA is an Optimization-based Sequential Label Aggregation method, that jointly considers the characteristics of sequential labeling tasks, workers reliabilities, and advanced deep learning techniques to conquer the challenge of annotation aggregation.
The code is divided into 2 folders, code and dataset. The dataset folder contains the files NER dataset formatted into conll format. Code folder contains python files.We use Glove for embedding, please download glove.6B from below link, unzip and place unzipped files in OPTSLA folder.
http://nlp.stanford.edu/data/glove.6B.zip
Python file | Description |
---|---|
conlleval.py | This file is used for evaluating the output |
data_preprocessing.py | This file is used for data preprocessing |
evaluation.py | This is the main file containing OPTSLA implementation |
functions.py | This file contains implementation of few necessary functions |
NOTE: Please update variables in python files before proceeding.
Pre-processing of the data is the first step, to perform the task execute below command
- python data_preprocessing.py
A new folder named iteration0 is created in execution folder which contains pre-processed files.
Now, execute OPTSLA by running following command
- python evaluation.py
Once aggregation is done, the model can be evaluated by running following command
- python calculations.py
This will create a file in calculations folder with the results.
In case of any queries, please contact us at
- nasim@iastate.edu
- aditkulk@iastate.edu
@inproceedings{sabetpour2020optsla, title={OptSLA: an Optimization-Based Approach for Sequential Label Aggregation}, author={Sabetpour, Nasim and Kulkarni, Adithya and Li, Qi}, booktitle={Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings}, pages={1335--1340}, year={2020} }