README

OPTSLA: an Optimization-Based Approach for Sequential LabelAggregation

OPTSLA is an Optimization-based Sequential Label Aggregation method, that jointly considers the characteristics of sequential labeling tasks, workers reliabilities, and advanced deep learning techniques to conquer the challenge of annotation aggregation.

Structure

The code is divided into 2 folders, code and dataset. The dataset folder contains the files NER dataset formatted into conll format. Code folder contains python files.

Embedding

We use Glove for embedding, please download glove.6B from below link, unzip and place unzipped files in OPTSLA folder.

http://nlp.stanford.edu/data/glove.6B.zip

Python file details

Python file	Description
conlleval.py	This file is used for evaluating the output
data_preprocessing.py	This file is used for data preprocessing
evaluation.py	This is the main file containing OPTSLA implementation
functions.py	This file contains implementation of few necessary functions

Usage

The following is the sequence of execution:

NOTE: Please update variables in python files before proceeding.

Pre-processing of the data is the first step, to perform the task execute below command

python data_preprocessing.py

A new folder named iteration0 is created in execution folder which contains pre-processed files.

Now, execute OPTSLA by running following command

python evaluation.py

Once aggregation is done, the model can be evaluated by running following command

python calculations.py

This will create a file in calculations folder with the results.

Contact

In case of any queries, please contact us at

nasim@iastate.edu
aditkulk@iastate.edu

Evaluation Note

The dataset provided contains 4515 sentences, the results published in the paper are evaluated on 3466 sentences to match with baselines.

References

https://www.aclweb.org/anthology/2020.findings-emnlp.119/

Citing

@inproceedings{sabetpour2020optsla, title={OptSLA: an Optimization-Based Approach for Sequential Label Aggregation}, author={Sabetpour, Nasim and Kulkarni, Adithya and Li, Qi}, booktitle={Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings}, pages={1335--1340}, year={2020} }

Name	Name	Last commit message	Last commit date
Latest commit NasimISU Update README.md Feb 8, 2021 8fc2ea3 · Feb 8, 2021 History 6 Commits
code	code	https://github.com/NasimISU/OptSLA.git	Oct 3, 2020
datasets	datasets	https://github.com/NasimISU/OptSLA.git	Oct 3, 2020
README.md	README.md	Update README.md	Feb 8, 2021
conlleval.pl	conlleval.pl	https://github.com/NasimISU/OptSLA.git	Oct 3, 2020
conlleval.py	conlleval.py	https://github.com/NasimISU/OptSLA.git	Oct 3, 2020
conlleval.pyc	conlleval.pyc	https://github.com/NasimISU/OptSLA.git	Oct 3, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README

OPTSLA: an Optimization-Based Approach for Sequential LabelAggregation

Structure

Embedding

Python file details

Usage

Contact

Evaluation Note

References

Citing

About

Releases

Packages

Languages

NasimISU/OptSLA

Folders and files

Latest commit

History

Repository files navigation

README

OPTSLA: an Optimization-Based Approach for Sequential LabelAggregation

Structure

Embedding

Python file details

Usage

Contact

Evaluation Note

References

Citing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages