CoVal: A coreference evaluation tool for the CoNLL and ARRAU datasets

Implementation of the common coreference evaluation metrics, including MUC, B-cubed, CEAFe, and LEA, for both the CoNLL and ARRAU datasets. See the paper "Which Coreference Evaluation Metric Do You Trust? A Proposal for a Link-based Entity Aware Metric".
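To give a feel for what these metrics measure, here is a minimal sketch of B-cubed on toy clusters. It is the simplified textbook formulation, not CoVal's implementation: it assumes mentions are hashable IDs, that each mention belongs to exactly one cluster, and it gives a mention missing from the other side an overlap of zero. The name b_cubed is made up for this illustration.

```python
def b_cubed(key_clusters, sys_clusters):
    """Return (precision, recall, f1) for two lists of mention clusters."""
    # Map every mention to the cluster that contains it.
    key_of = {m: frozenset(c) for c in key_clusters for m in c}
    sys_of = {m: frozenset(c) for c in sys_clusters for m in c}

    def mean_overlap(source, other):
        # Average, over mentions in `source`, of the fraction of the
        # mention's cluster that is shared with its cluster in `other`.
        if not source:
            return 0.0
        total = 0.0
        for mention, cluster in source.items():
            shared = cluster & other.get(mention, frozenset())
            total += len(shared) / len(cluster)
        return total / len(source)

    recall = mean_overlap(key_of, sys_of)      # measured against the key
    precision = mean_overlap(sys_of, key_of)   # measured against the system
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy example: the system puts mention "c" into the wrong entity.
key = [{"a", "b", "c"}, {"d", "e"}]
system = [{"a", "b"}, {"c", "d", "e"}]
precision, recall, f1 = b_cubed(key, system)
```

On this toy input, precision and recall both come out to 11/15. Real scorers differ in how they treat singletons and mentions that appear on only one side, so these numbers should not be expected to match CoVal's output on real files.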

This fork adds a somewhat cleaner CLI and the ability to use the scorer as a library.

Requirements

See requirements.txt.

Usage

Scoring is implemented in the scorer module; cli.py exposes it on the command line.

Basic usage with CoNLL files:

➜ python cli.py --help               
usage: cli.py [-h] [--key_file KEY_FILE] [--sys_file SYS_FILE] [--np_only NP_ONLY] [--remove_nested REMOVE_NESTED] [--keep_singletons KEEP_SINGLETONS] [--min_span MIN_SPAN]

options:
  -h, --help            show this help message and exit
  --key_file KEY_FILE
  --sys_file SYS_FILE
  --np_only NP_ONLY
  --remove_nested REMOVE_NESTED
  --keep_singletons KEEP_SINGLETONS
  --min_span MIN_SPAN

Or as a library, import the scorer module and use the score function:

>>> from coval import scorer
>>> import inspect
>>> inspect.signature(scorer.score)
<Signature (key_file, sys_file, *, np_only=False, remove_nested=False, keep_singletons=True, min_span=False)>

The key_file and sys_file arguments are the paths to the files with the gold coreference annotations and the system output, respectively.
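A minimal sketch of calling the library, assuming the coval package is importable and that scorer.score still has the signature shown above. The helper name score_files is hypothetical and not part of CoVal. It checks that both paths exist and then passes any options through as keyword arguments, since every parameter after sys_file is keyword-only. The return value is passed back unchanged because its format is not described here.

```python
from pathlib import Path


def score_files(key_file, sys_file, **options):
    """Validate both paths, then delegate to coval's scorer.score.

    `options` may contain np_only, remove_nested, keep_singletons
    and min_span, which scorer.score accepts as keyword-only arguments.
    """
    for path in (key_file, sys_file):
        if not Path(path).is_file():
            raise FileNotFoundError(f"No such file: {path}")

    # Imported here so that a bad path is reported before the import runs.
    from coval import scorer

    return scorer.score(str(key_file), str(sys_file), **options)
```

A call might look like score_files("gold.conll", "pred.conll", keep_singletons=False), where both file names are placeholders.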

For more details, refer to the ARRAU README for evaluating ARRAU files and to the CoNLL README for CoNLL evaluations.

Run tests with python3 -m pytest unittests.py.

Reference

If you use this code in your work, please cite the paper:

@InProceedings{moosavi2019minimum,
    author = {Nafise Sadat Moosavi and Leo Born and Massimo Poesio and Michael Strube},
    title = {Using Automatically Extracted Minimum Spans to Disentangle Coreference Evaluation from Boundary Detection},
    year = {2019},
    booktitle = {Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
    publisher = {Association for Computational Linguistics},
    address = {Florence, Italy},
}

Authors

This code was written by @ns-moosavi. Some parts are borrowed from https://github.com/clarkkev/deep-coref/blob/master/evaluation.py

The test suite is taken from https://github.com/conll/reference-coreference-scorers/

Mention evaluation and the test suite were added by @andreasvc.

CoNLL file parsing was developed by Leo Born.
