amr-evaluation-enhanced (this is a variant of https://github.com/mdtux89/amr-evaluation)

Evaluation metrics to compare AMR graphs based on Smatch (http://amr.isi.edu/evaluation.html). In addition to the traditional Smatch score, the script computes the following metrics between AMR graphs:

Unlabeled (differs from the original). Smatch score computed on the predicted graphs after canonicalizing edge direction and removing all edge labels
No WSD. Smatch score while ignoring Propbank senses (e.g., duck-01 vs duck-02)
Named Ent. F-score on the named entity recognition (:name roles)
Non_sense_frames (new). F-score on Propbank frame identification without sense (e.g. duck-00)
Frames (new). F-score on Propbank frame identification with sense (e.g. duck-01)
Wikification. F-score on the wikification (:wiki roles)
Negations. F-score on the negation detection (:polarity roles)
Concepts. F-score on the concept identification task
Reentrancy. Smatch computed on reentrant edges only
SRL. Smatch computed on :ARG-i roles only
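
To make the F-score metrics above more concrete, here is a minimal sketch of an F-score over concept labels, with and without Propbank senses. It is an illustration only and is not taken from scores.py: the helper names strip_sense and f_score, and the regular expression used to detect a sense suffix, are assumptions made for this example.

```python
import re
from collections import Counter

# Assumption: a Propbank sense is a trailing "-NN" suffix, as in "duck-01".
SENSE_SUFFIX = re.compile(r"-\d+$")


def strip_sense(concept):
    """Return the concept without its sense suffix (duck-01 -> duck)."""
    return SENSE_SUFFIX.sub("", concept)


def f_score(predicted, gold):
    """Return (precision, recall, F1) over two lists of labels."""
    pred_counts, gold_counts = Counter(predicted), Counter(gold)
    # Multiset intersection: a label is matched at most as often
    # as it occurs in both lists.
    matched = sum((pred_counts & gold_counts).values())
    precision = matched / len(predicted) if predicted else 0.0
    recall = matched / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


predicted = ["duck-01", "boy", "want-01"]
gold = ["duck-02", "boy", "want-01"]

# Senses must match exactly: duck-01 and duck-02 count as different labels.
with_sense = f_score(predicted, gold)

# Senses ignored: both become "duck" and match.
without_sense = f_score([strip_sense(c) for c in predicted],
                        [strip_sense(c) for c in gold])

print(with_sense)
print(without_sense)
```

In this toy example, ignoring senses turns the duck-01 vs duck-02 mismatch into a match, which is the difference between a sense-aware and a sense-agnostic frame score.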

The different metrics were introduced in the paper below, which also uses them to evaluate several AMR parsers:

"An Incremental Parser for Abstract Meaning Representation", Marco Damonte, Shay B. Cohen and Giorgio Satta. Proceedings of EACL (2017). URL: https://arxiv.org/abs/1608.06111

(Some of the metrics were recently fixed and updated)

Usage: ./evaluation.sh <parsed data> <gold data>, where <parsed data> and <gold data> are two files, each containing multiple AMRs. Consecutive AMRs are separated by a blank line (the same format required by Smatch).
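
The input format can be illustrated with a short sketch that splits such a file into individual AMRs. This is not the repository's reader: the function name read_amr_blocks and the sample graphs are made up for the example, and skipping lines that start with "#" (metadata such as "# ::snt") is an assumption based on common AMR file conventions.

```python
def read_amr_blocks(text):
    """Split text into AMR strings, one per blank-line-separated block."""
    blocks, current = [], []
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            # A blank line closes the current AMR, if there is one.
            if current:
                blocks.append(" ".join(current))
                current = []
        elif not stripped.startswith("#"):
            # Assumption: lines starting with '#' are metadata and are skipped.
            current.append(stripped)
    if current:
        # The last AMR may not be followed by a blank line.
        blocks.append(" ".join(current))
    return blocks


sample = """# ::snt The boy wants to go.
(w / want-01
   :ARG0 (b / boy)
   :ARG1 (g / go-01
      :ARG0 b))

# ::snt The duck swims.
(s / swim-01
   :ARG0 (d / duck))
"""

amrs = read_amr_blocks(sample)
print(len(amrs))
for amr in amrs:
    print(amr)
```

A check like this can be useful before running the evaluation, for instance to confirm that the parsed file and the gold file contain the same number of AMRs.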

In the paper we also discuss a metric for noun phrase analysis. To compute this metric:

Run ./preprocessing.sh <gold data> and then python extract_np.py <gold data> to extract the noun phrases from your gold dataset. This creates two files: np_sents.txt and np_graphs.txt.
Parse np_sents.txt with the AMR parser and evaluate the output with Smatch: python smatch/smatch.py --pr -f <parsed data> np_graphs.txt
