What kinds of errors do reference resolution models make and what can we learn from them?

This repository contains code and models for our NAACL 2022 paper What kinds of errors do reference resolution models make and what can we learn from them? by Jorge Sánchez, Mauricio Mazuecos, Hernán Maina and Luciana Benotti.

Installation

Set up the code in a virtualenv:

$ git clone https://github.com/jadrs/rec.git && cd rec
$ python3 -m venv venv  && source venv/bin/activate

You'll also need a working installation of PyTorch. Go to the PyTorch website and choose the version that best suits your hardware, e.g.:

$ python3 -m pip install torch==1.8.2+cu111 torchvision==0.9.2+cu111 torchaudio==0.8.2 -f https://download.pytorch.org/whl/lts/1.8/torch_lts.html
$ python3 -m pip install -r requirements.txt
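
You can optionally verify that the installation sees your GPU (the exact version string depends on the build you installed):

$ python3 -c "import torch; print(torch.__version__, torch.cuda.is_available())"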

Setup data

Clone the Referring Expression Dataset API

$ git clone https://github.com/lichengunc/refer.git && cd refer
$ git checkout python3

and follow the instructions to access the ReferItGame (a.k.a. RefCLEF), RefCOCO, RefCOCO+, and RefCOCOg datasets.
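
Once the data is in place, a quick way to check that everything is wired up is to query the refer API directly. The snippet below is a minimal sketch, run from inside the cloned refer/ directory; the data_root path, dataset name, and splitBy value are examples and depend on how you set the data up:

# minimal sanity check for the refer API (paths and split names are examples)
from refer import REFER

refer = REFER(data_root="data", dataset="refcoco", splitBy="unc")
ref_ids = refer.getRefIds(split="val")       # referring expression ids in the val split
print(len(ref_ids), "referring expressions")
ref = refer.loadRefs(ref_ids[0])[0]          # load the first annotation
print(ref["sentences"][0]["sent"])           # print its raw expression text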

Training and validation

Run

$ python3 trainval.py -h

for a complete list of training options.
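
As an illustration only, a training run is launched by pointing the script at one of the datasets above; the flag names below are hypothetical, so check the -h output for the actual option names:

$ python3 trainval.py --dataset refcoco --batch-size 32 --max-epochs 20   # hypothetical flags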

Pretrained models

Here you can find both the baseline and extended models trained on the different datasets (Table 3 in the paper). For convenience, we recommend keeping the same directory structure, since the testing script infers some of the parameters from the path names.
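
As an illustration only (the names below are hypothetical; keep whatever structure the download ships with), a layout that encodes the dataset and model variant in the path might look like:

checkpoints/
  refcoco/
    baseline/model.ckpt
    extended/model.ckpt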

Evaluation

First, you'll need a working installation of stanza. You can download the English package files as follows:

$ python3 -c "import stanza; stanza.download('en')"

You can also use spacy, in which case you need to change the backend="stanza" argument in line 178 to backend="spacy". To get the spacy language files, run:

$ python3 -m spacy download en_core_web_md
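
You can check that the language model is available with:

$ python3 -c "import spacy; spacy.load('en_core_web_md')"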

Now, to test a trained model run:

$ python3 test.py <MODEL.ckpt>

The script will infer the dataset and parameters from the file path. Run it with -h to check which options are available. The test script is provided as an example of how to use our trained models; you can customize it to your needs.
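
For example, assuming a checkpoint stored under the illustrative layout shown above (the path is hypothetical):

$ python3 test.py checkpoints/refcoco/baseline/model.ckpt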

Error analysis annotation

We make available our annotation of the types of abilities needed for each referring expression (RE) to be correctly resolved in the file ReferIt_Skill_annotation-NAACL2022.csv.g0.

The file contains more types of abilities than the ones discussed in the paper. The only types relevant for the analysis are the following (see the loading sketch after the list):

  • fuzzy objects
  • meronimy
  • occlusion
  • directional
  • implicit
  • typo
  • viewpoint
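
A minimal sketch for filtering the file to those types, assuming the CSV has one column per ability type (the actual header may differ, so print the columns first and adjust):

import pandas as pd

RELEVANT = ["fuzzy objects", "meronimy", "occlusion",
            "directional", "implicit", "typo", "viewpoint"]

df = pd.read_csv("ReferIt_Skill_annotation-NAACL2022.csv.g0")
print(df.columns.tolist())                       # inspect what is actually in the file
cols = [c for c in RELEVANT if c in df.columns]  # keep only the types used in the paper
print(df[cols].sum())                            # number of REs tagged with each ability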

Citation

If you find this repository useful, please consider citing us.

@inproceedings{sanchez2022reference,
  title = {What kinds of errors do reference resolution models make and what can we learn from them?},
  author = {S\'anchez, Jorge and
    Mazuecos, Mauricio and
    Maina, Hern\'an and
    Benotti, Luciana},
  booktitle = {Findings of the {A}ssociation for {C}omputational {L}inguistics: {NAACL}},
  year = {2022},
  address = "Seattle, US",
  publisher = "{A}ssociation for {C}omputational {L}inguistics",
}
