This is the repository for our paper *What GPT Knows About Who is Who*.
Our abstract:
Coreference resolution -- which is a crucial task for understanding discourse and language at large -- has yet to witness widespread benefits from large language models (LLMs). Moreover, coreference resolution systems largely rely on supervised labels, which are highly expensive and difficult to annotate, thus making it ripe for prompt engineering. In this paper, we introduce a QA-based prompt-engineering method and discern generative, pre-trained LLMs' abilities and limitations toward the task of coreference resolution. Our experiments show that GPT-2 and GPT-Neo can return valid answers, but that their capabilities to identify coreferent mentions are limited and prompt-sensitive, leading to inconsistent results.
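To make the setup concrete, here is a minimal sketch of what a QA-style coreference prompt can look like with an off-the-shelf GPT-2 through Hugging Face `transformers`. The passage, prompt template, and decoding settings below are illustrative assumptions, not the exact ones from our notebooks:

```python
from transformers import pipeline

# Illustrative only: the prompt template and decoding settings here are
# assumptions, not the exact ones used in the notebooks.
generator = pipeline("text-generation", model="gpt2")

passage = "Anna handed the keys to Maria because she was driving."
question = "In the passage above, who does 'she' refer to?"
prompt = f"{passage}\n\nQuestion: {question}\nAnswer:"

# Greedy decoding; we only need a short span naming the antecedent.
output = generator(prompt, max_new_tokens=5, do_sample=False)[0]["generated_text"]
print(output[len(prompt):].strip())  # e.g. "Maria" -- or an invalid answer
```

As the abstract notes, the model may return a valid mention or something else entirely, and small changes to the prompt wording can flip the answer.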
You can create the required conda env with:

```bash
conda env create -f corefGPT.yml
```

Then you can activate it with:

```bash
conda activate corefGPT
```
To get started quickly, you can download our processed ECB+ files from here and place them in the root of this directory. To experiment with the Streamline model, you can download our trained model from here and place it locally under `Models/Streamline`.
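Before launching the notebooks, a quick sanity check along these lines (a convenience sketch, not a script shipped in this repo) can confirm the downloads landed in the right places:

```python
from pathlib import Path

# Convenience check only: the exact ECB+ file names are not verified here,
# since they depend on which processed files you downloaded.
repo_root = Path(".")
model_dir = repo_root / "Models" / "Streamline"

print("Models/Streamline present:", model_dir.is_dir())
print("Files in repo root:", sorted(p.name for p in repo_root.iterdir()))
```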
Lastly, when running the notebooks locally, you have to change the lines that set the root path from the Google Drive path to the local path. So this:
```python
local_path = ""
# Change this to "local_path" if you run the notebook locally
root_path = gdrive_dir_path
```
Needs to be changed to this:
```python
local_path = ""
# Change this to "local_path" if you run the notebook locally
root_path = local_path
```
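Alternatively, instead of editing each notebook by hand, a snippet like the following (a suggestion, not code from the notebooks; the Drive path shown is a hypothetical placeholder) picks the right root automatically:

```python
import importlib.util

local_path = ""
# Hypothetical placeholder -- the notebooks define the actual Drive path.
gdrive_dir_path = "/content/drive/MyDrive/corefGPT"

# google.colab is only importable inside a Colab runtime, so its presence
# tells us which root to use without any manual edits.
IN_COLAB = importlib.util.find_spec("google.colab") is not None
root_path = gdrive_dir_path if IN_COLAB else local_path
```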
The table below shows how long each model took to run on Colab:
| Model | GPT-2 | GPT-Neo-125M | Multi-pass Sieves | E2E | Streamline (Pairwise) |
|---|---|---|---|---|---|
| Time | 3h | 7.5h | 5h | 2m | 2h |