A Pytorch Implementation of R-BERT relation classification model

This is an unofficial pytorch implementation of R-BERT model described paper Enriching Pre-trained Language Model with Entity Information for Relation Classification.

In addition to the SemEval 2010 dataset tested in the original paper, I aslo test implementation on the more recent TACRED dataset

Requirements:

Python version >= 3.6
Pytorch version >= 1.1
Transformer library version >= 2.5.1

Install

$ https://github.com/mickeystroller/R-BERT
$ cd R-BERT

Train

SemEval-2010

The SemEval-2010 dataset is already included in this repo and you can directly run:

CUDA_VISIBLE_DEVICES=0 python r_bert.py --config config.ini

TACRED

You need to first download TACRED dataset from LDC, which due to the license issue I cannot put in this repo. Then, you can directly run:

CUDA_VISIBLE_DEVICES=0 python r_bert.py --config config_tacred.ini

Eval

SemEval-2010

We use the official script for SemEval 2010 task-8

$ cd eval
$ bash test.sh
$ cat res.txt

TACRED

First, we generate prediction file tac_res.txt

$ python eval_tacred.py

You may change test file/model path in the eval_tacred.py file

Then, we use the official scoring script for TACRED dataset

$ python ./eval/score.py -gold_file <TACRED_DIR/data/gold/test.gold> -pred_file ./eval/tac_res.txt

Results

SemEval-2010

Below is the Macro-F1 score

Model	Original Paper	Ours
BERT-uncased-base	----	88.40
BERT-uncased-large	89.25	90.16

TACRED

Below is the evaluation result

Model	Precision (Micro)	Recall (Micro)	F1 (Micro)
BERT-uncased-base	72.99	62.50	67.34
BERT-cased-base	71.27	64.84	67.91
BERT-uncased-large	72.91	66.20	69.39
BERT-cased-large	70.86	65.96	68.32

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
data		data
eval		eval
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.ini		config.ini
config.py		config.py
config_tacred.ini		config_tacred.ini
eval_tacred.py		eval_tacred.py
generate_tacred_tsv.py		generate_tacred_tsv.py
model.py		model.py
r_bert.py		r_bert.py
utils.py		utils.py

License

mickeysjm/R-BERT

Folders and files

Latest commit

History

Repository files navigation

A Pytorch Implementation of R-BERT relation classification model

Requirements:

Install

Train

SemEval-2010

TACRED

Eval

SemEval-2010

TACRED

Results

SemEval-2010

TACRED

Reference

About

Topics

Resources

License

Stars

Watchers

Forks

Languages