Fine-grained Contrastive Learning for Relation Extraction

Description

This repo contains the source code for the EMNLP 2022 paper Fine-grained Contrastive Learning for Relation Extraction.

Project Structure

├── configs                <- Configuration files
│   ├── experiment               <- Experiment configs
│   ├── mode                     <- Mode configs
│   ├── trainer                  <- Trainer configs
│   └── config.yaml              <- Main config 
│
├── data                   <- Project data
│
├── logs                   <- Logs and saved checkpoints
│
├── preprocess             <- Preprocessing scripts
│
├── saved_models           <- Saved models
│
├── src                    <- Source code
│   ├── stage_1                  <- Stage 1 code: record learning order
│   ├── stage_2                  <- Stage 2 code: contrastive pre-training
│   └── stage_3                  <- Stage 3 code: fine-tuning
│       ├── doc_level            <- Fine-tune for doc-level relation extraction
│       └── sentence_level       <- Fine-tune for sentence-level relation extraction
│
├── requirements.txt       <- Python dependencies
├── run.py                 <- Controller
└── README.md

Instructions

📓 Pre-run Notes:

  • For convenience, we provide the complete preprocessed datasets, saved models, and logs for all of our experiments.
  • By using the preprocessed data and saved models, you can jump to any stage detailed below without needing to run the previous stages.
  • This repo uses Hydra to run and configure experiments. You can override any parameter from the command line, e.g.: CUDA_VISIBLE_DEVICES=0 python run.py trainer.learning_rate=1e-5 (see the sketch after these notes).
  • Please see the FineCL paper for details about each stage of training and evaluation.
  • The code in this repo was developed using Python (v3.9), PyTorch (v1.12.1), Hugging Face Transformers (v2.5.0), and CUDA (v11.6).
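For reference, a minimal Hydra entry point is sketched below. This is illustrative only (the repo's actual run.py may be structured differently); it just shows how configs/config.yaml, the selected config groups, and any command-line overrides are composed into a single config object.

    # Minimal Hydra entry-point sketch (illustrative only; not the repo's actual run.py).
    import hydra
    from omegaconf import DictConfig, OmegaConf

    @hydra.main(config_path="configs", config_name="config")
    def main(cfg: DictConfig) -> None:
        # cfg is configs/config.yaml merged with any group selections
        # (experiment=..., mode=..., trainer=...) and command-line overrides
        # such as trainer.learning_rate=1e-5.
        print(OmegaConf.to_yaml(cfg))

    if __name__ == "__main__":
        main()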

⓪ Initialize

  • Install dependencies from requirements.txt
  • Install Apex
  • Download the data and saved models via the command line:
    gdown --no-check-certificate --folder https://drive.google.com/drive/u/1/folders/13-iTHhde8B5BQPNk8bCA0z6dxxo42ov1
  • Unzip data.zip and then move both data and saved_models into the project's root directory.
  • [OPTIONAL] All the preprocessed data is provided, but if you'd like to preprocess the data yourself, run:
    python run.py mode=preprocess.yaml

① Stage 1 – Record learning order:

  • The exact learning order data used in the paper is provided in data/erica_data/order_dict_augmented.json. However, if you'd like to generate a learning order of relation instances from scratch, do the following:
  • There are 10 separate distantly labeled training data files, numbered 0 through 9. For each of the 10 files, run the following command, replacing [FILE_NUM] with the appropriate file number:
    CUDA_VISIBLE_DEVICES=0 python run.py experiment=stage_1.yaml erica_file_num=[FILE_NUM]
  • Each run outputs a json file containing the epoch and a list of learned unique identifiers (UIDs). A UID identifies a relational instance in the dataset.
  • Merge the outputs of each run into a single json file with the format: {str(UID): int(epoch_learned), ...}. For example: {"39582": 2, "49243": 12, ...} (see the merge sketch after this list).
  • This merged file will replace data/erica_data/order_dict_augmented.json in Stage 2 of training.
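A minimal merge sketch is shown below. It assumes each Stage 1 run writes a JSON file mapping each epoch to the list of UIDs first learned in that epoch; the glob pattern and file locations are hypothetical, so adapt them to your actual Stage 1 outputs.

    # Hypothetical merge script (not part of the repo). Assumes each Stage 1
    # output JSON maps str(epoch) -> [UID, ...]; adjust paths and format to
    # match your actual run outputs.
    import glob
    import json

    merged = {}
    for path in sorted(glob.glob("logs/stage_1/learning_order_*.json")):  # hypothetical location
        with open(path) as f:
            epoch_to_uids = json.load(f)
        for epoch, uids in epoch_to_uids.items():
            for uid in uids:
                # Keep the earliest epoch at which each UID was learned.
                merged[str(uid)] = min(int(epoch), merged.get(str(uid), int(epoch)))

    with open("data/erica_data/order_dict_augmented.json", "w") as f:
        json.dump(merged, f)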

② Stage 2 – Fine-grained contrastive pre-training:

  • To pre-train a model using Fine-grained Contrastive Learning (FineCL), run:
    CUDA_VISIBLE_DEVICES=0 python run.py experiment=stage_2.yaml
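For intuition, the contrastive objective in this stage up-weights instances judged more reliable by learning order (earlier-learned instances are, on average, more accurately labeled). The snippet below is a generic, illustrative weighted InfoNCE-style loss, not the exact FineCL objective; see src/stage_2 and the paper for the real implementation.

    # Illustrative only: a generic weighted InfoNCE-style contrastive loss in
    # which per-instance weights (e.g., derived from learning order) scale each
    # positive pair's contribution. Not the exact FineCL objective.
    import torch
    import torch.nn.functional as F

    def weighted_info_nce(anchor, positive, weights, temperature=0.05):
        """anchor, positive: (batch, dim) embeddings; weights: (batch,) tensor."""
        anchor = F.normalize(anchor, dim=-1)
        positive = F.normalize(positive, dim=-1)
        logits = anchor @ positive.t() / temperature          # pairwise similarities
        targets = torch.arange(anchor.size(0), device=anchor.device)
        per_example = F.cross_entropy(logits, targets, reduction="none")
        return (weights * per_example).mean()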

③ Stage 3 – Downstream fine-tuning:

  • Document-level RE fine-tuning:

    • For document-level fine-tuning, we load the pre-trained model from Stage 2 and train on annotated data from the DocRED dataset (a checkpoint-loading sketch follows this section). Run:
      CUDA_VISIBLE_DEVICES=0 python run.py experiment=stage_3_doc_re.yaml
      
      To evaluate the performance of the fine-tuned model from Stage 3, run:
      CUDA_VISIBLE_DEVICES=0 python run.py experiment=stage_3_eval.yaml
      
  • Sentence-level RE fine-tuning:

    • For sentence-level fine-tuning, we again load the pre-trained model from Stage 2 and train on annotated sentence-level RE datasets (see the FineCL paper for the specific benchmarks). Run:

      CUDA_VISIBLE_DEVICES=0 python run.py experiment=stage_3_sentence_re.yaml
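Both fine-tuning settings start from the Stage 2 checkpoint. Below is a rough sketch of loading such a checkpoint into a Transformers backbone before fine-tuning; the checkpoint path and backbone name are assumptions, and the repo's stage_3 code handles this step via its configs.

    # Rough sketch only: loading a pre-trained Stage 2 checkpoint before
    # fine-tuning. The path and backbone name are hypothetical; the repo's
    # stage_3 code and configs handle this step.
    import torch
    from transformers import AutoModel

    backbone = AutoModel.from_pretrained("bert-base-uncased")  # assumed backbone
    state = torch.load("saved_models/finecl_stage_2.ckpt", map_location="cpu")  # hypothetical path
    backbone.load_state_dict(state, strict=False)  # strict=False: task-specific heads may differ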
      
Credits: This work began as a fork of the ERICA repository. If you find our code useful, please consider citing:

@inproceedings{hogan-etal-2022-fine,
    title = "Fine-grained Contrastive Learning for Relation Extraction",
    author = "Hogan, William  and
      Li, Jiacheng  and
      Shang, Jingbo",
    booktitle = "Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing",
    month = dec,
    year = "2022",
    address = "Abu Dhabi, United Arab Emirates",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.emnlp-main.71",
    pages = "1083--1095",
    abstract = "Recent relation extraction (RE) works have shown encouraging improvements by conducting contrastive learning on silver labels generated by distant supervision before fine-tuning on gold labels. Existing methods typically assume all these silver labels are accurate and treat them equally; however, distant supervision is inevitably noisy{--}some silver labels are more reliable than others. In this paper, we propose fine-grained contrastive learning (FineCL) for RE, which leverages fine-grained information about which silver labels are and are not noisy to improve the quality of learned relationship representations for RE. We first assess the quality of silver labels via a simple and automatic approach we call {``}learning order denoising,{''} where we train a language model to learn these relations and record the order of learned training instances. We show that learning order largely corresponds to label accuracy{--}early-learned silver labels have, on average, more accurate labels than later-learned silver labels. Then, during pre-training, we increase the weights of accurate labels within a novel contrastive learning objective. Experiments on several RE benchmarks show that FineCL makes consistent and significant performance gains over state-of-the-art methods.",
}
