MG-BERT

This is the source code for our paper: Parishad BehnamGhader, Hossein Zakerinia, and Mahdieh Soleymani Baghshah. "MG-BERT: Multi-Graph Augmented BERT in Masked Language Modeling." In Proceedings of the Fifteenth Workshop on Graph-Based Methods for Natural Language Processing (TextGraphs-15), pp. 125-131, 2021.


Datasets

You may use the CoLA and SST datasets from the GLUE repository and the Brown dataset from the Brown Corpus Manual to train and evaluate your models. The WN18 knowledge graph can also be accessed through this repository.
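For example, the CoLA training split can be loaded with a few lines of pandas. This is only a sketch assuming the standard GLUE CoLA TSV layout (four tab-separated columns, no header row); the local path is hypothetical:

import pandas as pd

# CoLA's training TSV ships without a header; the four columns are the
# sentence source, the acceptability label (0/1), the original annotation,
# and the sentence text.
cols = ["source", "label", "annotation", "sentence"]
df = pd.read_csv("data/cola/in_domain_train.tsv", sep="\t", header=None, names=cols)
sentences = df["sentence"].tolist()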


Running the code

To run the code, first use prepare_data.py to create the multi-graphs from the corpus.

python prepare_data.py --dataset cola --kg WN11  
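For intuition only, the sketch below shows one common way to build two of the corpus-level graphs used here: a TF-IDF graph of word-document edges and a PMI graph of word-word edges, computed on a toy corpus. It is not the repository's implementation, and all names are hypothetical:

import numpy as np
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer

corpus = ["the cat sat on the mat", "the dog sat on the log"]

# TF-IDF graph: weighted word-document edges.
tfidf_graph = TfidfVectorizer().fit_transform(corpus)  # shape: (docs, vocab)

# PMI graph: word-word edges weighted by pointwise mutual information,
# estimated here from document-level co-occurrence; negative PMI is dropped.
occurs = (CountVectorizer().fit_transform(corpus).toarray() > 0).astype(float)
p_word = occurs.mean(axis=0)                     # p(word)
p_joint = (occurs.T @ occurs) / occurs.shape[0]  # p(word_i, word_j)
with np.errstate(divide="ignore", invalid="ignore"):
    pmi = np.log(p_joint / np.outer(p_word, p_word))
pmi_graph = np.where(np.isfinite(pmi) & (pmi > 0), pmi, 0.0)
np.fill_diagonal(pmi_graph, 0.0)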

Then, you can train your MG-BERT model using train.py. Here, we train a model on a multi-graph consisting of the TF-IDF, PMI, and knowledge-graph (KG) graphs.

python train.py --dataset cola --kg WN11 --dyn 0.8 --graph-mode 123 --epoch 100  

Finally, evaluate the trained model (using Hits@k metrics) with evaluate.py.

python evaluate.py --dataset cola --kg WN11 --dyn 0.8 --graph-mode 123 --epoch 100
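Hits@k measures whether the ground-truth token appears among the model's top-k predictions for a masked position. Below is a minimal, self-contained sketch of the metric using the Hugging Face transformers API, with an off-the-shelf bert-base-uncased model as a hypothetical stand-in for a trained MG-BERT checkpoint:

import torch
from transformers import BertForMaskedLM, BertTokenizer

# Stand-in model; substitute your trained MG-BERT checkpoint here.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")
model.eval()

def hits_at_k(sentence, target, k=10):
    # Assumes exactly one [MASK] in the sentence and a single-token target.
    inputs = tokenizer(sentence, return_tensors="pt")
    mask_pos = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero(as_tuple=True)[0].item()
    with torch.no_grad():
        logits = model(**inputs).logits[0, mask_pos]  # shape: (vocab_size,)
    top_k = logits.topk(k).indices.tolist()
    return tokenizer.convert_tokens_to_ids(target) in top_k

print(hits_at_k("The cat sat on the [MASK].", "mat", k=10))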

Some parts of this project were adapted from the VGCN-BERT repository and Hugging Face Transformers.
