Memorisation versus Generalisation in Pre-trained Language Models

GitHub repository for the paper "Memorisation versus Generalisation in Pre-trained Language Models" by Tanzer et al., published at ACL 2022.

State-of-the-art pre-trained language models have been shown to memorise facts and perform well with limited amounts of training data. To gain a better understanding of how these models learn, we study their generalisation and memorisation capabilities in noisy and low-resource scenarios. We find that the training of these models is almost unaffected by label noise and that it is possible to reach near-optimal results even on extremely noisy datasets. However, our experiments also show that they mainly learn from high-frequency patterns and largely fail when tested on low-resource tasks such as few-shot learning and rare entity recognition. To mitigate such limitations, we propose an extension based on prototypical networks that improves performance in low-resource named entity recognition tasks.
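
The low-resource NER extension mentioned above builds on prototypical networks: each entity class is represented by a prototype (the mean of its support-set token embeddings), and query tokens are classified by their distance to the nearest prototype. The snippet below is a minimal sketch of that idea under those assumptions only; the function names, tensor shapes, and the choice of squared Euclidean distance are illustrative and are not the repository's actual API.

```python
# Minimal prototypical-network sketch over per-token embeddings.
# Assumes an encoder (e.g. BERT) has already produced embeddings for the
# support and query tokens; all names here are illustrative.
import torch


def prototypes_from_support(support_embeddings: torch.Tensor,
                            support_labels: torch.Tensor,
                            num_classes: int) -> torch.Tensor:
    """Compute one prototype per class as the mean of its support embeddings.

    support_embeddings: (num_support, hidden_dim)
    support_labels:     (num_support,) integer class ids in [0, num_classes)
    returns:            (num_classes, hidden_dim)
    """
    hidden_dim = support_embeddings.size(-1)
    prototypes = torch.zeros(num_classes, hidden_dim)
    for c in range(num_classes):
        mask = support_labels == c
        if mask.any():
            prototypes[c] = support_embeddings[mask].mean(dim=0)
    return prototypes


def classify_by_prototype(query_embeddings: torch.Tensor,
                          prototypes: torch.Tensor) -> torch.Tensor:
    """Assign each query token to its nearest prototype.

    query_embeddings: (num_query, hidden_dim)
    prototypes:       (num_classes, hidden_dim)
    returns:          (num_query,) predicted class ids
    """
    # Pairwise squared Euclidean distances between queries and prototypes.
    distances = torch.cdist(query_embeddings, prototypes, p=2) ** 2
    # The nearest prototype gives the predicted class.
    return torch.argmin(distances, dim=-1)
```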
