Implements entity pre-training #178

JohnGiorgi · 2019-09-09T23:34:50Z

Overview

This PR implements entity pre-training. Simply put, entity pre-training involves pre-training the entity module before training the entire NER + RE model jointly.

Previously, we implemented this by delaying the training of the RE module by a full epoch. I have found a different yet equally simple technique that works quite well: the loss function is weighted according to

loss = ner_loss + decay_coef * re_loss

where 0 <= decay_coef <= 1 is computed as current_step / total_steps during the first epoch and is 1 otherwise. I saw gains of up to 1% on the validation set using this scheme.
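For illustration, the weighting can be sketched roughly as below. This is not the code from the PR: the function names `re_loss_weight` and `joint_loss` are hypothetical, epochs are assumed to be zero-indexed, and `total_steps` is assumed to mean the number of steps in the first epoch.

```python
def re_loss_weight(current_step: int, total_steps: int, epoch: int) -> float:
    """Weight applied to the RE loss (hypothetical helper).

    Ramps linearly from 0 to 1 over the first epoch (epoch == 0, an
    assumption about indexing) and stays at 1 for every later epoch.
    """
    if epoch > 0:
        return 1.0
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Clamp so the weight always stays within [0, 1].
    return min(max(current_step / total_steps, 0.0), 1.0)


def joint_loss(ner_loss, re_loss, current_step, total_steps, epoch):
    """Combine the two losses as ner_loss + weight * re_loss."""
    return ner_loss + re_loss_weight(current_step, total_steps, epoch) * re_loss
```

Under these assumptions the RE loss contributes nothing at the first step, half its value midway through the first epoch, and its full value from the second epoch onward, so the entity module gets a head start without a hard switch.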

coveralls · 2019-09-09T23:51:25Z

Pull Request Test Coverage Report for Build 502

3 of 3 (100.0%) changed or added relevant lines in 1 file are covered.
2 unchanged lines in 1 file lost coverage.
Overall coverage increased (+0.007%) to 88.091%

Files with Coverage Reduction	New Missed Lines	%
saber/models/modules/bert_for_entity_and_relation_extraction.py	2	95.15%

Totals
Change from base Build 500:	0.007%
Covered Lines:	1435
Relevant Lines:	1629

💛 - Coveralls

✨ Implements entity pre-training

bd2bbe5

JohnGiorgi self-assigned this Sep 9, 2019

JohnGiorgi added the feature label Sep 9, 2019

JohnGiorgi merged commit f73d6b8 into development Sep 10, 2019

JohnGiorgi deleted the entity-pretraining branch September 10, 2019 00:12
