[Bert2Bert] allow bert2bert + relative embeddings #14324
Conversation
Thanks for fixing! IMO, we can make this better by adding an init argument to the attention layer with this flag.
# cross attention cannot have relative position embeddings
cross_attention_config = copy.deepcopy(config)
cross_attention_config.position_embedding_type = "absolute"
self.crossattention = BertAttention(cross_attention_config)
This is a bit hackish. If we start to turn this argument on and off, maybe it should be an init argument? It can default to None, in which case we take the config value (so this is 100% backward compatible).
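A rough sketch of what this suggestion could look like, assuming the attention layer's constructor is extended with an optional argument; the class and attribute names mirror the existing `BertSelfAttention`, but the signature change shown here is only the reviewer's proposal, not the code in this PR:

```python
import torch.nn as nn


class BertSelfAttention(nn.Module):
    # Sketch: only the proposed constructor change is shown.
    def __init__(self, config, position_embedding_type=None):
        super().__init__()
        # None means "inherit from the config", so existing callers keep their
        # behavior and the change stays backward compatible.
        self.position_embedding_type = position_embedding_type or getattr(
            config, "position_embedding_type", "absolute"
        )
        # ... remaining attention parameters would be defined here as usual ...
```

With that argument in place, the cross-attention layer could simply be built with `position_embedding_type="absolute"` instead of deep-copying and mutating the config.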
yeah agree
Thanks @patrickvonplaten for fixing this! It looks correct that relative position embeddings should not be used on cross-attention layers.
Yes, LGTM!
Merging now to prevent merge conflicts.
What does this PR do?
Fixes #14010
Everything is explained in #14010. IMO, Bert2Bert-like models should not (and also cannot really) make use of positional bias in the cross-attention layers. The PR therefore forces cross-attention layers to always use "absolute" position encodings.
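The motivation is that relative position embeddings encode a query–key distance, which is not meaningful when the queries come from the decoder sequence and the keys from the encoder sequence. A minimal sketch (not part of the PR) of the setup this change enables: a Bert2Bert encoder-decoder whose self-attention layers use relative position embeddings, while the cross-attention layers internally fall back to "absolute". The small config values are illustrative only.

```python
from transformers import BertConfig, EncoderDecoderConfig, EncoderDecoderModel

# Tiny, randomly initialized configs just for illustration.
encoder_config = BertConfig(
    hidden_size=128,
    num_hidden_layers=2,
    num_attention_heads=4,
    intermediate_size=256,
    position_embedding_type="relative_key_query",
)
decoder_config = BertConfig(
    hidden_size=128,
    num_hidden_layers=2,
    num_attention_heads=4,
    intermediate_size=256,
    position_embedding_type="relative_key_query",
    is_decoder=True,
    add_cross_attention=True,
)

# Build the Bert2Bert model; with this PR, its cross-attention layers
# use "absolute" position embeddings regardless of the config value.
config = EncoderDecoderConfig.from_encoder_decoder_configs(encoder_config, decoder_config)
model = EncoderDecoderModel(config=config)
```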
Before submitting
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.