
[BUGFIX] BART and mBART support 2D attention mask from tokenizer #1637

Merged: 2 commits merged into PaddlePaddle:develop on Jan 26, 2022

Conversation

gongel (Member) commented on Jan 25, 2022

PR types

Bug fixes

PR changes

Others

Description

```diff
@@ -286,7 +286,13 @@ def forward(self, input_ids=None, attention_mask=None, **kwargs):
             attention_mask = paddle.cast(
                 input_ids == self.pad_token_id,
                 dtype=paddle.get_default_dtype()).unsqueeze([1, 2]) * -1e4
             attention_mask.stop_gradient = True

+        # For 2D attention_mask from tokenizer
```
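For context, here is a hedged sketch of the behavior this hunk belongs to. Before this PR, BART/mBART only built an attention mask internally from `pad_token_id`; the fix additionally accepts the 2D `attention_mask` a tokenizer returns (shape `[batch_size, seq_len]`, 1 = attend, 0 = pad) and expands it to the 4D additive form the encoder consumes. The helper name `expand_attention_mask` and the exact branch layout below are illustrative assumptions, not the verbatim PaddleNLP code:

```python
import paddle

def expand_attention_mask(input_ids, attention_mask, pad_token_id):
    # Hypothetical helper mirroring the forward() logic in the hunk above.
    if attention_mask is None:
        # Existing branch: derive the mask from padding tokens.
        # Pad positions -> -1e4 (ignored), real tokens -> 0 (attended).
        attention_mask = paddle.cast(
            input_ids == pad_token_id,
            dtype=paddle.get_default_dtype()).unsqueeze([1, 2]) * -1e4
    elif attention_mask.ndim == 2:
        # Case this PR adds: a 2D 0/1 mask from the tokenizer is lifted to
        # [batch_size, 1, 1, seq_len] and converted to the additive form,
        # so 0 (pad) becomes -1e4 and 1 (attend) becomes 0.
        attention_mask = paddle.unsqueeze(
            attention_mask, axis=[1, 2]).astype(paddle.get_default_dtype())
        attention_mask = (1.0 - attention_mask) * -1e4
    attention_mask.stop_gradient = True
    return attention_mask
```

With a conversion like this in place, the 2D mask produced by `tokenizer(text, padding=True)` can be passed straight into `BartModel`/`MBartModel` rather than having to be rebuilt from `pad_token_id`.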
Reviewer (Member)


The comment is in the wrong place.

gongel (Member, Author)


Done and thx.

gongel merged commit 32d01fa into PaddlePaddle:develop on Jan 26, 2022
3 participants