
BartForQuestionAnswering #4908

Merged: 12 commits merged into master on Jun 12, 2020

Conversation

patil-suraj (Contributor)

This PR adds BartForQuestionAnswering.

Decided to add this model, as BART is intended for both NLU and NLG tasks and achieves performance comparable to RoBERTa on SQuAD.

I also fine-tuned the model here. The metrics are slightly worse than those reported in the paper. On SQuAD v1 I got:
{'exact_match': 86.80227057710502, 'f1': 92.73424907872341}
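
Usage follows the other extractive QA heads. A minimal inference sketch (assuming a current transformers release, where forward returns a QuestionAnsweringModelOutput rather than the tuple of 2020-era versions; the untuned facebook/bart-large checkpoint just stands in for the fine-tuned one linked above):

```python
import torch
from transformers import BartForQuestionAnswering, BartTokenizer

# facebook/bart-large here is a stand-in: its QA head is freshly
# initialized, so a real run needs a SQuAD-fine-tuned checkpoint.
tokenizer = BartTokenizer.from_pretrained("facebook/bart-large")
model = BartForQuestionAnswering.from_pretrained("facebook/bart-large")

question = "What does BART combine?"
context = (
    "BART combines a bidirectional encoder with an autoregressive decoder, "
    "so it can be used for both NLU and NLG tasks."
)
inputs = tokenizer(question, context, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# The QA head emits start/end logits over the input tokens; take the
# argmax of each and decode the span in between.
start = int(outputs.start_logits.argmax())
end = int(outputs.end_logits.argmax())
answer = tokenizer.decode(inputs["input_ids"][0][start : end + 1])
print(answer)
```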

@sshleifer, @patrickvonplaten

codecov bot commented Jun 10, 2020

Codecov Report

Merging #4908 into master will increase coverage by 0.03%.
The diff coverage is 94.11%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master    #4908      +/-   ##
==========================================
+ Coverage   76.99%   77.02%   +0.03%     
==========================================
  Files         128      128              
  Lines       21602    21635      +33     
==========================================
+ Hits        16633    16665      +32     
- Misses       4969     4970       +1     
Impacted Files Coverage Δ
src/transformers/__init__.py 99.13% <ø> (ø)
src/transformers/modeling_bart.py 96.26% <93.93%> (-0.15%) ⬇️
src/transformers/modeling_auto.py 78.40% <100.00%> (ø)
src/transformers/modeling_utils.py 90.49% <0.00%> (-0.12%) ⬇️
src/transformers/modeling_tf_utils.py 87.42% <0.00%> (+0.15%) ⬆️
src/transformers/file_utils.py 73.49% <0.00%> (+0.40%) ⬆️

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ac99217...63eb191. Read the comment docs.

sshleifer (Contributor) left a comment

LGTM

Review threads (outdated, resolved): src/transformers/modeling_bart.py, tests/test_modeling_bart.py
sshleifer changed the title from "Bart for question answering" to "BartForQuestionAnswering" on Jun 10, 2020
sshleifer (Contributor)

Thanks for the contribution, @patil-suraj!

LysandreJik (Member) left a comment

Hi! Very cool @patil-suraj.

Could you also add BartForQuestionAnswering to the all_model_classes in test_modeling_bart.py?
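
Concretely, something like this should do it (a sketch; the other tuple members are illustrative, not the exact contents of the file):

```python
# tests/test_modeling_bart.py (sketch): the shared ModelTesterMixin suite
# runs each common test against every class in this tuple, so adding the
# new head here opts it into tests like test_attention_outputs.
from transformers import is_torch_available

if is_torch_available():
    from transformers import (
        BartForConditionalGeneration,
        BartForQuestionAnswering,
        BartForSequenceClassification,
        BartModel,
    )

all_model_classes = (
    (
        BartModel,
        BartForConditionalGeneration,
        BartForSequenceClassification,
        BartForQuestionAnswering,  # the head added in this PR
    )
    if is_torch_available()
    else ()
)
```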

patil-suraj (Contributor, Author)

> Hi! Very cool @patil-suraj.
>
> Could you also add BartForQuestionAnswering to the all_model_classes in test_modeling_bart.py?

Hi @LysandreJik,
After adding BartForQuestionAnswering to all_model_classes I also had to add an output_attentions parameter to forward.
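
Roughly, the change looks like this (a sketch of the method shape, not the exact diff; the remaining forward arguments are omitted):

```python
# src/transformers/modeling_bart.py (sketch, other arguments omitted):
# the common attention test calls each model with output_attentions=True,
# so the QA head's forward must accept the flag and pass it through to
# the underlying BartModel, whose outputs then include per-layer attentions.
def forward(
    self,
    input_ids,
    attention_mask=None,
    start_positions=None,
    end_positions=None,
    output_attentions=None,
):
    outputs = self.model(
        input_ids,
        attention_mask=attention_mask,
        output_attentions=output_attentions,
    )
    ...
```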

Now test_attention_outputs is failing for some reason, and I am not sure why. Could you help me fix it?
Thanks!

patrickvonplaten (Contributor)

Awesome work @patil-suraj - I can help you with this test :-)

patrickvonplaten (Contributor) commented Jun 11, 2020

I see what the problem is... it's actually not related to your PR at all. Can you just remove BartForQuestionAnswering from the all_model_classes tuple in the tests for now? @LysandreJik @sshleifer I will open a new PR after this one to fix it :-)

patil-suraj (Contributor, Author)

> I see what the problem is... it's actually not related to your PR at all. Can you just remove BartForQuestionAnswering from the all_model_classes tuple in the tests for now? @LysandreJik @sshleifer I will open a new PR after this one to fix it :-)

Thank you @patrickvonplaten. I've removed it from the all_model_classes tuple for now.

Labels: none yet
Projects: none yet
4 participants