[WIP] Add BART for summarization training with CNN/DM using pytorch-lightning #3236
Conversation
Codecov Report

@@            Coverage Diff            @@
##           master    #3236   +/-   ##
=======================================
  Coverage   77.56%   77.56%
=======================================
  Files         100      100
  Lines       16970    16970
=======================================
  Hits        13162    13162
  Misses       3808     3808

Continue to review the full report at Codecov.
Nice! @yjernite might be interested!
Thanks for starting this! I left a couple nitpicks, but looks reasonable to me. Were you planning on running finetuning for longer and posting results?
I made those requested changes. And yes, I'm planning to run finetuning this weekend and share results. I only have access to a K80, so it'll take a while 🤷🏽‍♂️
This looks awesome. Let's coordinate with #3290 as well to share whatever code is possible.
@nateraw can you do a review of this PR as well?
Once #3290 gets merged, you'll have to update a few things, so I marked some here so you can get ahead of the curve on that. Once it's merged I'll be able to give a little more specific advice. Great work 👍
@acarrera94 I will try to get this working this week. If you are in the pytorch-lightning open Slack, we can also chat a bit more about the design.
It's blocked on me, I should be able to get to it tonight.
Thanks! This code is nicely done. If you integrate with @nateraw's work, I think it will eliminate about half the code. I would also recommend moving it all into one file (utils will become very small).
…ing test, removed unused imports and functions
force-pushed from 2dfbb55 to 8e24219
New code looks great. Excited to try it out!
Thanks for sticking with it @ACarrera. I'm really impressed by how concise this became. Next we can get some numbers.
@acarrera94
@sshleifer I usually ran it with --max_seq_length=756, and that used less than 16 GB of memory with a batch size of 4, so we might want to change that default. And I haven't tried it with --fp16. That comes from BaseTransformer, right?
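For reference, a run with the lowered sequence length discussed above might look something like this. The script name and exact flag spellings are assumptions for illustration; check the merged example for the real ones:

```shell
# Hypothetical invocation — script name and flag names may differ in the final example.
python run_bart_sum.py \
  --data_dir=cnn_dm \
  --model_name_or_path=bart-large \
  --max_seq_length=756 \
  --train_batch_size=4 \
  --output_dir=bart_sum_output
```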
This pull request adds training on CNN/DM to the BART summarization example, using the pytorch-lightning NER example as guidance. The example trains, evaluates, and gets decent results, though I haven't trained it on the full dataset just yet. I'm sure there are better defaults for the hyperparameters, but these seem to work.
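As background on the training setup: seq2seq finetuning like this scores the model against the reference summary token by token, and padding positions in the target must be excluded from the loss. A minimal stdlib sketch of that masking step, assuming a pad id of 0 (in the real code these values come from the tokenizer, and -100 is PyTorch's CrossEntropyLoss default ignore_index):

```python
PAD_ID = 0          # hypothetical pad token id; in practice tokenizer.pad_token_id
IGNORE_INDEX = -100  # targets with this value are ignored by PyTorch's CrossEntropyLoss

def mask_padding(label_ids, pad_id=PAD_ID, ignore_index=IGNORE_INDEX):
    """Replace pad positions in a target summary so they don't contribute to the loss."""
    return [ignore_index if tok == pad_id else tok for tok in label_ids]

batch_labels = [[42, 17, 99, PAD_ID, PAD_ID], [5, 6, 7, 8, 9]]
masked = [mask_padding(seq) for seq in batch_labels]
print(masked)  # [[42, 17, 99, -100, -100], [5, 6, 7, 8, 9]]
```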
I based this PR on the code I wrote in this colab.
This would hopefully close #3004
TODO
Happy to hear any feedback!