[Doc] add more MBart and other doc #6490

patil-suraj · 2020-08-14T16:51:15Z

This PR

adds example for MBart
adds MBart in pre_trained models list and readme (Pegasus was missing from readme, so also added that).

patil-suraj · 2020-08-14T16:53:42Z

@sgugger do you think it would be a good idea to add more fine-tuning info for MBart, since it requires input processed in a different way than other models as it is multilingual model ?

sgugger

Thanks for your PR, it looks good to me. If there is any specific behavior for MBart needed in the preprocessing, it should be documented, yes.

sgugger · 2020-08-17T12:03:22Z

docs/source/index.rst

@@ -126,7 +126,7 @@ conversion utilities for the following models:
    Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, and Wen-tau Yih.
 23. `Pegasus <https://github.com/google-research/pegasus>`_ (from Google) released with the paper `PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
    <https://arxiv.org/abs/1912.08777>`_ by Jingqing Zhang, Yao Zhao, Mohammad Saleh and Peter J. Liu.
-24. `MBart <https://github.com/pytorch/fairseq/tree/master/examples/mbart>`_ (from Facebook) released with the paper  `Multilingual Denoising Pre-training for Neural Machine Translation <https://arxiv.org/abs/2001.08210>`_ by Yinhan Liu, Jiatao Gu, Naman Goyal, Xian Li, Sergey Edunov
+24. `MBart <https://github.com/pytorch/fairseq/tree/master/examples/mbart>`_ (from Facebook) released with the paper  `Multilingual Denoising Pre-training for Neural Machine Translation <https://arxiv.org/abs/2001.08210>`_ by Yinhan Liu, Jiatao Gu, Naman Goyal, Xian Li, Sergey Edunov,


It looks like the two lists got out of date with DPR (it should be in 22 in that list). Would you mind fixing this in your PR? Can do in another one if it's a problem.

no worries, will fix it in this PR.

patil-suraj · 2020-08-17T16:09:06Z

@sshleifer ,@sgugger added DPR in readme.

codecov · 2020-08-17T16:30:05Z

Codecov Report

Merging #6490 into master will decrease coverage by 0.46%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #6490      +/-   ##
==========================================
- Coverage   80.38%   79.91%   -0.47%     
==========================================
  Files         156      156              
  Lines       28058    28058              
==========================================
- Hits        22554    22423     -131     
- Misses       5504     5635     +131

Impacted Files	Coverage Δ
src/transformers/modeling_mbart.py	`100.00% <ø> (ø)`
src/transformers/modeling_tf_openai.py	`22.58% <0.00%> (-72.26%)`	⬇️
src/transformers/modeling_tf_flaubert.py	`24.53% <0.00%> (-63.81%)`	⬇️
src/transformers/tokenization_roberta.py	`76.71% <0.00%> (-21.92%)`	⬇️
src/transformers/tokenization_utils_base.py	`86.58% <0.00%> (-7.19%)`	⬇️
src/transformers/tokenization_transfo_xl.py	`38.73% <0.00%> (-3.76%)`	⬇️
src/transformers/tokenization_auto.py	`95.55% <0.00%> (-2.23%)`	⬇️
src/transformers/tokenization_utils_fast.py	`92.14% <0.00%> (-2.15%)`	⬇️
src/transformers/tokenization_openai.py	`82.57% <0.00%> (-1.52%)`	⬇️
src/transformers/generation_tf_utils.py	`84.96% <0.00%> (-1.51%)`	⬇️
... and 9 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 895ed8f...e1c522b. Read the comment docs.

sgugger · 2020-08-17T16:30:23Z

Great! Thanks for the PR.

* add mbart example * add Pegasus and MBart in readme * typo * add MBart in Pretrained models * add pre-proc doc * add DPR in readme * fix indent * doc fix

This reverts commit fff84fa.

patil-suraj added 4 commits August 14, 2020 22:16

add mbart example

f3cb9a9

add Pegasus and MBart in readme

44a6106

typo

f416058

add MBart in Pretrained models

000fc27

sgugger reviewed Aug 17, 2020

View reviewed changes

sshleifer approved these changes Aug 17, 2020

View reviewed changes

patil-suraj added 2 commits August 17, 2020 21:17

add pre-proc doc

517f263

add DPR in readme

f01652c

patil-suraj changed the title ~~[Doc] add more MBart doc~~ [Doc] add more MBart and other doc Aug 17, 2020

patil-suraj added 2 commits August 17, 2020 21:42

fix indent

ec11e48

doc fix

e1c522b

sgugger merged commit c9564f5 into huggingface:master Aug 17, 2020

fabiocapsouza added a commit to fabiocapsouza/transformers that referenced this pull request Nov 15, 2020

Revert "[Doc] add more MBart and other doc (huggingface#6490)"

a9af696

This reverts commit fff84fa.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Doc] add more MBart and other doc #6490

[Doc] add more MBart and other doc #6490

patil-suraj commented Aug 14, 2020

patil-suraj commented Aug 14, 2020

sgugger left a comment

sgugger Aug 17, 2020

patil-suraj Aug 17, 2020

sgugger Aug 17, 2020

patil-suraj commented Aug 17, 2020

codecov bot commented Aug 17, 2020 •

edited

Loading

sgugger commented Aug 17, 2020

[Doc] add more MBart and other doc #6490

[Doc] add more MBart and other doc #6490

Conversation

patil-suraj commented Aug 14, 2020

patil-suraj commented Aug 14, 2020

sgugger left a comment

Choose a reason for hiding this comment

sgugger Aug 17, 2020

Choose a reason for hiding this comment

patil-suraj Aug 17, 2020

Choose a reason for hiding this comment

sgugger Aug 17, 2020

Choose a reason for hiding this comment

patil-suraj commented Aug 17, 2020

codecov bot commented Aug 17, 2020 • edited Loading

Codecov Report

sgugger commented Aug 17, 2020

codecov bot commented Aug 17, 2020 •

edited

Loading