-
Notifications
You must be signed in to change notification settings - Fork 25.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[s2s] add BartTranslationDistiller for distilling mBART #6363
Conversation
Codecov Report
@@ Coverage Diff @@
## master #6363 +/- ##
==========================================
- Coverage 79.93% 78.37% -1.56%
==========================================
Files 153 148 -5
Lines 27888 27196 -692
==========================================
- Hits 22293 21316 -977
- Misses 5595 5880 +285
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
…ingface#6363)" This reverts commit 569234e.
New class
BartTranslationDistiller
does the same distillation method asSummarizationDistiller
, but computes BLEU scores instead of ROUGE scores. It also accepts--src_lang
and--tgt_lang
arguments from the command line.There is one strong checkpoint already posted at
sshleifer/distillmbart-12-6/
. I will post more in the coming days.