
[s2s] add support for overriding config params #6149

Merged: 6 commits into huggingface:master from stas00:seq2seq-train_params-1, Jul 30, 2020

Conversation

@stas00 (Contributor) commented on Jul 30, 2020

add support for overriding model params:

python finetune.py --encoder_layerdrop 0.1 --decoder_layerdrop 0.1 --dropout 0.1 --attention_dropout 0.1

as requested at #6018

The README.md diff is mostly the editor removing superfluous whitespace; not sure why GitHub shows it, normally it doesn't. The only added doc section is https://github.com/stas00/transformers/blob/seq2seq-train_params-1/examples/seq2seq/README.md#finetuning-training-params
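
Conceptually, the override works roughly like the sketch below: any flag actually passed on the command line is copied onto model.config, and an unknown attribute trips an assert. The function and argument names here are assumed for illustration, not the exact code in finetune.py.

extra_model_params = ("encoder_layerdrop", "decoder_layerdrop", "dropout", "attention_dropout")

def override_config(config, hparams):
    # Sketch only: copy CLI overrides that were actually given onto the model config.
    for p in extra_model_params:
        value = getattr(hparams, p, None)
        if value is None:
            continue  # flag not passed; keep the config's default
        assert hasattr(config, p), f"model config doesn't have a `{p}` attribute"
        setattr(config, p, value)
    return config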

@codecov (codecov bot) commented on Jul 30, 2020

Codecov Report

Merging #6149 into master will increase coverage by 1.32%.
The diff coverage is n/a.


@@            Coverage Diff             @@
##           master    #6149      +/-   ##
==========================================
+ Coverage   78.35%   79.68%   +1.32%     
==========================================
  Files         146      146              
  Lines       26403    26403              
==========================================
+ Hits        20689    21039     +350     
+ Misses       5714     5364     -350     
Impacted Files Coverage Δ
src/transformers/modeling_tf_flaubert.py 24.22% <0.00%> (-63.98%) ⬇️
src/transformers/tokenization_bart.py 60.56% <0.00%> (-35.22%) ⬇️
src/transformers/modeling_tf_gpt2.py 65.42% <0.00%> (-29.91%) ⬇️
src/transformers/generation_tf_utils.py 85.71% <0.00%> (-0.51%) ⬇️
src/transformers/file_utils.py 82.20% <0.00%> (-0.29%) ⬇️
src/transformers/modeling_tf_distilbert.py 98.79% <0.00%> (+34.61%) ⬆️
src/transformers/modeling_tf_mobilebert.py 96.77% <0.00%> (+73.38%) ⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 54f9fbe...5476a9e.

- optional
- goes into model.config
@sshleifer (Contributor) left a comment


Can you add test cases:

  • a good case where model.config has the new value
  • a case (T5) that hits your assert; it's called dropout_rate in T5. We can expose that or leave it for future work.

I'm surprised you didn't need to change the CHEAP_ARGS constant in the tests.

@sshleifer (Contributor)

The code looks perfect.

@stas00 stas00 changed the title add support for overriding model params [WIP] add support for overriding model params Jul 30, 2020
@stas00 (Contributor, Author) commented on Jul 30, 2020

I'm surprised you didn't need to change the CHEAP_ARGS constant in the tests.

because the new args are optional? Unless you mean something else.

...Working on the tests.
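
(A hypothetical illustration of that point: because the new flags default to None, argument tuples that don't mention them are unaffected. The flag names match the PR; everything else here is assumed.)

import argparse

parser = argparse.ArgumentParser()
parser.add_argument("--encoder_layerdrop", type=float, default=None,
                    help="encoder layerdrop probability (optional); goes into model.config")
parser.add_argument("--dropout", type=float, default=None,
                    help="dropout probability (optional); goes into model.config")

args = parser.parse_args([])   # nothing passed: both stay None, so nothing is overridden
print(args.dropout)            # None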

@stas00 (Contributor, Author) commented on Jul 30, 2020

Added tests as suggested.
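
For illustration, a self-contained sketch of the two requested cases; apply_overrides is a stand-in for the finetune.py logic and SimpleNamespace stands in for the real configs, so this is not the actual test code added in the PR.

import pytest
from types import SimpleNamespace

def apply_overrides(config, **overrides):
    # Stand-in for the logic under test: set each override, asserting the attribute exists.
    for name, value in overrides.items():
        if value is None:
            continue
        assert hasattr(config, name), f"config has no attribute `{name}`"
        setattr(config, name, value)
    return config

def test_override_is_applied():
    # good case: the value lands in the config
    config = SimpleNamespace(dropout=0.0)
    apply_overrides(config, dropout=0.1)
    assert config.dropout == 0.1

def test_override_of_missing_attribute_asserts():
    # T5 names this attribute dropout_rate, so passing `dropout` should hit the assert
    t5_like_config = SimpleNamespace(dropout_rate=0.0)
    with pytest.raises(AssertionError):
        apply_overrides(t5_like_config, dropout=0.1)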

@sshleifer sshleifer changed the title [WIP] add support for overriding model params [s2s] add support for overriding config params Jul 30, 2020
@sshleifer (Contributor) commented on Jul 30, 2020

Good alias:

sty () {
	make style                                  # auto-format the code
	flake8 examples templates tests src utils  # then lint the main source trees
}

@sshleifer sshleifer merged commit 3212b88 into huggingface:master Jul 30, 2020
@stas00 stas00 deleted the seq2seq-train_params-1 branch July 30, 2020 06:24
stas00 added a commit to stas00/transformers that referenced this pull request Aug 1, 2020
This is a follow-up to huggingface#6149:
- there was no need to add the newly added options to finetune.sh; reverted that change
- added a hint telling users how to get all the options (--help)
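
For example, running the script from examples/seq2seq with --help lists every available option:

python finetune.py --help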
Labels: none yet
Projects: none yet
Linked issues: none yet
2 participants