
Better drop_last fix, updated results table (RES-2222) #521

Merged: 13 commits into numenta:master on Jun 2, 2021

Conversation

benja-matic (Contributor)

Replaced the old drop_last fix with a more durable solution, which overrides the predict and evaluate methods of the trainer. This fix covers the cases where those methods are called internally in HF code, not just by us.

Updated the README results table: bert_100k, bert_1mi, and sparse_80%_kd_onecycle_lr_rigl. The bert_1mi results are calculated using an extra 10 runs on the wnli task.

There are some new experiments I'm playing with, like the "simple but hard to beat" baseline in finetuning.py. That stuff might still move around. Wanted to get this PR going for the results and the new fix.

@mvacaporale (Contributor)

Have you tried running with tiny_bert_linear_lr_range_test? It would be good to know this can fully run without any issues as the lr-range tests validate every epoch.

@mvacaporale (Contributor) left a comment

Looks good. Thanks for catching the error with the first fix. Approved barring

  • some nitpick comments about grammar and punctuation,
  • and confirmation that this fix works with the lr-range test.

| sparse_80%_kd_onecycle_lr_rigl | 75.3 | 72.83 | 38.29 | 79.72/80.88 | 87.31/82.81 | 87.65 | 89.19/85.25 | 54.3 | 88.89 | 80.74/80.59 | 53.12 | 8.482 | 2.138 |
| bert_1mi | 80.13 | 77.17 | 45.81 | 84.27/84.63 | 88.26/83.82 | 91.21 | 90.54/87.20 | 65.34 | 91.86 | 87.41/87.43 | 53.52 | 5.013 | 1.612 |
| bert_100k | 75.36 | 71.68 | 39.56 | 78.88/79.08 | 82.71/76.23 | 87.77 | 89.31/85.57 | 58.12 | 87.61 | 83.95/83.84 | 42.25 | 8.619 | 2.154 |
| sparse_80%_kd_onecycle_lr_rigl | 75.3 | 72.57 | 36.49 | 79.23/79.66 | 86.55/81.86 | 88.23 | 89.39/85.64 | 54.51 | 90.6 | 81.31/81.24 | 50.7 | 8.482 | 2.138 |
Contributor

These updated results are without the "simple but hard to beat" baseline, correct?

@@ -129,6 +129,38 @@
model_name_or_path="/mnt/efs/results/pretrained-models/transformers-local/bert_100k", # noqa: E501
)

# the name 'simple' is in reference to the paper
Contributor

This may be more of a nitpick style comment, but I believe in using proper grammar and punctuation in comments, especially longer ones; it makes them easier to read. This one could be formatted as

# The name 'simple' is in reference to the paper "On the stability of finetuning BERT"
# where they propose a "simple but hard to beat" approach
#       https://openreview.net/pdf?id=nzpLWnVAyah
#
# How to decide num_train_epochs: They say rte for 20 epochs is good ...

Notice how I try to use the full line length as well. There should be an extension in VS Code to help automatically format comments for line length.

Contributor

Also, could you please edit the comment for clarity? It's somewhat hard to follow the logic you're expressing.

@@ -708,6 +676,22 @@ def init_model(model_args, config, tokenizer, finetuning=False):
return model


def toggle_drop_last_factory(method):
Contributor

I believe this method is better described as a wrapper than a factory. Would you agree?

Contributor

I like this solution, by the way. It seems like a good way to modify the function without having to create a new subclass of Trainer.

@@ -751,6 +735,13 @@ def init_trainer(

trainer = trainer_class(**trainer_kwargs)

# issue: labels gettings set to -100 due to drop_last
Contributor

Could you please adjust the capitalization and punctuation here as well?

# Issue: labels get set to -100 due to drop_last
# Fix: override the evaluate and predict functions 
# The previous fix covers any time we call either method.
# This fix should cover every time HF calls it.
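
The comment above describes the core of the fix: override the trainer's bound evaluate and predict methods so drop_last is forced off whenever they run, including when Hugging Face code calls them internally. A minimal sketch of that idea, assuming a Trainer-like object that exposes an args.dataloader_drop_last flag (the DummyTrainer here is purely illustrative, not the repo's actual code):

```python
import functools

def toggle_drop_last_wrapper(method):
    """Return a version of a bound method that runs with drop_last disabled.

    Evaluation needs to see every label, so dataloader_drop_last is forced
    off for the duration of the call and restored afterwards.
    """
    trainer = method.__self__  # the trainer instance the method is bound to

    @functools.wraps(method)
    def inner(*args, **kwargs):
        original = trainer.args.dataloader_drop_last
        trainer.args.dataloader_drop_last = False  # keep the last partial batch
        try:
            return method(*args, **kwargs)
        finally:
            trainer.args.dataloader_drop_last = original  # restore for training
    return inner

# Illustrative stand-in for a HF Trainer, just to show the wiring.
class DummyArgs:
    def __init__(self):
        self.dataloader_drop_last = True

class DummyTrainer:
    def __init__(self):
        self.args = DummyArgs()

    def evaluate(self):
        # Real code would build an eval dataloader here; we just report the flag.
        return self.args.dataloader_drop_last

trainer = DummyTrainer()
trainer.evaluate = toggle_drop_last_wrapper(trainer.evaluate)
```

Because the wrapper replaces the attribute on the trainer instance itself, any internal call to trainer.evaluate or trainer.predict goes through it, which is why this covers the cases the earlier fix missed.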

@lucasosouza (Contributor) left a comment

Looks great

def toggle_drop_last_wrapper(method):
"""
Return a function that turns drop_last off before it is called. Used for ensuring
trainer.args.dataloader_drop_last is False during evaluation steps. After
@mvacaporale (Contributor) commented on May 28, 2021:

Thanks for the style fixes. One last thing: it seems you accidentally added an indent here in the comments.

@benja-matic benja-matic merged commit e8c0c3b into numenta:master Jun 2, 2021