Extend BERT-based classification with customized layers #4553

Golovneva · 2022-05-17T20:16:58Z

Patch description
Added functionality to specify custom decoder layers for BERT-based classification. Code is a modification of existing in external ParlAI functions.

Testing steps

parlai train_model -m bert_classifier -t snli --classes 'entailment' 'contradiction' 'neutral' -mf /tmp/BERT_snli -bs 20 --classifier-layers linear,64 linear,3 relu

...
13:11:19 | Current ParlAI commit: 844a027ec81d543477d135a87eb5274ef4c013bd
13:11:19 | Current internal commit: 27aa6546aaec9e2e06069faabf2bb34aec1ba9f7
13:11:19 | Current fb commit: a69320df72c2a0c76873574e941eff3dc380fc4b
13:11:19 | creating task(s): snli
loading: /private/home/olggol/ParlAI/data/SNLI/snli_1.0/snli_1.0_train.jsonl
13:11:25 | training...
13:11:32 | time:7s total_exs:1000 total_steps:50 epochs:0.00
    accuracy   bleu-4  class_contradiction_f1  class_contradiction_prec  class_contradiction_recall  class_entailment_f1  class_entailment_prec  class_entailment_recall  class_neutral_f1  class_neutral_prec  \
       .3310 3.31e-10                  .05556                     .2703                      .03096                .4871                  .3311                    .9207             .0950               .3725
    class_neutral_recall  clen  clip  ctpb  ctps  ctrunc  ctrunclen  exps  exs    f1  gnorm  gpu_mem  llen  loss  lr  ltpb  ltps  ltrunc  ltrunclen  total_train_updates   tpb  tps  ups  weighted_f1
                  .05444 27.29 .1400 565.8  4111       0          0 145.3 1000 .3310  .2950   .09049 3.656 1.103   1 73.12 531.3       0          0                   50 638.9 4642 7.27        .2109
...

Other information

klshuster

I think this all looks great! Would it be possible to add a short test?

klshuster · 2022-05-18T20:52:09Z

parlai/agents/bert_classifier/bert_classifier.py

@@ -90,20 +99,6 @@ def add_cmdline_args(
        """
        super().add_cmdline_args(parser, partial_opt=partial_opt)
        parser = parser.add_argument_group("BERT Classifier Arguments")
-        parser.add_argument(


was this option just never used?

Right. I actually also found it in BertWrapper's add_common_args function, also never used, so I'll remove it from there

klshuster · 2022-05-18T20:53:09Z

parlai/agents/bert_classifier/bert_classifier.py

+
+        if ind < len(dimensions):
+            raise Exception(
+                "Output layer's dimension does not match number of classes. Found {dimensions[ind][1]}, expected {output_dimension}"


nit: think you're missing f"" string here

klshuster · 2022-05-18T20:53:13Z

parlai/agents/bert_classifier/bert_classifier.py

+                "Output layer's dimension does not match number of classes. Found {dimensions[ind][1]}, expected {output_dimension}"
+            )
+        raise Exception(
+            "Output layer's dimension does not match number of classes. Found {prev_dimension}, expected {output_dimension}"


klshuster · 2022-05-18T20:56:53Z

parlai/agents/bert_ranker/helpers.py

-        aggregation="first",
+        bert_model: BertModel,
+        output_dim: int = -1,
+        classifier_layer: torch.nn.Module = None,


nit: could you please move this arg to be the last one? so that prior calls to this __init__ don't fail

fixed issues and added tests

klshuster

thank you for adding tests!

edit: approving assuming long_gpu_tests pass

Golovneva · 2022-05-23T20:47:28Z

I have changed the torch version in CircleCI config to make it work since CR was approved, so re-requesting approval for this change

* Extend BERT-based classification with customized layers * fix bugs and add tests * increase lr to improve training stability * upgrading torch version * adjusting loss value

Extend BERT-based classification with customized layers

d99616e

facebook-github-bot added the CLA Signed label May 17, 2022

moyapchen requested review from stephenroller and klshuster May 18, 2022 14:35

klshuster reviewed May 18, 2022

View reviewed changes

fix bugs and add tests

ef1c2c0

klshuster approved these changes May 19, 2022

View reviewed changes

Golovneva added 3 commits May 19, 2022 13:56

increase lr to improve training stability

4fa0d08

upgrading torch version

6f96100

adjusting loss value

57d4aef

Golovneva requested a review from klshuster May 23, 2022 20:46

Golovneva merged commit 1628d8c into main May 24, 2022

Golovneva deleted the olggol/bert-classifier branch May 24, 2022 17:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend BERT-based classification with customized layers #4553

Extend BERT-based classification with customized layers #4553

Golovneva commented May 17, 2022

klshuster left a comment

klshuster May 18, 2022

Golovneva May 18, 2022

klshuster May 18, 2022

klshuster May 18, 2022

klshuster May 18, 2022

Golovneva May 19, 2022

klshuster left a comment •

edited

Golovneva commented May 23, 2022

Extend BERT-based classification with customized layers #4553

Extend BERT-based classification with customized layers #4553

Conversation

Golovneva commented May 17, 2022

klshuster left a comment

Choose a reason for hiding this comment

klshuster May 18, 2022

Choose a reason for hiding this comment

Golovneva May 18, 2022

Choose a reason for hiding this comment

klshuster May 18, 2022

Choose a reason for hiding this comment

klshuster May 18, 2022

Choose a reason for hiding this comment

klshuster May 18, 2022

Choose a reason for hiding this comment

Golovneva May 19, 2022

Choose a reason for hiding this comment

klshuster left a comment • edited

Choose a reason for hiding this comment

Golovneva commented May 23, 2022

klshuster left a comment •

edited