Peft sentiment #1335
Conversation
…arlm. This is hard to diagnose for the models which were not previously saved with this information
Just some clarification questions. See comments below; thanks in advance!
target_modules=["query", "value", "output.dense", "intermediate.dense"],  # self.config.lora_targets
lora_alpha=128,  # self.config.lora_alpha
lora_dropout=0.1,  # self.config.lora_dropout
modules_to_save=["pooler"],  # self.config.lora_fully_tune
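For context, a minimal sketch of how these hard-coded values might slot into a full `LoraConfig` (assuming the Hugging Face `peft` library; the rank `r` below is illustrative and not a value from this PR):

```python
from peft import LoraConfig

# Sketch only: r=64 is an assumed rank, not a setting taken from this PR.
lora_config = LoraConfig(
    r=64,
    lora_alpha=128,
    lora_dropout=0.1,
    target_modules=["query", "value", "output.dense", "intermediate.dense"],
    # Layers listed here are fully trained and saved alongside the LoRA adapters,
    # which is what makes the pooler question a size/accuracy tradeoff.
    modules_to_save=["pooler"],
)
```

Modules in `modules_to_save` are checkpointed in full rather than as low-rank adapters, so adding the pooler there grows the saved model.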
this doesn't exist for all languages' BERTs, and the NER results I reported didn't use this. how much of an improvement / size tradeoff does this confer?
Good question. I can run that experiment.
the experiments were run with electra, which doesn't have a pooler. listing it as fully trained doesn't make a difference there. can rerun the trial with roberta. i remember finetuning the pooler to be important with coref, although i don't have the results in front of me
Averaged over 4 runs with roberta-large, I got 0.7391 F1 w/ the pooler and 0.7422 w/o. Fully training the pooler increased the size from 163M to 167M. My conclusion is that we don't train the pooler for sentiment. I can always try pefting it instead of fully training it.
never mind on that, can't peft a pooler
…ft-based test for sentiment
Ran an experiment with 4 models where finetuning the pooler for roberta-large on sstplus got 0.7391 average F1, whereas not finetuning got 0.7422 average. Considering the model size increase (163M -> 167M), it seems not worthwhile to finetune this layer.
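The tradeoff above can be spelled out with a bit of arithmetic (numbers taken directly from the runs reported in this thread):

```python
# Average F1 from the 4-run roberta-large experiment on sstplus.
f1_with_pooler = 0.7391
f1_without_pooler = 0.7422

# Saved model sizes in millions of parameters.
size_with_pooler = 167
size_without_pooler = 163

# Dropping the pooler both improves F1 and shrinks the checkpoint.
f1_delta = f1_without_pooler - f1_with_pooler
size_increase_pct = (size_with_pooler - size_without_pooler) / size_without_pooler * 100

print(f"F1 change from dropping the pooler: +{f1_delta:.4f}")
print(f"size cost of fully training the pooler: {size_increase_pct:.1f}%")
```

So fully training the pooler costs roughly 2.5% more parameters while slightly hurting average F1, which supports the conclusion to leave it out.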
Add a PEFT wrapper for the Sentiment training.
Works quite well on English, actually, even without splitting the optimizer or implementing any form of scheduling.
With no finetuning, adding electra-large to the 3 class English dataset (SST plus a few other pieces) gets 70 Macro F1.
The base finetuning gets between 74-75 macro F1 on sstplus, but frequently fails to train successfully, getting somewhere around 60 F1.
Training with PEFT gets in the 74-75 F1 range each time, with no failures observed so far.
Adds a test to the sentiment training which starts the Pipeline with a peft-trained model.
Also included is a uses-charlm flag in the config, so that inadvertently passing a charlm (such as via Pipeline) to a sentiment model trained without a charlm doesn't blow up.
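A minimal sketch of how such a flag could guard charlm handling at load time (the function and the `uses_charlm` key are illustrative stand-ins, not the actual Stanza names):

```python
def maybe_attach_charlm(config, charlm_forward=None, charlm_backward=None):
    """Drop any charlms the Pipeline passes in if the model was trained without one.

    `uses_charlm` is a hypothetical stand-in for the flag added in this PR;
    the real key name and config layout may differ.
    """
    if not config.get("uses_charlm", False):
        # Model was trained without a charlm: ignore whatever the Pipeline
        # hands us instead of blowing up on a mismatched architecture.
        return None, None
    return charlm_forward, charlm_backward
```

A model saved before this flag existed simply defaults to "no charlm", which is the hard-to-diagnose case the PR description mentions.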