
Conversation

susnato (Contributor) commented Feb 23, 2023

What does this PR do?

As discussed in this issue, this PR updates the previous training script for finetuning BERT on SST2.

Colab link: https://colab.research.google.com/drive/1afTO0ahF3vZrJtkVSGXwBLV2OGeDfomL?usp=sharing

Validation scores achieved:

Epoch 1/2
4210/4210 [==============================] - 828s 191ms/step - loss: 0.3782 - sparse_categorical_accuracy: 0.8299 - val_loss: 0.4344 - val_sparse_categorical_accuracy: 0.8165
Epoch 2/2
4210/4210 [==============================] - 783s 186ms/step - loss: 0.2409 - sparse_categorical_accuracy: 0.9039 - val_loss: 0.4626 - val_sparse_categorical_accuracy: 0.8222

cc @chenmoneygithub

google-cla bot commented Feb 23, 2023

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

susnato (Author) commented Feb 23, 2023

I signed the CLA, so cla/google is green now.

susnato changed the title from "bert_tiny_uncased_en_sst2 added" to "retrained bert_tiny_uncased_en_sst2_training.ipynb" on Feb 23, 2023
susnato (Author) commented Feb 26, 2023

Hi @chenmoneygithub, I fine-tuned BertClassifier on SST2 again as you suggested. Please take a look.

chenmoneygithub (Contributor) commented

@susnato Thanks a lot! Looks beautiful overall!

Another thing you may want to do is to use a decayed learning rate:

lr = tf.keras.optimizers.schedules.PolynomialDecay(
    initial_learning_rate=5e-5,
    decay_steps={total_training_steps},
    end_learning_rate=0.0,
)
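For context, `{total_training_steps}` above is a placeholder for steps_per_epoch × num_epochs. The GLUE SST2 train split has 67,349 examples, and the 4210 steps/epoch in the logs above is consistent with a batch size of 16 (an inference, not something stated in the thread). A minimal pure-Python sketch of the step count and of the decay curve that `PolynomialDecay` produces with its default `power=1.0`:

```python
import math

# SST2 (GLUE) train split size; batch size of 16 is inferred from the
# 4210 steps/epoch shown in the training logs above.
train_examples = 67349
batch_size = 16
epochs = 2

steps_per_epoch = math.ceil(train_examples / batch_size)  # 4210
total_training_steps = steps_per_epoch * epochs           # 8420

def polynomial_decay(step, initial_lr=5e-5, decay_steps=8420,
                     end_lr=0.0, power=1.0):
    """Mirrors the formula used by tf.keras.optimizers.schedules.PolynomialDecay
    (non-cycling): (initial - end) * (1 - step/decay_steps)**power + end."""
    step = min(step, decay_steps)
    frac = 1.0 - step / decay_steps
    return (initial_lr - end_lr) * frac ** power + end_lr

print(steps_per_epoch)                          # 4210
print(polynomial_decay(0))                      # 5e-05
print(polynomial_decay(total_training_steps))   # 0.0
```

With `power=1.0` this is just a linear ramp from 5e-5 down to 0 over the whole run, so halfway through training (step 4210 of 8420) the learning rate is 2.5e-5.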

Please let me know how it works, thx again!

mattdangerw (Member) commented

Just a drive-by comment, but we should replace, not copy, the old colab in this PR. We don't want to keep spawning new versions of the colab each time we update something.

susnato (Author) commented Mar 1, 2023

Hi @chenmoneygithub, thanks for the reply! I ran it as you instructed, and it gave:

Epoch 1/2
4210/4210 [==============================] - 871s 201ms/step - loss: 0.4560 - sparse_categorical_accuracy: 0.7970 - val_loss: 0.5304 - val_sparse_categorical_accuracy: 0.7534
Epoch 2/2
4210/4210 [==============================] - 818s 194ms/step - loss: 0.3556 - sparse_categorical_accuracy: 0.8541 - val_loss: 0.5900 - val_sparse_categorical_accuracy: 0.7385

Since this works relatively worse, should I stick with the constant lr?

chenmoneygithub (Contributor) commented

@susnato Hi! Sorry for the late reply, I was on vacation last week. This is a bit odd to me; in my experiments a decayed lr generally works better (though I tried it with the BERT base model). If the constant lr is stable on your side, staying with it sounds good.

Review comment (Contributor) on the notebook's closing JSON (`"nbformat_minor": 0`):

nit: newline at the end

Review comment (Contributor) on an empty code cell in the notebook:

nit: delete the empty cell

susnato force-pushed the bert_tiny_uncased_en_sst2 branch from 6c5136d to 1cfd306 on Mar 6, 2023
susnato (Author) commented Mar 6, 2023

Hi @chenmoneygithub, thanks a lot for your comments! I made the changes you requested; please take a look.

chenmoneygithub (Contributor) commented

@susnato Oh, one more thing - could you run ./shell/format.sh in your branch? The style check is failing.

susnato (Author) commented Mar 7, 2023

@chenmoneygithub Done!

susnato force-pushed the bert_tiny_uncased_en_sst2 branch from 03866b4 to bceae22 on Mar 7, 2023
susnato (Author) commented Mar 8, 2023

Hi @chenmoneygithub, the "Check the code format" check passed! But there seems to be an error with keras-nlp-accelerator-testing; is this something related to my code?

chenmoneygithub (Contributor) commented

@susnato That one is okay, don't worry.

@chenmoneygithub chenmoneygithub merged commit 54e7b16 into keras-team:master Mar 10, 2023