Added example for fine tuning BERT on Text Extraction task (SQuAD) #46

Merged (18 commits) on May 24, 2020

Conversation

apoorvnandan (Contributor)

  • Uses pretrained bert-base-uncased from HuggingFace and their tokenizers (see the loading sketch below).
  • Gets an exact match score of 79.4 after 2 epochs on TPU. (The paper reports 80.8; the gap is most likely due to a number of extra post-processing steps in the original implementation when converting the predicted token indexes back to the answer text.)
  • We will have to limit the data before training begins, because training on the full dataset would take hours on CPU.
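
For reference, a minimal sketch of the loading step, assuming the transformers and tokenizers packages and a local directory for the saved vocabulary (paths and names here are illustrative, not the PR's final code):

import os

from tokenizers import BertWordPieceTokenizer
from transformers import BertTokenizer, TFBertModel

# Save the vocabulary via the slow tokenizer, then build the fast WordPiece
# tokenizer from the saved vocab file.
slow_tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
save_path = "bert_base_uncased/"
os.makedirs(save_path, exist_ok=True)
slow_tokenizer.save_pretrained(save_path)
tokenizer = BertWordPieceTokenizer(save_path + "vocab.txt", lowercase=True)

# Pretrained BERT encoder with TensorFlow weights.
encoder = TFBertModel.from_pretrained("bert-base-uncased")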

@fchollet (Member) left a comment

Thank you for the PR. It looks great! Very useful example.

I fixed various minor nits. Other than that, my comments deal with documentation improvements.

@@ -0,0 +1,318 @@
"""
Title: BERT for Text Extraction
fchollet (Member):

Let's go with "BERT (from HuggingFace Transformers) for Text Extraction" -- in the future we will have BERT examples that don't use HuggingFace.

apoorvnandan (Contributor, Author):

Oh okay. By the way, I am very interested in writing examples of BERT and GPT with just tf.keras. Maybe just the pretraining part, for example. We can discuss it over a separate issue if you think it would be a good addition.

Author: [Apoorv Nandan](https://twitter.com/NandanApoorv)
Date created: 2020/05/23
Last modified: 2020/05/23
Description: Fine tune pretrained BERT from HuggingFace on SQuAD.
fchollet (Member):

"HuggingFace Transformers"

return model


use_tpu = True
fchollet (Member):

Please add a paragraph of text explaining that this example should preferably be run on the Colab TPU runtime.


class ExactMatch(keras.callbacks.Callback):
    def on_epoch_end(self, epoch, logs=None):
        pred_start, pred_end = self.model.predict(x_eval)
fchollet (Member):

Rather than using x_eval from the outer scope, pass it to __init__ and set it as a callback attribute.

In general: data should be passed as an argument; functions / classes can be fetched from the outer scope.
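
A minimal sketch of the suggested refactor, with an illustrative docstring (the attribute names and wording are assumptions, not the final code in the PR):

from tensorflow import keras


class ExactMatch(keras.callbacks.Callback):
    """Computes the exact-match score at the end of each epoch.

    Predicts start and end token indexes for every sample in the evaluation
    set, maps them back to answer text, and counts how many predictions
    match a ground-truth answer exactly.
    """

    def __init__(self, x_eval, y_eval):
        super().__init__()
        self.x_eval = x_eval
        self.y_eval = y_eval

    def on_epoch_end(self, epoch, logs=None):
        pred_start, pred_end = self.model.predict(self.x_eval)
        # ... convert the predicted spans back to text and compare with y_eval.

The callback instance would then be created as ExactMatch(x_eval, y_eval) and passed through the callbacks argument of model.fit.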

return text


class ExactMatch(keras.callbacks.Callback):
fchollet (Member):

Please add a docstring with a quick explanation of how the callback works

print(f"{len(eval_squad_examples)} evaluation points created.")

"""
Create the Question Answering Model using BERT and Functional API
fchollet (Member):

Question-Answering
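
For context, a rough sketch of the Question-Answering model built with the Functional API on top of HuggingFace's TFBertModel; max_len, the learning rate, and the tuple-style encoder output are assumptions, not necessarily the final code in the PR:

import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers
from transformers import TFBertModel

max_len = 384  # Assumed maximum sequence length.


def create_model():
    # BERT expects token ids, token type ids, and an attention mask.
    input_ids = layers.Input(shape=(max_len,), dtype=tf.int32)
    token_type_ids = layers.Input(shape=(max_len,), dtype=tf.int32)
    attention_mask = layers.Input(shape=(max_len,), dtype=tf.int32)

    encoder = TFBertModel.from_pretrained("bert-base-uncased")
    sequence_output = encoder(
        input_ids, token_type_ids=token_type_ids, attention_mask=attention_mask
    )[0]

    # One probability distribution over tokens for the start of the answer
    # span, and one for the end.
    start_logits = layers.Flatten()(layers.Dense(1, use_bias=False)(sequence_output))
    end_logits = layers.Flatten()(layers.Dense(1, use_bias=False)(sequence_output))
    start_probs = layers.Activation(keras.activations.softmax)(start_logits)
    end_probs = layers.Activation(keras.activations.softmax)(end_logits)

    model = keras.Model(
        inputs=[input_ids, token_type_ids, attention_mask],
        outputs=[start_probs, end_probs],
    )
    loss = keras.losses.SparseCategoricalCrossentropy(from_logits=False)
    model.compile(optimizer=keras.optimizers.Adam(learning_rate=5e-5), loss=[loss, loss])
    return model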


use_tpu = True
if use_tpu:
    # create distribution strategy
fchollet (Member):

Nit: please capitalize comments
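
For reference, the usual Colab TPU setup that the use_tpu branch would wrap; this is a generic sketch of the TensorFlow TPU API (with create_model assumed to be defined as in the example), not necessarily the exact code in the PR:

import tensorflow as tf

use_tpu = True
if use_tpu:
    # Connect to the TPU cluster and create a distribution strategy.
    resolver = tf.distribute.cluster_resolver.TPUClusterResolver()
    tf.config.experimental_connect_to_cluster(resolver)
    tf.tpu.experimental.initialize_tpu_system(resolver)
    strategy = tf.distribute.experimental.TPUStrategy(resolver)
    # Model building and compilation must happen inside the strategy scope.
    with strategy.scope():
        model = create_model()
else:
    model = create_model()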

@apoorvnandan (Contributor, Author)

Thanks for the comments. Making the required changes.

@fchollet (Member) left a comment

LGTM, thank you. Please add the generated files (or let me know if you want me to generate them instead).

@apoorvnandan (Contributor, Author) commented May 23, 2020

I'll be running the add_example command on my laptop. So, I'll change

use_tpu = False

and add some lines like

# Remove these lines to train on the entire data
x_train  = [_[:10,:] for _ in x_train]
y_train  = [_[:10,:] for _ in y_train]
model.fit(x_train, y_train, ...)

Or is there a better way?

@fchollet (Member) commented May 23, 2020

How long does it take to run the example on a V100? (as an approximate estimate)

If less than ~20 min on a V100, I can run it on my side and generate the files

@apoorvnandan (Contributor, Author)

1 epoch took 1 hour 12 min on Colab GPU (K80 I think?).

Anyhow, I copied everything over to Colab, and ran python autogen.py add_example ... with TPU runtime. Generated files have been added.
I set epochs=1 for this.

model.fit(
    x_train,
    y_train,
    epochs=1,
fchollet (Member):

Please add a comment here that the recommended number of epochs is 3, not 1. You don't need to regenerate the files; you can just edit the ipynb and md files directly to add the comment.
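
Something along these lines, for instance (the comment wording and the batch size are illustrative):

model.fit(
    x_train,
    y_train,
    # The recommended number of epochs is 3; 1 is used here only to keep the
    # generated run short.
    epochs=1,
    batch_size=64,  # Illustrative value.
)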

@fchollet (Member) left a comment

LGTM, thank you! This is a very nice script, it will be valuable.

@fchollet fchollet merged commit 3eedad1 into keras-team:master May 24, 2020