Add preset for finetuning GPT2 on CNN news #807

chenmoneygithub · 2023-03-07T06:37:54Z

With finetuning, the generated text stops properly at the end token, and the writing quality is better.

To play with the preset, install this branch:

!pip install -q -U git+https://github.com/chenmoneygithub/keras-nlp.git@add-gpt2-news

Then run the code below:

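import keras_nlp

# Load the GPT2 causal LM from the finetuned preset added in this PR.
gpt2_lm = keras_nlp.models.GPT2CausalLM.from_preset("gpt2_base_en_cnn_dailymail")
# Generate a continuation for each prompt with top-k sampling,
# capped at 100 tokens per sequence.
gpt2_lm.generate(
    ["that's weird", "that's even weirder"],
    sampler="top_k",
    max_length=100,
)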

mattdangerw

Nice, this seems useful to have. Just a few comments.

mattdangerw · 2023-03-08T20:23:48Z

keras_nlp/models/gpt2/gpt2_presets.py

        "merges_url": "https://storage.googleapis.com/keras-nlp/models/gpt2_extra_large_en/v1/merges.txt",
        "merges_hash": "75a37753dd7a28a2c5df80c28bf06e4e",
    },
+    "gpt2_base_en_news": {


I think the most similar checkpoint we have here is the BERT one finetuned on the sst2 dataset. So maybe we should stick to the more precise naming and call this gpt2_base_en_cnn_dailymail?

mattdangerw

Approving! Let's update our file paths before we merge.

mattdangerw · 2023-03-09T02:20:55Z

keras_nlp/models/gpt2/gpt2_presets.py

+            "max_sequence_length": 1024,
+        },
+        "preprocessor_config": {},
+        "weights_url": "https://storage.googleapis.com/keras-nlp/models/gpt2_base_en_news/model.h5",


Update URLs to the new path before we merge this. Also remember to add a /v1/ to the path as above, so we can update our checkpoints in the future.
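As a rough illustration of the path layout requested here, the sketch below builds a versioned asset URL following the `<preset>/v1/<file>` pattern visible in the `merges_url` above. It assumes the renamed preset is stored under its new name; the helper `preset_asset_url` and the constant `BASE_URL` are hypothetical names for this example and are not part of keras-nlp.

```python
# Hypothetical helper for illustration only; not part of keras-nlp.
BASE_URL = "https://storage.googleapis.com/keras-nlp/models"


def preset_asset_url(preset_name, filename, version="v1"):
    """Return `<BASE_URL>/<preset_name>/<version>/<filename>`."""
    return f"{BASE_URL}/{preset_name}/{version}/{filename}"


# The unversioned URL from the diff above, for comparison.
old_url = f"{BASE_URL}/gpt2_base_en_news/model.h5"

# Renamed preset plus a version segment (an assumption about the final path).
new_url = preset_asset_url("gpt2_base_en_cnn_dailymail", "model.h5")
print(new_url)
```

Keeping the version as its own path segment means a later checkpoint could be published under a new segment without overwriting the files that existing presets point to.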

Add preset for finetuning GPT2 on CNN news

840cf0c

chenmoneygithub requested a review from mattdangerw March 7, 2023 06:38

mattdangerw approved these changes Mar 8, 2023

style fix

b19f56c

mattdangerw approved these changes Mar 9, 2023

path change

17701be

chenmoneygithub merged commit a513f03 into keras-team:master Mar 9, 2023
