Port models to core #1119
Conversation
ianstenbit
left a comment
Some minor questions for you, but looks good!
Way to go Matt 🎊 🥳 ❗
```python
        self.backbone.get_layer("token_embedding").embeddings,
        transpose_b=True,
    )
logits = self.get_layer("reverse_embedding")(x)
```
Just curious, as I see this is done throughout -- what's the purpose of using self.get_layer instead of storing the layer as a member on the object directly, e.g. self.reverse_embedding?
Because these models are functional, the layers actually aren't available as self.layer or self.backbone.layer accessors.
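A minimal sketch of what this means, assuming plain Keras (the toy architecture below is invented, not the PR's actual model): in a functional model, layers built inline are tracked by the graph rather than stored as attributes, so a name-based lookup via `get_layer` is the natural accessor.

```python
import keras

# Layer names mirror the snippet above; the architecture itself is invented.
inputs = keras.Input(shape=(8,), dtype="int32", name="tokens")
x = keras.layers.Embedding(100, 16, name="token_embedding")(inputs)
outputs = keras.layers.Dense(4, name="reverse_embedding")(x)
model = keras.Model(inputs, outputs)

# The embedding layer was never assigned to an attribute, so there is no
# `model.token_embedding`; the functional graph tracks it by name instead.
embedding = model.get_layer("token_embedding")
```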
```python
)


@pytest.mark.large  # Saving is slow, so mark these large.
@pytest.mark.tf_only
```
Is it because of the tokenizer that this is TF only? If so, can we potentially make the test work on other backends?
(Same throughout the PR)
This is because string model inputs are only supported in tf. So a model that includes tokenization inside call really only makes sense in tf.
I am also open to just deleting these tests. I think saving is important for backbone models and tasks; for tokenizers, saving directly like this matters less.
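For context, here is a hedged sketch of the kind of tf-only saving test being discussed, assuming a WordPiece tokenizer and a toy vocabulary; this is illustrative, not the PR's actual test code. The string-typed model input is what ties it to the TF backend.

```python
import pytest
import tensorflow as tf
import keras_nlp


@pytest.mark.large  # Saving is slow, so mark these large.
@pytest.mark.tf_only  # tf.string model inputs only exist on the TF backend.
def test_saved_model(tmp_path):
    tokenizer = keras_nlp.tokenizers.WordPieceTokenizer(
        vocabulary=["[UNK]", "the", "quick", "brown", "fox"],
        sequence_length=8,
    )
    # Wrapping the tokenizer in a model with a string input is the part
    # that only works in TensorFlow.
    inputs = tf.keras.Input(shape=(), dtype=tf.string)
    model = tf.keras.Model(inputs, tokenizer(inputs))

    path = str(tmp_path / "model.keras")
    model.save(path)
    restored = tf.keras.models.load_model(path)

    sentence = tf.constant(["the quick brown fox"])
    tf.debugging.assert_equal(model(sentence), restored(sentence))
```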
We could potentially rewrite the test with preprocessor=False to make it generic
Oh this is actually a test for just the tokenizer. So preprocessor=False would not apply here.
fchollet
left a comment
LGTM, thanks!
jbischof
left a comment
Thanks!
Add all model variables to the jax state mapping: we want to avoid a bug when the call sequence looks like model.generate(), model.fit(), model.generate(). In this case we need to be careful not to pull in the cached variable state at generation compile time.
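To make the failure mode concrete, here is a hedged sketch of the call ordering being guarded against. The preset name is illustrative, and compile/optimizer details are elided; any KerasNLP generative task would show the same pattern.

```python
import tensorflow as tf
import keras_nlp

# Hypothetical preset choice; compile defaults assumed for brevity.
causal_lm = keras_nlp.models.GPT2CausalLM.from_preset("gpt2_base_en")
train_ds = tf.data.Dataset.from_tensor_slices(
    ["the quick brown fox jumped over the lazy dog"]
).batch(1)

causal_lm.generate("the quick brown")  # first call compiles a generate function
causal_lm.fit(train_ds)                # fit() updates the model variables
causal_lm.generate("the quick brown")  # must read the updated weights, not
                                       # variable state cached when generate
                                       # was first compiled
```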
* Port models to core
* Proper seed generation for jax
* Don't test metrics yet (for a separate PR)
* Add all model variables to the jax state mapping
* Address Ian's comments
* Add TODOs for reverse embedding
* Run pytest on the entirety of keras-nlp
* Misc cleanups
* Mark docstring tests tf only
* Last failing doctest
🚧 This is an experimental feature branch, more details soon. 🚧
I'm temporarily basing this on the `preprocessing` branch so we can start review.