top p search and testing #233
Conversation
Thanks! Mainly looks good, left some comments.
keras_nlp/utils/text_generation.py
Outdated
pred = tf.keras.activations.softmax(pred, axis=-1)
# Sort preds in descending order.
sorted_preds, sorted_indices = tf.math.top_k(
    pred, k=pred.shape[-1], sorted=True
)
Let's be consistent with `-1` or `1` here.
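For context, on a rank-2 prediction tensor `axis=-1` and `axis=1` address the same vocabulary axis, so either spelling works here; a minimal sketch of the equivalence (example shapes assumed, not from the PR):

import tensorflow as tf

# For rank-2 inputs the last axis is axis 1, so both calls agree;
# the review only asks to spell it one way consistently.
pred = tf.random.uniform((2, 5))
a = tf.keras.activations.softmax(pred, axis=-1)
b = tf.keras.activations.softmax(pred, axis=1)
tf.debugging.assert_near(a, b)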
edited
keras_nlp/utils/text_generation.py
Outdated
        replaced with `pad_token_id`.
    pad_token_id: int, defaults to 0. The pad token after `end_token_id`
        is received.
    filter_value: float, defaults to -Inf. The value for filtering out
Do we want to allow customization of `filter_value`? The code uses `filter_value` to set certain tokens to have probability 0?
I don't think we need to allow customization; we can just mask out unused tokens by setting `filter_value = 0`.
Yea, in that case, let's remove this argument.
keras_nlp/utils/text_generation.py
Outdated
)
# Filter out unmasked tokens and sample from filtered distribution.
probs = tf.where(
    shifted_keep_mask, sorted_preds, tf.fill(pred.shape, filter_value)
)
Should `filter_value` always be 0? I saw in the docstring it defaults to -Inf.
Yep, `filter_value` should be 0. Edited.
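A minimal sketch of why `filter_value = 0` suffices (the example values are illustrative, not from the PR): `tf.random.categorical` expects log-probabilities and normalizes internally, so tokens masked to probability 0 have log-probability -inf and can never be drawn.

import tensorflow as tf

# Zero out tokens outside the nucleus; log(0) = -inf means
# tf.random.categorical will never sample them.
sorted_preds = tf.constant([[0.4, 0.3, 0.2, 0.1]])
shifted_keep_mask = tf.constant([[True, True, False, False]])
probs = tf.where(shifted_keep_mask, sorted_preds, tf.zeros_like(sorted_preds))
next_token = tf.random.categorical(tf.math.log(probs), num_samples=1)
# next_token is always 0 or 1 in this example.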
self.assertAllEqual(output_logit, output_probs)


class TopPSamplingTextGenerationTest(tf.test.TestCase):
Let's keep the test name and method name the same: `TopPSearchTextGenerationTest`.
Edited this and the other test names.
Only a few minor comments!
prob = tf.constant([[0.4, 0.3, 0.2, 0.1]])
return tf.repeat(prob, batch_size, axis=0)

# Test that it only samples from tokens that sum up to p
nit: period at end.
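With the fixture above, the assertion is straightforward: given probabilities [0.4, 0.3, 0.2, 0.1] and p=0.7, only token ids 0 and 1 fall inside the nucleus, so generated ids should stay in that set. A hypothetical sketch (the `top_p_search` call and its arguments are assumed from the PR title, not quoted):

import tensorflow as tf

def token_probability_fn(inputs):
    # Fixed distribution: ids 2 and 3 lie outside the p=0.7 nucleus.
    prob = tf.constant([[0.4, 0.3, 0.2, 0.1]])
    return tf.repeat(prob, inputs.shape[0], axis=0)

# Inside the TestCase, something like:
#     outputs = top_p_search(token_probability_fn, prompt, max_length=10, p=0.7)
#     self.assertAllInSet(outputs, [0, 1])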
This looks great! Nice work.
Just a few minor comments.
keras_nlp/utils/text_generation.py
Outdated
)
if p <= 0 or p >= 1:
    raise ValueError(
        "p should be a float strictly between 0 and 1 (0 < p < 1)."
    )
Surround arg names in backticks (`p`), and also show what value we received, something like:

`p` should be in the range (0, 1). Received: `p={p}`.
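A sketch of validation matching that suggestion (the helper name `check_p` is illustrative, not from the PR):

def check_p(p):
    # Backtick the arg name and echo the received value in the message.
    if p <= 0 or p >= 1:
        raise ValueError(
            f"`p` should be in the range (0, 1). Received: `p={p}`."
        )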
keras_nlp/utils/text_generation.py
Outdated
""" | ||
Text generation utility based on top-p (nucleus) sampling. | ||
Top-p search filters the top token with probabilities that sum up to p, and |
Maybe we can explain this a little more clearly, and also remember to backtick argument names so it's clear what they are:

Top-p search selects tokens from the smallest subset of output probabilities that sum to greater than `p`. Put another way, top-p will first order token predictions by likelihood, and ignore all tokens after the cumulative probability of selected tokens exceeds `p`.
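A minimal sketch of that selection rule, reusing the mask names from the quoted diff (the concrete numbers are illustrative):

import tensorflow as tf

p = 0.6
pred = tf.constant([[0.4, 0.3, 0.2, 0.1]])
# Sort descending, then keep the smallest prefix whose cumulative
# probability exceeds `p`.
sorted_preds, sorted_indices = tf.math.top_k(pred, k=pred.shape[-1], sorted=True)
cumulative_probs = tf.math.cumsum(sorted_preds, axis=-1)  # [0.4, 0.7, 0.9, 1.0]
keep_mask = cumulative_probs <= p  # [True, False, False, False]
# Shift right by one so the token that pushes the sum past `p` is kept.
shifted_keep_mask = tf.concat(
    [tf.ones_like(keep_mask[:, :1]), keep_mask[:, :-1]], axis=-1
)  # [True, True, False, False]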
prompt = tf.fill((BATCH_SIZE, 1), START_ID)
# Print the generated sequence (token ids).
typo?
fixed