Speed top-p sampler up by only sampling from top-k tokens #980
Conversation
Tested on CPU with 25 iterations: after the fix it takes 37s, compared to 55s before.
/gcbrun
Awesome! Dropped some comments.
keras_nlp/samplers/top_p_sampler.py
Outdated
Args:
    p: float, the `p` value of top-p.
    sample_from_top_k: int, defaults to None. If set, only sample from top
I would just call this `k` honestly (parallel with the same option in top-k), and in the description say that it is a heuristic cutoff point.
My thought is that having both `p` and `k` is a bit confusing to readers; here `p` is the main arg while `k` plays a supporting role, so using the `p` and `k` combo kind of suggests they are parallel to each other. No strong opinion here, and I agree `sample_from_top_k` isn't a great name.
)
self.assertEqual(self.join_as_string(output), ["sequentzzzzz"])

def test_sample_from_all_tokens(self): |
I don't think this test adds too much. If we want to add a test here, it should test that our cutoff point is working, e.g. `p=1.0`, `k=5`, uniform logits, assert that all outputs are in the first five vocab options.
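A rough sketch of that check (standalone, not the actual KerasNLP test utilities; the helper logic and names here are illustrative only):

```python
import tensorflow as tf

# Illustrative only: uniform logits, p=1.0, k=5. The top-p cutoff alone would
# admit every token, so any sampled id >= 5 would mean the k cutoff failed.
vocab_size, batch_size = 10, 64
logits = tf.zeros([batch_size, vocab_size])  # uniform distribution
top_k_logits, top_k_indices = tf.math.top_k(logits, k=5)
sampled = tf.random.categorical(top_k_logits, num_samples=1, seed=42)
token_ids = tf.gather(top_k_indices, sampled, batch_dims=1)
assert bool(tf.reduce_all(token_ids < 5))
```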
This test case is just a sanity check that the sampler call with `k=None` works. Agree the proposed test works better! Changed.
/gcbrun
Looks good! Think we should be more detailed in the docstring for this one.
keras_nlp/samplers/top_p_sampler.py
Outdated
Args:
    p: float, the `p` value of top-p.
    k: int, defaults to None. If set, only sample from top `k` tokens.
... If set, this argument defines a heuristic "top-k" cutoff applied before the "top-p" sampling. All logits not in the top `k` will be discarded, and the remaining logits will be sorted to find a cutoff point for `p`. Setting this argument can significantly speed up sampling by reducing the size of the sort.
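For readers following along, here is a rough sketch of what that heuristic cutoff amounts to. This is not the PR's actual implementation; the function name, defaults, and mask convention are made up, and it assumes a dense `[batch, vocab_size]` logits tensor. The speedup comes from running the sort and cumulative sum over `k` entries instead of the full vocabulary.

```python
import tensorflow as tf

def top_p_sample_with_k_cutoff(logits, p=0.9, k=10, seed=None):
    # Keep only the `k` largest logits. `tf.math.top_k` returns values already
    # sorted in descending order, so the top-p cutoff below operates on a
    # tensor of width `k` rather than the full vocabulary.
    top_k_logits, top_k_indices = tf.math.top_k(logits, k=k, sorted=True)
    probs = tf.nn.softmax(top_k_logits, axis=-1)
    cumulative_probs = tf.cumsum(probs, axis=-1)
    # Keep tokens whose cumulative probability is still within `p`, always
    # retaining at least the single most likely token.
    keep_mask = cumulative_probs <= p
    keep_mask = tf.concat(
        [tf.ones_like(keep_mask[..., :1]), keep_mask[..., 1:]], axis=-1
    )
    filtered_logits = tf.where(keep_mask, top_k_logits, float("-inf"))
    # Sample within the reduced distribution, then map back to vocab ids.
    sampled = tf.random.categorical(filtered_logits, num_samples=1, seed=seed)
    return tf.gather(top_k_indices, sampled, batch_dims=1)
```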
thanks!
Resolves #963