chore(mlx-lm): fix the top_p implementation. #602

mzbac · 2024-03-21T13:52:21Z

The top_p is supposed to sort prob in ascending order and select the cumulative token prob above the threshold. The current implementation works because we have a bug in reversing the sorted prob, which accidentally makes sorted prob in the correct order. So, it's better to fix the implementation to reduce the confusion.

llms/mlx_lm/utils.py

awni · 2024-03-21T14:14:43Z

Btw @mzbac looks like the CI issue is resolved for you, thanks!!

mzbac · 2024-03-21T15:11:50Z

@awni I have added the unit test, so it is ready for review again. Please let me know if there is anything else that needs to be updated.

llms/mlx_lm/sample_utils.py

llms/tests/test_sample_utils.py

awni · 2024-03-21T16:04:57Z

I'm not sure about introducing a separate file for that. Do you think it's necessary? Is the idea that we will add more samplers?

mzbac · 2024-03-21T16:14:25Z

The main reason for putting it into a separate file is the current utils.py has become too large, making it difficult to maintain. So, I think we may need to start splitting up the utils.py file. sampling utils might be a good starting point. Sampling utils would include top_p, repetitive penalty and any future sampling methods that we want to support.

awni · 2024-03-21T19:02:17Z

So, I think we may need to start splitting up the utils.py file.

Agreed.

Sampling utils would include top_p, repetitive penalty and any future sampling methods that we want to support.

I'm not wild about the name sampling_utils but I think collecting that related functionality in a new file makes sense. Let's go with it for now but I may change the file name in the future if I can think of a better one 😄

awni

🙏 thanks for fixing that and for the tests!

mzbac added 2 commits March 22, 2024 00:30

chore(mlx-lm): clean up the top p imp

06815e3

chore: clean up

6f8115c

awni reviewed Mar 21, 2024

View reviewed changes

llms/mlx_lm/utils.py Outdated Show resolved Hide resolved

mzbac commented Mar 21, 2024

View reviewed changes

llms/mlx_lm/utils.py Outdated Show resolved Hide resolved

awni reviewed Mar 21, 2024

View reviewed changes

llms/mlx_lm/utils.py Outdated Show resolved Hide resolved

chore: add test

10da1a7

angeloskath reviewed Mar 21, 2024

View reviewed changes

llms/mlx_lm/sample_utils.py Outdated Show resolved Hide resolved

llms/mlx_lm/sample_utils.py Outdated Show resolved Hide resolved

llms/tests/test_sample_utils.py Outdated Show resolved Hide resolved

mzbac added 3 commits March 22, 2024 02:41

chore: address comments

f2e71b1

chore: clean up docs string

1f5b341

chore: clean up test

4ddc32c

awni approved these changes Mar 21, 2024

View reviewed changes

awni merged commit fbed720 into ml-explore:main Mar 21, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore(mlx-lm): fix the top_p implementation. #602

chore(mlx-lm): fix the top_p implementation. #602

mzbac commented Mar 21, 2024

awni commented Mar 21, 2024

mzbac commented Mar 21, 2024

awni commented Mar 21, 2024

mzbac commented Mar 21, 2024

awni commented Mar 21, 2024

awni left a comment

chore(mlx-lm): fix the top_p implementation. #602

chore(mlx-lm): fix the top_p implementation. #602

Conversation

mzbac commented Mar 21, 2024

awni commented Mar 21, 2024

mzbac commented Mar 21, 2024

awni commented Mar 21, 2024

mzbac commented Mar 21, 2024

awni commented Mar 21, 2024

awni left a comment

Choose a reason for hiding this comment