Python: Add support for using other tokenizers #6217

Open
TheMrguiller opened this issue May 13, 2024 · 0 comments
Labels
enhancement · .NET (Issue or Pull requests regarding .NET code) · python (Pull requests for the Python Semantic Kernel)

Comments

@TheMrguiller

Hi,

As part of developing a service-oriented prompt, I find it nearly mandatory to have a way to count tokens before sending them to the model. Knowing the token count of the input and the prompt lets us trim the input to fit the maximum token limit of the API or local model, and lets the code handle predictable limit errors before they occur. I know that in Python the Hugging Face tokenizer exists, but a similar solution seems to be lacking in C#.

Thanks for your time.
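
For illustration, the pre-flight check described above could look roughly like the sketch below. It is not Semantic Kernel API: `TokenCounter`, `whitespace_count`, and `fit_input_to_limit` are hypothetical names, and the whitespace counter is only a stand-in for a real tokenizer, so its counts will differ from what a model actually sees.

```python
from typing import Callable

# Hypothetical interface: any callable that returns the token count of a string.
# A real tokenizer (Hugging Face or otherwise) could be wrapped to fit this shape.
TokenCounter = Callable[[str], int]


def whitespace_count(text: str) -> int:
    """Stand-in counter (assumption): one token per whitespace-separated word."""
    return len(text.split())


def fit_input_to_limit(
    prompt: str,
    user_input: str,
    max_tokens: int,
    count: TokenCounter,
) -> str:
    """Trim user_input so that prompt + user_input stays within max_tokens."""
    # Tokens left for the input once the prompt is accounted for.
    budget = max_tokens - count(prompt)
    if budget <= 0:
        raise ValueError("The prompt alone reaches or exceeds the token limit")

    # Nothing to do when the input already fits.
    if count(user_input) <= budget:
        return user_input

    # Drop trailing words and re-count until the input fits the budget.
    words = user_input.split()
    while words and count(" ".join(words)) > budget:
        words.pop()
    return " ".join(words)


if __name__ == "__main__":
    trimmed = fit_input_to_limit(
        prompt="Summarize the following text:",
        user_input="one two three four five six",
        max_tokens=7,
        count=whitespace_count,
    )
    print(trimmed)
```

Passing the counter in as a callable is one way to leave the choice of tokenizer open, which is what this issue asks for: the trimming logic stays the same whichever tokenizer supplies the count.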

@markwallace-microsoft added the .NET, python, and triage labels May 13, 2024
@github-actions github-actions bot changed the title Add support for using other tokenizers .Net: Add support for using other tokenizers May 13, 2024
@github-actions github-actions bot changed the title .Net: Add support for using other tokenizers Python: Add support for using other tokenizers May 13, 2024