Okay, took a bit of time, but I managed to find the root cause. The tail end of the UTF-8 decoding of the tokens with the Hugging Face tokenizer can actually change as more tokens are added (the decoded string can even get shorter!), leading to panics when the text that has already been decoded no longer matches what the new decoding produces.
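A minimal, self-contained sketch of the failure mode (not the llm crate's actual code, and the token byte sequences are hypothetical): when a multi-byte character is split across tokens, decoding the partial sequence yields a replacement character, and once the remaining bytes arrive the decoded string changes at its tail and can even become shorter, so naively slicing off the previously decoded prefix panics.

```rust
// Stand-in for the tokenizer's decode: concatenate raw token bytes and
// replace incomplete/invalid UTF-8 with U+FFFD (like a lossy decode).
fn decode(token_bytes: &[&[u8]]) -> String {
    let all: Vec<u8> = token_bytes.iter().flat_map(|t| t.iter().copied()).collect();
    String::from_utf8_lossy(&all).into_owned()
}

fn main() {
    // "é" is the two bytes 0xC3 0xA9; suppose the tokenizer splits them
    // across two tokens (hypothetical tokens, for illustration only).
    let tokens: Vec<&[u8]> = vec![b"caf", &[0xC3], &[0xA9]];

    let mut previous = String::new();
    for n in 1..=tokens.len() {
        let current = decode(&tokens[..n]);
        // Naive streaming assumes `previous` is a prefix of `current`.
        // After feeding 0xC3, `current` is "caf\u{FFFD}" (6 bytes); after
        // feeding 0xA9 it becomes "café" (5 bytes), so `previous.len()` now
        // exceeds `current.len()` and this slice panics.
        let new_text = &current[previous.len()..];
        println!("step {n}: decoded={current:?}, new={new_text:?}");
        previous = current;
    }
}
```

Running this panics on the last step with an out-of-bounds byte index, which mirrors the "already decoded text no longer matches the new decoding" behaviour described above.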
When presenting / feeding certain tokens I'm getting:
Steps to reproduce:
Feed the following string using `feed_prompt` to any model that has LLaMA-like tokens.
Commit: latest master (e52a102)