unkown token issue #373
Replies: 3 comments 4 replies
-
|
same issue here... |
Beta Was this translation helpful? Give feedback.
-
|
I have the same issue. I tried it a few times asking similar questions (related to the same topic) and I got that error. But when I change my question (an entirely different topic) I am not seeing it. Not sure whether does the prompting has anything to do with it or not, doing more testing at the moment. |
Beta Was this translation helpful? Give feedback.
-
|
When a query is processed, the first stage is looking up similar texts in the database you ingested. These documents can contain most anything. However, these are then used in the prompt submitted to the LLM you chose. Each LLM has to tokenize the information so it can be used with the training of the LLM. It looks like it is failing because one of the returned documents from the initial search contained characters of an unexpected type, perhaps in Cyrllic script. Or the entries are corrupt. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
I have set this up on a windows laptop and when running the model below where I'm stuck, I do not have an option to enter my query, any help will be appreciated
Enter a query:
gpt_tokenize: unknown token 'Γ'
gpt_tokenize: unknown token 'Ç'
gpt_tokenize: unknown token 'Ö'
gpt_tokenize: unknown token 'Γ'
gpt_tokenize: unknown token 'Ç'
gpt_tokenize: unknown token 'Ö'
gpt_tokenize: unknown token 'Γ'
gpt_tokenize: unknown token 'Ç'
gpt_tokenize: unknown token 'Ö'
gpt_tokenize: unknown token 'Γ'
gpt_tokenize: unknown token 'Ç'
gpt_tokenize: unknown token 'Ö'
Beta Was this translation helpful? Give feedback.
All reactions