Local backends fix #18

Gnurro · 2023-11-14T17:46:55Z

Fix to properly cull prompt from model output.
The old code uses tokenizer.apply_chat_template(messages, tokenize=False to get the prompt with chat formatting as string - but this method call does not return the actual decoded result, but one that is missing whitespaces, thus preventing string match for replacement. Decoding the actual tokenized prompt instead solves this issue.

…mpt tokens for proper model output culling

…ackends_fix

Gnurro · 2023-11-14T19:46:14Z

As a note: The issue occurs with the Llama2-chat tokenizer - it does not with the ~10 other tokenizers I've tested this with, hence the oversight.
Changed it for the HF backend as well to prevent this occuring if a model/tokenizer added in the future has the same issue.

…del output

Gnurro added 4 commits November 14, 2023 18:41

Replace usage of buggy tokenizer method argument with decoding of pro…

a6f8916

…mpt tokens for proper model output culling

Merge branch 'clp-research:main' into local_backends_fix

3bf05d4

Applying decoding changes for non-chat as well

79578f0

Merge remote-tracking branch 'origin/local_backends_fix' into local_b…

bf7cfb5

…ackends_fix

Gnurro added 7 commits November 15, 2023 13:44

Added flattening of assistant/assistant message pairs

c559726

Add removal of llama2 EOS token </s> at the end of model outputs

016f2c1

Add deepcopy of input messages to prevent reference issues

bdbc71a

Merge branch 'clp-research:main' into local_backends_fix

6980ce7

Change returned response dict to proper format containing complete mo…

a9cd314

…del output

Merge branch 'clp-research:main' into local_backends_fix

96e5cea

Merge branch 'clp-research:main' into local_backends_fix

303becb

sherzod-hakimov approved these changes Nov 27, 2023

View reviewed changes

sherzod-hakimov merged commit 36ef804 into clp-research:main Nov 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Local backends fix #18

Local backends fix #18

Gnurro commented Nov 14, 2023

Gnurro commented Nov 14, 2023 •

edited

Loading

Local backends fix #18

Local backends fix #18

Conversation

Gnurro commented Nov 14, 2023

Gnurro commented Nov 14, 2023 • edited Loading

Gnurro commented Nov 14, 2023 •

edited

Loading