R1 Distill not emitting initial <think> token. #422

Answered by giladgd
lzimm asked this question in Q&A

I've created a PR on @huggingface/jinja to address this exact issue.
I'll release a new version of node-llama-cpp in the next few hours with various fixes and improvements for DeepSeek, including the updated @huggingface/jinja version.

First-class support for DeepSeek and chain of thought will come in the next week or so.

Answer selected by giladgd