-
I'm using bartowski's 4-bit quantization of DeepSeek-R1-Distill-Llama-8B-GGUF (https://huggingface.co/bartowski/DeepSeek-R1-Distill-Llama-8B-GGUF/resolve/main/DeepSeek-R1-Distill-Llama-8B-Q4_K_M.gguf?download=true). When I run it through llama.cpp directly (using llama-simple-chat), it emits the initial `<think>` token fine. But when I run it through node-llama-cpp (using the general chat wrapper, since the parser crashes trying to extract the Jinja2 template when using the auto wrapper), it doesn't (i.e. the output starts directly inside the chain of thought, even though it still emits the closing `</think>` token). Has anyone experienced this?
Edit: My bad, should have just tried the obvious thing first. Updating
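For reference, here's roughly how I'm setting it up — a minimal sketch assuming node-llama-cpp v3's `getLlama` / `LlamaChatSession` / `GeneralChatWrapper` API; the model path and prompt are placeholders for my actual setup:

```ts
import path from "path";
import {getLlama, LlamaChatSession, GeneralChatWrapper} from "node-llama-cpp";

// Load the Q4_K_M quant downloaded from the URL above (path is a placeholder)
const llama = await getLlama();
const model = await llama.loadModel({
    modelPath: path.join(process.cwd(), "DeepSeek-R1-Distill-Llama-8B-Q4_K_M.gguf")
});

const context = await model.createContext();
const session = new LlamaChatSession({
    contextSequence: context.getSequence(),
    // Forcing the general wrapper because resolving the Jinja template crashes with the auto wrapper
    chatWrapper: new GeneralChatWrapper()
});

const answer = await session.prompt("What is 17 * 23?");
// The response begins mid-chain-of-thought: no opening <think> tag, but </think> still appears
console.log(answer);
```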
-
I've created a PR on `@huggingface/jinja` to address this exact issue. I'll release a new version of `node-llama-cpp` in the next few hours with various fixes and improvements for DeepSeek, including the updated `@huggingface/jinja` version. First-class support for DeepSeek and chain of thought will come in the next week or so.