
[FEAT]: Support Ktransformers as LLM #3363

Open
WaterYue opened this issue Feb 27, 2025 · 7 comments
Labels
enhancement New feature or request feature request Integration Request Request for support of a new LLM, Embedder, or Vector database

Comments

@WaterYue

What would you like to see?

When will AnythingLLM support KTransformers? KTransformers lets the AI use less hardware while still producing high-quality answers!

Here is the link:
https://github.com/kvcache-ai/ktransformers

@WaterYue WaterYue added enhancement New feature or request feature request labels Feb 27, 2025
@timothycarambat
Member

If this is just another inference service that is OpenAI-compatible, you can use its base URL and connection information with the Generic OpenAI connector for the LLM, and it will work the same.
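For context, "OpenAI-compatible" means the server accepts the standard `/v1/chat/completions` request shape, which is all the Generic OpenAI connector needs. A minimal sketch of that payload is below; the base URL and model name are hypothetical placeholders, since the actual host, port, and model depend on how the server is launched.

```python
import json

# Hypothetical base URL for a locally running OpenAI-compatible server;
# the real host/port depend on your setup.
BASE_URL = "http://localhost:10002/v1"

def build_chat_request(model: str, prompt: str) -> dict:
    """Build an OpenAI-compatible /chat/completions request body.
    Any server that accepts this payload shape can, in principle,
    be used through AnythingLLM's Generic OpenAI connector."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": True,  # AnythingLLM streams responses
    }

body = build_chat_request("deepseek-r1", "Hello")
print(json.dumps(body))
```

The connector then only needs the base URL and, if required, an API key; the request/response shapes follow the OpenAI spec.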

@timothycarambat timothycarambat added the Integration Request Request for support of a new LLM, Embedder, or Vector database label Feb 27, 2025
@jiazhen-code

I am currently trying to run KTransformer (Deepseek-R1) + Generic OpenAI Connector with AnythingLLM on an Apple M1 computer. While the server shows normal responses, the client keeps getting stuck at the last returned character until I manually click the pause button, after which the final character is displayed.

@timothycarambat
Member

Sounds like KTransformers does not return a stop reason - thus why the connection stream does not close. Can you confirm if the last chunk from KTransformer returns a stop reason?

@jiazhen-code

[Image: screenshot of a response chunk showing a stop reason]

There is a stop reason like this.

@jiazhen-code

I'm sure it's because there is no finish_reason in the last chunk. I fixed it simply by adding the reason to the last chunk, as follows (in ktransformers/server/schemas/assistants/streaming.py):

[Image: screenshot of the patch adding finish_reason to the final chunk]
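The fix described above can be sketched as follows. This is an illustration of the OpenAI streaming-chunk shape, not the actual KTransformers patch: the final chunk must carry a non-empty `finish_reason` (e.g. `"stop"`) so clients like AnythingLLM know the response is complete, while intermediate chunks leave it null.

```python
import json

def make_chunk(delta: str, *, last: bool = False) -> dict:
    """Illustrative OpenAI-style streaming chunk. Intermediate chunks
    have finish_reason = None; the final chunk sets it to "stop" so
    the client can close the stream."""
    return {
        "object": "chat.completion.chunk",
        "choices": [{
            "index": 0,
            "delta": {"content": delta},
            "finish_reason": "stop" if last else None,
        }],
    }

final = make_chunk("", last=True)
print(json.dumps(final))
```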

@zhugh2333

Does it work?

@timothycarambat timothycarambat changed the title [FEAT]: When will AnythingLLM support Ktransformers [FEAT]: Support Ktransformers as LLM Mar 3, 2025
@timothycarambat
Member

That should work. finish_reason is different from [DONE]: [DONE] marks the completion of the server-sent event stream, while finish_reason marks the conclusion of the response itself.

```javascript
if (
  message?.hasOwnProperty("finish_reason") && // Got valid message and it is an object with finish_reason
  message.finish_reason !== "" &&
  message.finish_reason !== null
) {
```
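To show why a missing finish_reason leaves the client stuck, here is a Python rendition of that check (an illustration, not AnythingLLM's actual code): the stream only closes when a chunk carries a non-empty, non-null finish_reason.

```python
def stream_is_finished(message: dict) -> bool:
    """Python rendition of the check quoted above: the client closes
    the stream only when the chunk has a non-empty, non-null
    finish_reason."""
    return (
        "finish_reason" in message
        and message["finish_reason"] not in ("", None)
    )

# A chunk without finish_reason never closes the stream...
assert not stream_is_finished({"delta": {"content": "hi"}})
# ...nor does one where it is null,
assert not stream_is_finished({"finish_reason": None})
# but a final chunk carrying "stop" does.
assert stream_is_finished({"finish_reason": "stop"})
```

This is why a server that omits the field on its last chunk appears to hang until the user manually stops generation.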
