How to control chatbot（not chatflow） context length? #31835

mysuochao2-boop · 2026-02-02T10:25:27Z

mysuochao2-boop
Feb 2, 2026

Self Checks

I have searched for existing issues search for existing issues, including closed ones.
I confirm that I am using English to submit this report (我已阅读并同意 Language Policy).
[FOR CHINESE USERS] 请务必使用英文提交 Issue，否则会被关闭。谢谢！:)
Please do not modify this template :) and fill in all the required fields.

Content

I use chatbot to build the chat agent, LLM uses deepseek-r1, and an error is reported after multiple rounds of dialogue. May I ask which parameter should I adjust to control the length of historical dialogue context?
The error is as follows:
[deepseek] Error: req_id: 26df2c6d4a PluginInvokeError: {"args":{"description":"[models] Error: API request failed with status code 400: {"error":{"message":"This model's maximum context length is 131072 tokens. However, you requested 131930 tokens (131930 in the messages, 0 in the completion). Please reduce the length of the messages or completion.","type":"invalid_request_error","param":null,"code":"invalid_request_error"}}"},"error_type":"InvokeError","message":"[models] Error: API request failed with status code 400: {"error":{"message":"This model's maximum context length is 131072 tokens. However, you requested 131930 tokens (131930 in the messages, 0 in the completion). Please reduce the length of the messages or completion.","type":"invalid_request_error","param":null,"code":"invalid_request_error"}}"}

@mysuochao2-boop · 2026-02-02T10:31:05Z

dosubot[bot]
Bot Feb 2, 2026

Hi @mysuochao2-boop! I'm Dosu and I’m helping the dify team.

To control the chatbot (not chatflow) context length in Dify, you need to adjust how much conversation history is included in each request to the LLM. Dify uses a TokenBufferMemory mechanism that prunes old messages to fit within a token limit. By default, this limit is set to 2000 tokens per request, but it is not user-configurable via the UI or environment variables—it is set in the codebase itself. If you want to change this, you need to modify the max_token_limit parameter in the TokenBufferMemory logic in the code. This will control how much historical dialogue is sent to the model and help you avoid exceeding the DeepSeek-R1 context window of 131072 tokens. For reference, see the relevant code and logic here.

There is currently no user-facing setting for this in chatbot mode, and attempts to make the max dialogue history length configurable have not been merged into the project. The default for the Agent Application chatbot is 10 dialogue turns, and for workflow mode, it is 50, but these are also not user-configurable at this time source.

If you are hitting the model's context limit, you can reduce the context length by modifying the code to lower the max_token_limit or message_limit in TokenBufferMemory. This will ensure that only the most recent messages are included in the prompt and prevent the error you encountered.

To reply, just mention @dosu.

Share context across your team and agents. Try Dosu.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to control chatbot（not chatflow） context length? #31835

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Select a reply

Uh oh!

How to control chatbot（not chatflow） context length? #31835

Uh oh!

mysuochao2-boop Feb 2, 2026

Self Checks

Content

Replies: 1 comment

Uh oh!

Uh oh!

dosubot[bot] Bot Feb 2, 2026

mysuochao2-boop
Feb 2, 2026

dosubot[bot]
Bot Feb 2, 2026