
Feature max tokens config #36

Merged
merged 4 commits into from Mar 15, 2024

Conversation

QAbot-zh
Contributor

Since WeChat replies have a length limit, this change adds a `maxOutput` environment variable to cap the model's maximum output. If the variable is not set at deployment time, behaviour is identical to the original program.

The screenshot below shows the extreme setting `maxOutput=50`; in practice a value of 500–1000 is more reasonable (1 Chinese character ≈ 2 tokens). A well-chosen value both constrains the model's output length and reduces the rate of refused replies. If a reply is still truncated, you can prompt the model with "continue" to get the rest:

[screenshot: truncated reply with maxOutput=50]
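The fallback behaviour described above (unset variable means no cap, i.e. the original behaviour) could be sketched as follows. This is a minimal illustration, not the PR's actual code; only the variable name `maxOutput` comes from the PR, and the helper name is hypothetical:

```python
import os

def max_output_tokens(default: int = 0) -> int:
    """Read the maxOutput environment variable.

    Returns `default` (0 = no cap, preserving the original program's
    behaviour) when the variable is unset, non-numeric, or not positive.
    """
    raw = os.environ.get("maxOutput", "")
    try:
        n = int(raw)
    except ValueError:
        return default
    return n if n > 0 else default

if __name__ == "__main__":
    os.environ["maxOutput"] = "500"
    print(max_output_tokens())  # 500
```

The returned value would then be passed as the model's max-tokens parameter in the chat request; a value of 0 would mean the parameter is omitted entirely.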

@pwh-pwh pwh-pwh merged commit 85757b3 into pwh-pwh:master Mar 15, 2024
1 check failed