The max_tokens setting seems to behave abnormally #1060
Comments
This problem still exists. Have you solved it on your end?
It has been solved.
Could you share how you solved it? vLLM seems to have a problem.
Try handling it like this:
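A minimal sketch of one plausible fix, assuming the problem is that vLLM's max_model_len is left at the model-config default: pass it explicitly when launching the model. The endpoint, model name, size, and the kwargs-forwarding behavior below are assumptions, not confirmed by this thread.

```python
# Hypothetical sketch -- endpoint, model name, and size are placeholders.
from xinference.client import Client

client = Client("http://localhost:9997")  # default Xinference endpoint

# Assumption: extra keyword arguments to launch_model are forwarded to the
# vLLM engine, so max_model_len overrides the model-config default (4096).
model_uid = client.launch_model(
    model_name="qwen-chat",       # placeholder model
    model_format="pytorch",
    model_size_in_billions=72,    # placeholder size
    max_model_len=32000,          # raise the context window explicitly
)
```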
8x A800 GPU server
xinference version 0.9.0
vllm 0.3.0
The maximum input token length for the model was set to 32000 in the web UI.
At runtime, however, the loaded model still appears to use a 4096-token length, as shown in the screenshot.
As I recall, both max_seq_length and max_model_length must be set to 32000 for the maximum input token length to take effect. Could the developers please look into this?
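For context, a minimal sketch of the underlying vLLM behavior (the model path is a placeholder): when max_model_len is not passed, vLLM falls back to the length in the model's config file, which would explain the 4096 seen at runtime.

```python
# Sketch of the underlying vLLM behavior; the model path is a placeholder.
from vllm import LLM

llm = LLM(
    model="/path/to/model",    # placeholder local model path
    max_model_len=32000,       # explicit context-length override
    tensor_parallel_size=8,    # e.g. across the 8 A800 GPUs above
)
# Without max_model_len, vLLM derives the limit from the model's config
# (e.g. max_position_embeddings), which is commonly 4096.
```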