Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

max_tokens设置似乎出现异常 #1060

Closed
ybsbbw opened this issue Feb 29, 2024 · 4 comments
Closed

max_tokens设置似乎出现异常 #1060

ybsbbw opened this issue Feb 29, 2024 · 4 comments
Milestone

Comments

@ybsbbw
Copy link

ybsbbw commented Feb 29, 2024

8卡A800服务器
xinference版本0.9.0
vllm0.3.0
web界面配置模型32000的最大输入token长度
image

但是实际运行时,发现似乎加载的模型还是4096长度的,如下图显示
image

我印象中必须max_seq_length和max_model_length都设置为32000,最大输入token长度才能正常完成设置,这里麻烦开发人员检查一下

@XprobeBot XprobeBot modified the milestones: v0.9.1, v0.9.2 Feb 29, 2024
@XprobeBot XprobeBot modified the milestones: v0.9.2, v0.9.3 Mar 8, 2024
@XprobeBot XprobeBot modified the milestones: v0.9.3, v0.9.4, v0.9.5 Mar 15, 2024
@XprobeBot XprobeBot modified the milestones: v0.10.0, v0.10.1 Mar 29, 2024
@XprobeBot XprobeBot modified the milestones: v0.10.1, v0.10.2 Apr 12, 2024
@XprobeBot XprobeBot modified the milestones: v0.10.2, v0.10.3, v0.11.0 Apr 19, 2024
@XprobeBot XprobeBot modified the milestones: v0.11.0, v0.11.1, v0.11.2 May 11, 2024
@XprobeBot XprobeBot modified the milestones: v0.11.2, v0.11.3 May 24, 2024
@XprobeBot XprobeBot modified the milestones: v0.11.3, v0.11.4, v0.12.0, v0.12.1 May 31, 2024
@XprobeBot XprobeBot modified the milestones: v0.12.1, v0.12.2 Jun 14, 2024
@XprobeBot XprobeBot removed this from the v0.12.2 milestone Jun 28, 2024
@XprobeBot XprobeBot modified the milestones: v0.12.4, v0.13.0, v0.13.1 Jun 28, 2024
@XprobeBot XprobeBot modified the milestones: v0.13.1, v0.13.2 Jul 12, 2024
@xxWeiDG
Copy link

xxWeiDG commented Jul 15, 2024

这个问题现在还是存在,您那解决了吗

@ybsbbw
Copy link
Author

ybsbbw commented Jul 15, 2024 via email

@xxWeiDG
Copy link

xxWeiDG commented Jul 15, 2024

已经解决了-------- 原始邮件 --------发件人: xxWeiDG @.>日期: 2024年7月15日周一 15:08收件人: xorbitsai/inference @.>抄送: ybsbbw @.>, Author @.>主 题: Re: [xorbitsai/inference] max_tokens设置似乎出现异常 (Issue #1060) 这个问题现在还是存在,您那解决了吗 —Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you authored the thread.Message ID: @.***>

请教一下,您是怎么解决的呢,好像vllm有问题

@jony4
Copy link

jony4 commented Jul 15, 2024

image 这么处理试试

已经解决了-------- 原始邮件 --------发件人: xxWeiDG @.>日期: 2024年7月15日周一 15:08收件人: xorbitsai/inference _@**._>抄送: ybsbbw _@.>, Author @._>主 题: Re: [xorbitsai/inference] max_tokens设置似乎出现异常 (Issue #1060) 这个问题现在还是存在,您那解决了吗 —Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you authored the thread.Message ID: _@_.*>

请教一下,您是怎么解决的呢,好像vllm有问题

@ybsbbw ybsbbw closed this as completed Jul 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants