issue/341 - support internlm3 model #342

Merged
wooway777 merged 1 commit into InfiniTensor:main from rubik-hua:issue/341
May 6, 2026

Conversation

rubik-hua commented Apr 30, 2026

Adds support for the internlm3 model.

Screenshot of the test_infer.py run:
image
The server was started with the following arguments:
python python/infinilm/server/inference_server.py --device nvidia --model=/data/rubik/models/internlm3-8b-instruct/ --max-new-tokens=100 --max-batch-size=32 --tp=1 --temperature=1.0 --top-p=0.8 --top-k=1 --enable-paged-attn --cache-type=paged --enable-graph --attn=flash-attn
Startup screenshots:
image
image
Benchmark client output screenshots:
image
image
image

rubik-hua requested a review from a team April 30, 2026 17:14
Comment thread csrc/config/config_factory.cpp Outdated
Comment thread csrc/engine/rank_worker.cpp Outdated
Comment thread csrc/models/internlm3/internlm3_for_causal_lm.hpp
Comment thread csrc/models/internlm3/internlm3_for_causal_lm.cpp Outdated

pengcheng888 (Collaborator) left a comment

(1) Please revise the code and then provide updated test screenshots; (2) please add screenshots of the server test.

Comment thread csrc/models/internlm3/internlm3_for_causal_lm.cpp Outdated
pengcheng888 linked an issue that may be closed by this pull request May 6, 2026
wooway777 (Collaborator) commented May 6, 2026

(1) Please revise the code and then provide updated test screenshots; (2) please add screenshots of the server test.

image image

Comment thread csrc/config/model_config.cpp

pengcheng888 (Collaborator) left a comment

This PR will be merged into the main branch. I suggest first squashing the two commits into one and pushing again.

wooway777 merged commit 064f226 into InfiniTensor:main May 6, 2026
wooway777 (Collaborator)

Thank you very much.

Development

Successfully merging this pull request may close these issues.

[DEV] Add internlm3 model support

3 participants