Skip to content

feat: support multimodal LLM inputs#6967

Merged
c121914yu merged 3 commits into
labring:mainfrom
YYH211:llm_multimodal
May 25, 2026
Merged

feat: support multimodal LLM inputs#6967
c121914yu merged 3 commits into
labring:mainfrom
YYH211:llm_multimodal

Conversation

@YYH211
Copy link
Copy Markdown
Collaborator

@YYH211 YYH211 commented May 22, 2026

  • add model capability config for vision, audio, and video inputs
  • wire multimodal flags through simple app, chat agent, and workflow LLM nodes
  • normalize uploaded files and file links into image/audio/video request parts
  • adapt chat, agent, and toolcall runtime context for multimodal messages
  • add capability tags, i18n entries, design docs, and related tests

@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 22, 2026

Coverage Report

Status Category Percentage Covered / Total
🔵 Lines 13.96% 1140 / 8161
🔵 Statements 13.95% 1195 / 8564
🔵 Functions 12.47% 245 / 1964
🔵 Branches 11.94% 536 / 4489
File CoverageNo changed files found.
Generated in workflow #407 for commit ca1d48f by the Vitest Coverage Report Action

@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 22, 2026

Build Successful - Preview code-sandbox Image for this PR:

ghcr.io/labring/fastgpt-pr:code-sandbox_ca1d48fa90b4744c43234db93a3585a28c9e77c0

🕒 Time: 2026-05-25 18:41:15 (UTC+8)

@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 22, 2026

Build Successful - Preview fastgpt Image for this PR:

ghcr.io/labring/fastgpt-pr:fastgpt_ca1d48fa90b4744c43234db93a3585a28c9e77c0

🕒 Time: 2026-05-25 18:41:20 (UTC+8)

@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 22, 2026

Build Successful - Preview mcp_server Image for this PR:

ghcr.io/labring/fastgpt-pr:mcp_server_ca1d48fa90b4744c43234db93a3585a28c9e77c0

🕒 Time: 2026-05-25 18:42:09 (UTC+8)

@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 22, 2026

Admin Preview Image Ready!

ghcr.io/labring/fastgpt-pr:admin_ca1d48fa90b4744c43234db93a3585a28c9e77c0

🕒 Time: 2026-05-25 18:43:28 (UTC+8)

- add model capability config for vision, audio, and video inputs
- wire multimodal flags through simple app, chat agent, and workflow LLM nodes
- normalize uploaded files and file links into image/audio/video request parts
- adapt chat, agent, and toolcall runtime context for multimodal messages
- add capability tags, i18n entries, design docs, and related tests
Comment thread packages/global/core/ai/llm/utils.ts
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

移动到 account_model.json

Comment thread projects/app/src/pageComponents/account/model/Channel/EditChannelModal.tsx Outdated
Comment thread projects/app/src/pageComponents/account/model/ModelConfigTable.tsx Outdated
@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 25, 2026

Docs Preview Deployed!

🔗 👀 Click here to visit preview

ghcr.io/labring/fastgpt-docs-pr:ca1d48fa90b4744c43234db93a3585a28c9e77c0

🕒 Time: 2026-05-25 18:32:44 (UTC+8)

@c121914yu c121914yu merged commit e97332a into labring:main May 25, 2026
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants