v1.0.5 - Initial Release

Multimodal prompt generation nodes for ComfyUI with local Qwen-VL GGUF support.

What's Included

Vision LLM Node - Local GGUF models (Qwen2.5-VL, Qwen3-VL) with multi-image input
Qwen Image Edit - Image editing prompt optimization (local + cloud API)
Wan Video Generator - T2V/I2V prompt enhancement for Wan2.2

Key Features

✅ Multi-image batch input support
✅ CPU/GPU device selection
✅ 5 style presets (raw, default, detailed, concise, creative)
✅ Auto-detect mmproj for Qwen3-VL
✅ Optimized for Chinese-language prompts
✅ GPL-3.0 licensed with proper attribution
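
To illustrate what "auto-detect mmproj" could mean in practice, here is a minimal sketch. It assumes the multimodal projector is a GGUF file stored in the same folder as the main model, with "mmproj" somewhere in its filename. The function name `find_mmproj` and the example filenames are hypothetical and are not taken from this project's code; the actual detection logic in the nodes may differ.

```python
from pathlib import Path
from typing import Optional


def find_mmproj(model_path: str) -> Optional[Path]:
    """Return a sibling GGUF file whose name contains 'mmproj', or None.

    Hypothetical helper: assumes the projector sits next to the model file.
    """
    model = Path(model_path)
    # Collect every .gguf file in the model's folder that looks like a projector.
    candidates = sorted(
        p for p in model.parent.glob("*.gguf")
        if "mmproj" in p.name.lower() and p != model
    )
    # Pick the first match in alphabetical order so the result is deterministic.
    return candidates[0] if candidates else None
```

Under these assumptions, calling `find_mmproj("models/Qwen3-VL-4B-Q4_K_M.gguf")` (a made-up path) would return the projector file in `models/` if one exists, and `None` otherwise, in which case a path would have to be supplied manually.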

Installation

cd ComfyUI/custom_nodes
git clone https://github.com/kantan-kanto/ComfyUI-MultiModal-Prompt-Nodes.git
cd ComfyUI-MultiModal-Prompt-Nodes
pip install -r requirements.txt

Full documentation: README.md
