v1.0.5 - Initial Release
v1.0.5 - Initial Release
Multimodal prompt generation nodes for ComfyUI with local Qwen-VL GGUF support.
What's Included
- Vision LLM Node - Local GGUF models (Qwen2.5-VL, Qwen3-VL) with multi-image input
- Qwen Image Edit - Image editing prompt optimization (local + cloud API)
- Wan Video Generator - T2V/I2V prompt enhancement for Wan2.2
Key Features
✅ Multi-image batch input support
✅ CPU/GPU device selection
✅ 5 style presets (raw, default, detailed, concise, creative)
✅ Auto-detect mmproj for Qwen3-VL
✅ Optimized for Chinese language prompts
✅ GPL-3.0 licensed with proper attribution
Installation
cd ComfyUI/custom_nodes
git clone https://github.com/kantan-kanto/ComfyUI-MultiModal-Prompt-Nodes.git
cd ComfyUI-MultiModal-Prompt-Nodes
pip install -r requirements.txtFull documentation: README.md