Skip to content

v1.0.5 - Initial Release

Choose a tag to compare

@kantan-kanto kantan-kanto released this 13 Jan 00:42
· 34 commits to main since this release

v1.0.5 - Initial Release

Multimodal prompt generation nodes for ComfyUI with local Qwen-VL GGUF support.

What's Included

  • Vision LLM Node - Local GGUF models (Qwen2.5-VL, Qwen3-VL) with multi-image input
  • Qwen Image Edit - Image editing prompt optimization (local + cloud API)
  • Wan Video Generator - T2V/I2V prompt enhancement for Wan2.2

Key Features

✅ Multi-image batch input support
✅ CPU/GPU device selection
✅ 5 style presets (raw, default, detailed, concise, creative)
✅ Auto-detect mmproj for Qwen3-VL
✅ Optimized for Chinese language prompts
✅ GPL-3.0 licensed with proper attribution

Installation

cd ComfyUI/custom_nodes
git clone https://github.com/kantan-kanto/ComfyUI-MultiModal-Prompt-Nodes.git
cd ComfyUI-MultiModal-Prompt-Nodes
pip install -r requirements.txt

Full documentation: README.md