add Qwen Image Edit support #877

leejet · 2025-10-10T14:10:55Z

Qwen Image Edit

.\bin\Release\sd.exe --diffusion-model  ..\..\ComfyUI\models\diffusion_models\Qwen_Image_Edit-Q8_0.gguf --vae ..\..\ComfyUI\models\vae\qwen_image_vae.safetensors  --qwen2vl ..\..\ComfyUI\models\text_encoders\qwen_2.5_vl_7b.safetensors --cfg-scale 2.5 --sampling-method euler -v --offload-to-cpu --diffusion-fa --flow-shift 3 -r ..\assets\flux\flux1-dev-q8_0.png -p "change 'flux.cpp' to 'edit.cpp'" --seed 1118877715456453

Qwen Image Edit 2509

.\bin\Release\sd.exe --diffusion-model  ..\..\ComfyUI\models\diffusion_models\Qwen-Image-Edit-2509-Q8_0.gguf --vae ..\..\ComfyUI\models\vae\qwen_image_vae.safetensors  --qwen2vl ..\..\ComfyUI\models\text_encoders\qwen_2.5_vl_7b.safetensors --cfg-scale 2.5 --sampling-method euler -v --offload-to-cpu --diffusion-fa --flow-shift 3 -r .\qwen-pose2.png -r .\replicate-prediction-2rq8q6nrg5rmc0csex6818jzk8.jpeg -p "The woman in image 2 adopts the pose from image 1" -H 1024 -W 1024

image 1:

image 2:

result:

#851 (comment)

LostRuins · 2025-10-11T04:09:00Z

Seems to work fine on vulkan.

Edit: Running multiple generations on the same instance causes issues.
I get conditioner.hpp:1558: GGML_ASSERT(hidden_states->ne[1] > prompt_template_encode_start_idx)
I think this can be fixed by resetting prompt_template_encode_start_idx back to 34.

The quality is... weird. It seems a lot less coherent than the reference implementation. For simple tasks like background removal it's fine, but anything else seems off. Are we supposed to ensure the input reference image and output image is exactly the same size?

leejet · 2025-10-11T15:03:36Z

I get conditioner.hpp:1558: GGML_ASSERT(hidden_states->ne[1] > prompt_template_encode_start_idx)
I think this can be fixed by resetting prompt_template_encode_start_idx back to 34.

@LostRuins The Qwen image edit model uses a different system prompt, so it requires a different prompt_template_encode_start_idx. Can you share the detailed output? In theory, this issue shouldn’t be triggered.

The quality is... weird. It seems a lot less coherent than the reference implementation. For simple tasks like background removal it's fine, but anything else seems off.

Can you give an example?

Are we supposed to ensure the input reference image and output image is exactly the same size?

That’s not necessary — the Qwen image edit pipeline will automatically resize the reference image to an appropriate size.

LostRuins · 2025-10-11T15:29:38Z

In theory, this issue shouldn’t be triggered.

It will not be triggered in CLI, but in server mode it can be, because you initialize the Conditioner once on model load

struct Qwen2_5_VLCLIPEmbedder : public Conditioner {
    Qwen::Qwen2Tokenizer tokenizer;
    std::shared_ptr<Qwen::Qwen2_5_VLRunner> qwenvl;
    int prompt_template_encode_start_idx = 34;

later you overwrite it, but never reset it back if it is reused without a ref image later

    SDCondition get_learned_condition(ggml_context* work_ctx,
                                      int n_threads,
                                      const ConditionerParams& conditioner_params) {
        std::string prompt;
        std::vector<std::pair<int, ggml_tensor*>> image_embeds;
        size_t system_prompt_length = 0;
        if (qwenvl->enable_vision && conditioner_params.ref_images.size() > 0) {
            LOG_INFO("QwenImageEditPlusPipeline");
            prompt_template_encode_start_idx = 64;                            //this is permanent!!

this is a simple fix:

    SDCondition get_learned_condition(ggml_context* work_ctx,
                                      int n_threads,
                                      const ConditionerParams& conditioner_params) {
        std::string prompt;
        std::vector<std::pair<int, ggml_tensor*>> image_embeds;
        size_t system_prompt_length = 0;
        prompt_template_encode_start_idx = 34;                //reset it back in case the user removes their reference images.
        if (qwenvl->enable_vision && conditioner_params.ref_images.size() > 0) {
            LOG_INFO("QwenImageEditPlusPipeline");
            prompt_template_encode_start_idx = 64;

Can you give an example?

Sure, the below was done with 20 steps on Qwen_Image_Edit-Q4_K_S.gguf

Prompt 1: Remove the background

Result 1:

Prompt 2: Change the hair color to blue and add a cat

Result 2:

In each case I seem to be losing a bunch of quality and detail compared to the source. It's hard to explain exactly what I mean but hopefully the pictures make sense.

leejet · 2025-10-11T16:44:40Z

The result of q8_0 looks good.

Prompt 1: Remove the background

Prompt 2: Change the hair color to blue and add a cat

leejet · 2025-10-12T09:37:30Z

The results of q4_k_s also look good now.

 .\bin\Release\sd.exe --diffusion-model  ..\..\ComfyUI\models\diffusion_models\Qwen-Image-Edit-2509-Q4_K_S.gguf --vae ..\..\ComfyUI\models\vae\qwen_image_vae.safetensors  --qwen2vl ..\..\ComfyUI\models\text_encoders\Qwen2.5-VL-7B-Instruct-Q8_0.gguf --qwen2vl_vision ..\..\ComfyUI\models\text_encoders\Qwen2.5-VL-7B-Instruct.mmproj-Q8_0.gguf --cfg-scale 2.5 --sampling-method euler -v --offload-to-cpu --diffusion-fa --flow-shift 3 -r girl.png -p "Remove the background"

wbruna · 2025-10-12T11:24:54Z

Just confirming the Pruning models work fine with this branch. I only noticed very small image changes between this branch and the qwen_edit + Pruning PR.

LostRuins · 2025-10-12T12:24:52Z

Just a matter of curiosity @leejet , how did you arrive at a value of 1/128.f for the precision fix scaler for qwen (and also why is it 1/32 for the t5 and to_add_out)?

leejet · 2025-10-12T13:43:16Z

Just a matter of curiosity @leejet , how did you arrive at a value of 1/128.f for the precision fix scaler for qwen (and also why is it 1/32 for the t5 and to_add_out)?

The scaling value was determined through testing. I tested with different prompts and tried to keep the scaling value as small as possible while ensuring the issue was fixed.

LostRuins · 2025-10-12T14:44:04Z

much better now!

The quality has improved a lot after the fixes

leejet added 19 commits September 20, 2025 14:05

add qwen tokenizer

f88daa5

add qwen2.5 vl support

fe4e731

mv qwen.hpp -> qwenvl.hpp

d8d4c26

add qwen image model

d232509

add qwen image t2i pipeline

cf19c6e

fix qwen image flash attn

477911f

add qwen image i2i pipline

feb0279

change encoding of vocab_qwen.hpp to utf8

5af0bb0

Merge branch 'master' into qwen_image

a8d3aa0

fix get_first_stage_encoding

a3a2b2d

Merge branch 'master' into qwen_image

178a415

Merge branch 'master' into qwen_image

94f4f29

add ref latent support for qwen image

4e48e6b

optimize clip_preprocess and fix get_first_stage_encoding

95cae28

add qwen2vl vit support

58e81ad

add qwen image edit support

40752b6

fix qwen image edit pipeline

887055e

add mmproj file support

9fa817f

support dynamic number of Qwen image transformer blocks

a123e25

leejet mentioned this pull request Oct 10, 2025

add Qwen Image support #851

Merged

leejet added 4 commits October 10, 2025 22:29

revert Rope::gen_qwen_image_ids

70654d0

Merge branch 'master' into qwen_image

d19d4a5

apply jeffbolz f32 patch

6ea2a75

#851 (comment)

Merge branch 'qwen_image' into qwen_image_edit

b769da2

leejet added 2 commits October 12, 2025 00:51

set prompt_template_encode_start_idx every time

47c0f8e

fix the issue that occurs when using CUDA with k-quants weights

98d6e71

leejet added 4 commits October 12, 2025 16:36

optimize the handling of the FeedForward precision fix

cc064a0

Merge branch 'qwen_image' into qwen_image_edit

0741f14

to_add_out precision fix

7519e2f

Merge branch 'qwen_image' into qwen_image_edit

b4b5b4c

leejet added 2 commits October 12, 2025 18:11

update docs

d21d1aa

T5DenseGatedActDense precision fix

ca14940

leejet mentioned this pull request Oct 12, 2025

add support for Qwen Image Pruning #874

Closed

leejet added 3 commits October 12, 2025 23:54

Merge branch 'master' into t5_fix

74e020e

remove dup line

17f0125

Merge branch 't5_fix' into qwen_image_edit

162d5ce

leejet changed the base branch from qwen_image to master October 12, 2025 16:06

leejet added 2 commits October 13, 2025 23:02

to_out.0 precision fix

4edc3ad

update docs

c47affc

leejet merged commit 2e9242e into master Oct 13, 2025
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add Qwen Image Edit support #877

add Qwen Image Edit support #877

leejet commented Oct 10, 2025 •

edited

Loading

Uh oh!

LostRuins commented Oct 11, 2025 •

edited

Loading

Uh oh!

leejet commented Oct 11, 2025

Uh oh!

LostRuins commented Oct 11, 2025

Uh oh!

leejet commented Oct 11, 2025

Uh oh!

leejet commented Oct 12, 2025

Uh oh!

wbruna commented Oct 12, 2025

Uh oh!

LostRuins commented Oct 12, 2025

Uh oh!

leejet commented Oct 12, 2025

Uh oh!

LostRuins commented Oct 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

add Qwen Image Edit support #877

add Qwen Image Edit support #877

Conversation

leejet commented Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Qwen Image Edit

Qwen Image Edit 2509

Uh oh!

LostRuins commented Oct 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

leejet commented Oct 11, 2025

Uh oh!

LostRuins commented Oct 11, 2025

Uh oh!

leejet commented Oct 11, 2025

Uh oh!

leejet commented Oct 12, 2025

Uh oh!

wbruna commented Oct 12, 2025

Uh oh!

LostRuins commented Oct 12, 2025

Uh oh!

leejet commented Oct 12, 2025

Uh oh!

LostRuins commented Oct 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

leejet commented Oct 10, 2025 •

edited

Loading

LostRuins commented Oct 11, 2025 •

edited

Loading

LostRuins commented Oct 12, 2025 •

edited

Loading